會議論文
| 學年 | 102 |
|---|---|
| 學期 | 2 |
| 發表日期 | 2014-06-03 |
| 作品名稱 | Apply the Dynamic N-gram to Extract the Keywords of Chinese News |
| 作品名稱(其他語言) | |
| 著者 | Ren-Xiang Lin; Heng-Li Yang |
| 作品所屬單位 | |
| 出版者 | |
| 會議名稱 | The 27th International Conference on Industrial, Engineering & Other Applications of Applied Intelligent Systems |
| 會議地點 | Kaohsiung, Taiwan |
| 摘要 | The explosive growth of information on the Internet has created a great demand for new and powerful tools to acquire useful information. The first step to retrieve information form Chinese article is word segmentation. But there are two major segmentation problems that might affect the accuracy of word segmentation performance, ambiguity and long words. In this paper, we propose a novel character-based approach, namely, dynamic N-gram (DNG) to deal with the two above problems of word segmentation and apply it to Chinese news articles to evaluate the accuracy of N-gram. The evaluation result indicated most of the readers agreed that dynamic N-gram approach could extract meaningful keywords. Even in different news categories, the keywords extraction results still have no significant difference. The primary contribution of this approach is that dynamic N-gram helps us to extract the most meaningful keywords in different types of Chinese articles without considering the number of grams. |
| 關鍵字 | |
| 語言 | en_US |
| 收錄於 | |
| 會議性質 | 國際 |
| 校內研討會地點 | 無 |
| 研討會時間 | 20140603~20140606 |
| 通訊作者 | |
| 國別 | TWN |
| 公開徵稿 | |
| 出版型式 | |
| 出處 | Modern Advances in Applied Intelligence |
| 相關連結 |
機構典藏連結 ( http://tkuir.lib.tku.edu.tw:8080/dspace/handle/987654321/128740 ) |