Conference Paper

Academic year: 102
Semester: 2
Publication date: 2014-06-03
Title: Apply the Dynamic N-gram to Extract the Keywords of Chinese News
Title (other language):
Authors: Ren-Xiang Lin; Heng-Li Yang
Affiliated unit:
Publisher:
Conference name: The 27th International Conference on Industrial, Engineering & Other Applications of Applied Intelligent Systems
Conference location: Kaohsiung, Taiwan
Abstract: The explosive growth of information on the Internet has created a great demand for new and powerful tools to acquire useful information. The first step in retrieving information from Chinese articles is word segmentation. However, two major problems, ambiguity and long words, can affect the accuracy of word segmentation. In this paper, we propose a novel character-based approach, dynamic N-gram (DNG), to deal with these two problems, and we apply it to Chinese news articles to evaluate the accuracy of the N-gram approach. The evaluation results indicate that most readers agreed that the dynamic N-gram approach could extract meaningful keywords. Even across different news categories, the keyword extraction results show no significant difference. The primary contribution of this approach is that dynamic N-gram helps extract the most meaningful keywords from different types of Chinese articles without having to consider the number of grams.
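Keywords:
Language: en_US
Indexed in:
Conference type: International
On-campus seminar location:
Conference dates: 20140603~20140606
Corresponding author:
Country: TWN
Open call for papers:
Publication format:
Source: Modern Advances in Applied Intelligence
Related links: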

Institutional repository link ( http://tkuir.lib.tku.edu.tw:8080/dspace/handle/987654321/128740 )