The extraction of text/graphs from degraded documents | |
---|---|
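Academic year | 92 |
Semester | 1 |
Publication date | 2004-01-05 |
Title | The extraction of text/graphs from degraded documents |
Title (other language) | |
Authors | Yen, Shwu-huey; Chen, Yi-jin; Lin, Hwei-jen; Wang, Chia-zen |
Affiliation | Department of Computer Science and Information Engineering, Tamkang University (淡江大學資訊工程學系) |
Publisher | |
Conference name | The 10th International Multi-Media Modeling Conference (MMM2004) |
Conference location | Brisbane, Australia |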
Abstract | This paper presents a method for improving the quality of degraded documents through noise removal and text enhancement. The histogram of a degraded document is analyzed to find the approximate gray-value ranges of text, graph (i.e., photograph), and background pixels. Once the graph pixels are identified, they are replaced by background pixels. The agent-growing method described by S. H. Yen and M. C. Shih (2000) is then applied to smooth the noisy background, yielding a document in which both text and background are clearly readable. Finally, the graph pixels are restored to produce the final result, so that the degraded document has text of much better quality and any photographs are preserved. Experiments verifying the efficacy of the proposed method and comparisons with some existing techniques are also presented. |
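Keywords | |
Language | en |
Indexed in | EI |
Conference type | International |
On-campus conference location | |
Conference dates | 20040105~20040107 |
Corresponding author | |
Country | AUS |
Open call for papers | |
Publication format | Print |
Source | Proceedings of The 10th International Multi-Media Modeling Conference, pp.181-186 |
Related links |
Institutional repository link ( http://tkuir.lib.tku.edu.tw:8080/dspace/handle/987654321/37760 ) |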
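
The mask, clean, and restore sequence outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function name `enhance` and the thresholds `text_max` and `background_min` are hypothetical, and the thresholds are passed in by hand rather than derived from the histogram as the paper does. It also assumes text is darkest and background is brightest. The agent-growing smoothing step is not described in this record, so a plain flattening of the background to its most frequent gray value stands in for it.

```python
from collections import Counter


def enhance(image, text_max, background_min):
    """Sketch of a mask / clean / restore pass over a grayscale image.

    image: list of rows of gray values (0-255).
    text_max, background_min: hypothetical thresholds (assumptions, not from
    the paper). Values <= text_max count as text, values >= background_min
    as background, and everything in between as graph (photograph) pixels.
    """
    # Histogram of gray values; the most frequent background value is used
    # as the single background level (an assumption of this sketch).
    histogram = Counter(value for row in image for value in row)
    bg_counts = {v: n for v, n in histogram.items() if v >= background_min}
    background = max(bg_counts, key=bg_counts.get) if bg_counts else 255

    # Step 1: remember where the graph pixels are and what they held.
    graph = {
        (r, c): value
        for r, row in enumerate(image)
        for c, value in enumerate(row)
        if text_max < value < background_min
    }

    # Steps 2 and 3: replace graph pixels with the background level and
    # flatten the noisy background. This flattening is only a stand-in for
    # the agent-growing method cited in the abstract.
    cleaned = [
        [value if value <= text_max else background for value in row]
        for row in image
    ]

    # Step 4: put the graph pixels back so photographs are preserved.
    for (r, c), value in graph.items():
        cleaned[r][c] = value
    return cleaned


sample = [
    [250, 20, 248],
    [120, 250, 30],
    [250, 245, 250],
]
result = enhance(sample, text_max=60, background_min=200)
```

In this toy input the noisy background values 248 and 245 are flattened to 250, the text values 20 and 30 are left untouched, and the single graph value 120 is restored after cleaning.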