會議論文

學年 105
學期 1
發表日期 2016-08-15
作品名稱 Deep Web Query Interface Integration Based on Incremental Schema Matching and Merging
作品名稱(其他語言)
著者 Chichang Jou
作品所屬單位
出版者
會議名稱 The 3rd Multidisciplinary International Social Networks Conference on SocialInformatics 2016, Data Science 2016
會議地點 Kean University, New Jersey, USA
摘要 Data hidden inside the deep web are of much higher quality than those in the surface web. Internet users need to fill in query conditions in the HTML query interface and click the submit button to obtain deep web data. Unfortunately, deep web data from one site normally is insufficient for users. Users usually need to integrate information from several deep web sites. It is time-consuming to manually perform form filling for many web sites and to collect their query results. An integrated deep web query interface could help alleviate the above web users’ burdens. One of the key technologies in building such integrated query interface is schema matching and merging. Previous solutions usually perform schema matching and merging separately in a holistic approach by utilizing the statistical information of attributes of the involved schemas. That approach does not take user preference of the web sites into account. We propose new deep web query interface integration (DWQII) methodology based on incremental schema matching and merging. Our matching method is based on string similarity and synonyms of labels. Besides schema matching and merging, our system also automatically transforms query conditions from the integrated query interface into those suitable for individual web sites. Our methodology has the benefit of being able to easily supplement new deep web query interfaces into previously established integrated query interfaces. We design and implement DWQII using object oriented approach. To test DWQII, we integrate nine search interfaces in the books domain. These web sites are collected from the open directory dmoz.org, including Amazon, eBay, and other popular sites. We also conduct query experiments using our integrated query interface to verify feasibility and measure performance of the methodology.
關鍵字 Deep Web;Query Interface Integration;Incremental Schema Matching and Merging;Query Translation
語言 en_US
收錄於
會議性質 國際
校內研討會地點
研討會時間 20160815~20160817
通訊作者 Chichang Jou
國別 USA
公開徵稿
出版型式
出處 Proceedings of The 3rd Multidisciplinary International Social Networks Conference on SocialInformatics 2016, Data Science 2016
相關連結

機構典藏連結 ( http://tkuir.lib.tku.edu.tw:8080/dspace/handle/987654321/108735 )