教師資料查詢 | 類別: 期刊論文 | 教師: 許輝煌 Hsu Hui-huang (瀏覽個人網頁)

標題:Feature Selection for Identifying Protein Disordered Regions
學年98
學期2
出版(發表)日期2010/04/01
作品名稱Feature Selection for Identifying Protein Disordered Regions
作品名稱(其他語言)
著者Hsu, Hui-Huang; Hsieh, Cheng-Wei
單位淡江大學資訊工程學系
出版者Singapore: World Scientific Publishing Co. Pte. Ltd.
著錄名稱、卷期、頁數Biomedical Engineering: Applications, Basis and Communications 22(2), pp.119-125
摘要Determining the structure of a protein is not an easy task, which usually involved a time-consuming and costly process in the web lab. Using computational methods to predict a protein's tertiary structure from its primary structure (the amino acid sequence) is desirable. Disordered regions are segments of a protein that do not have a fixed conformation, which makes the structure prediction harder. Also, these disordered regions are functionally important for a protein. In this research, we would like to identify such regions with a focus on selecting a proper feature set. Three feature selection methods, namely F-score, information gain (IG), and k-medoids clustering, are used for feature selection. The support vector machine (SVM) is then used for classification. The results show that the classification accuracy can be raised with a smaller feature set. The k-medoids clustering feature selection can reduce the number of features from 440 to 150 and improve the accuracy from 84.66 to 86.81% in five-fold cross validation. It also has a more stable performance than F-score and IG.
關鍵字Disordered protein region; k-Medoids clustering; Feature selection; Proteomics
語言英文
ISSN1016-2372;1793-7132
期刊性質國外
收錄於SCI;EI
產學合作
通訊作者Hsu, Hui-Huang
審稿制度
國別新加坡
公開徵稿
出版型式紙本
相關連結
Google+ 推薦功能,讓全世界都能看到您的推薦!