A Comprehensive System for Identifying Internal Repeat Substructures of Proteins
Academic year: 98 (ROC calendar, 2009-2010)
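Semester: 2
Publication date: 2010-02-15
Title: A Comprehensive System for Identifying Internal Repeat Substructures of Proteins
Title (other language):
Authors: Kao, Hua-ying; Shih, Tsang-huang; Pai, Tun-wen; Lu, Ming-da; Hsu, Hui-huang
Affiliation: Department of Computer Science and Information Engineering, Tamkang University
Publisher: IEEE Computer Society
Conference name:
Conference location: Krakow, Poland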
Abstract: Repetitive substructures within a protein play an important role in understanding protein folding and stability, biological function, and genome evolution. About 25% of all proteins in eukaryotic species contain repeat structures, and most of them do not yet have resolved structural information. Therefore, this study aimed to design a comprehensive system for identifying internal repeats from either protein sequence or structural information. We curated a set of internal repeat units as a benchmark dataset for performing both sequence and structural alignment against the query sequence or structure. In addition to traditional BLAST algorithms on amino acid sequences and optimal structural superposition approaches on structures, a novel method that employs predicted secondary structure element information for internal repeat identification was proposed. Sequences were first transformed into Length Encoded Secondary Structure (LESS) profiles and then subjected to autocorrelation analysis. Preliminary experimental results show that the developed Internal Repeat Identification System (IRIS) can successfully identify internal repeats in known protein structures. The web system is freely available at http://iris.cs.ntou.edu.tw/.
Keywords: Length Encoded Secondary Structure; internal repeat unit; secondary structure element; sequence alignment; solenoid; structure alignment
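Language: en
Indexed in:
Conference type: International
On-campus conference venue:
Conference dates: 2010-02-15 to 2010-02-18
Corresponding author:
Country: POL
Open call for papers: Y
Publication format:
Source: Proceedings of the Fourth International Conference on Complex, Intelligent and Software Intensive Systems (CISIS 2010), pp. 689-693
Related links: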

Institutional repository link: http://tkuir.lib.tku.edu.tw:8080/dspace/handle/987654321/75805
