Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
531149 | Pattern Recognition | 2006 | 8 Pages | |
Abstract
A protein can be annotated by classifying it into a known protein family, from which its functional and structural features can be inferred. This paper presents a new method for classifying protein sequences based on the hydropathy blocks that occur in them. First, a fixed-dimensional feature vector is generated for each protein sequence from the frequencies of the hydropathy blocks occurring in the sequence. Then, a support vector machine (SVM) classifier is used to assign the protein sequences to known protein families. The experimental results show that proteins belonging to the same family or subfamily can be identified using features generated from the hydropathy blocks.
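The feature-generation step can be illustrated with a minimal sketch. The abstract does not define a hydropathy block, so everything here is an assumption rather than the authors' method: residues are mapped to three hydropathy classes using the Kyte-Doolittle scale with illustrative thresholds, a "block" is taken to be a window of `k` consecutive classes, and the hypothetical helper `block_features` returns the relative frequency of every possible block, which gives a vector of fixed length `3**k` regardless of sequence length.

```python
from collections import Counter
from itertools import product

# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_class(residue):
    """Map a residue to 'H' (hydrophobic), 'P' (hydrophilic) or 'N' (neutral).

    The cut-offs 1.0 and -3.0 are assumptions chosen for illustration only.
    """
    value = KYTE_DOOLITTLE[residue]
    if value >= 1.0:
        return "H"
    if value <= -3.0:
        return "P"
    return "N"


def block_features(sequence, k=3):
    """Return relative frequencies of all length-k hydropathy blocks.

    Non-standard residues are skipped. The result always has 3**k entries,
    ordered as itertools.product("HNP", repeat=k).
    """
    classes = [hydropathy_class(r) for r in sequence.upper() if r in KYTE_DOOLITTLE]
    blocks = ["".join(classes[i:i + k]) for i in range(len(classes) - k + 1)]
    counts = Counter(blocks)
    total = len(blocks)
    vocabulary = ["".join(p) for p in product("HNP", repeat=k)]
    return [counts[b] / total if total else 0.0 for b in vocabulary]
```

Vectors produced this way all have the same dimension, so they could be passed to any SVM implementation (for example scikit-learn's `SVC`) together with the family labels; the choice of kernel and parameters is not stated in the abstract and would have to be tuned separately.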
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Vision and Pattern Recognition
Authors
De-Shuang Huang, Xing-Ming Zhao, Guang-Bin Huang, Yiu-Ming Cheung