Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
4497788 | Journal of Theoretical Biology | 2009 | 4 Pages |
A key goal of the post-genomic era is to determine protein functions. In this paper, we proposed a global encoding method of protein sequence (GE) to descript global information of amino acid sequence, and then assign protein functional class using machine learning methods nearest neighbor algorithm (NNA). We predicted the function of 1818 Saccharomyces cerevisiae proteins which was used in Vazquez's global optimization method (GOM) except eight proteins which cannot get from the database now or whose sequence length is too short. Using our approach, the computed accuracy is better than Vazquez's global optimization method (GOM) in some cases. The experiment results show that our new method is efficient to predict functional class of unknown proteins.