کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
4499856 | 1624006 | 2016 | 8 صفحه PDF | دانلود رایگان |
• We model the relationship between protein interaction network and the function class network.
• Transductive learning is applied in the protein annotation process to facilitate simultaneous multi-label annotation.
• Quantitative and qualitative analysis are done to assess the biological significance of the results.
• Data fusion of results from bi-relation graphs is integrated using ensemble classifier for optimization of the results.
One of the challenging tasks of bioinformatics is to predict more accurate and confident protein functions from genomics and proteomics datasets. Computational approaches use a variety of high throughput experimental data, such as protein-protein interaction (PPI), protein sequences and phylogenetic profiles, to predict protein functions. This paper presents a method that uses transductive multi-label learning algorithm by integrating multiple data sources for classification. Multiple proteomics datasets are integrated to make inferences about functions of unknown proteins and use a directed bi-relational graph to assign labels to unannotated proteins. Our method, bi-relational graph based transductive multi-label function annotation (Bi-TMF) uses functional correlation and topological PPI network properties on both the training and testing datasets to predict protein functions through data fusion of the individual kernel result. The main purpose of our proposed method is to enhance the performance of classifier integration for protein function prediction algorithms. Experimental results demonstrate the effectiveness and efficiency of Bi-TMF on multi-sources datasets in yeast, human and mouse benchmarks. Bi-TMF outperforms other recently proposed methods.
Figure optionsDownload as PowerPoint slide
Journal: Mathematical Biosciences - Volume 274, April 2016, Pages 25–32