Article ID Journal Published Year Pages File Type
4496232 Journal of Theoretical Biology 2014 6 Pages PDF
Abstract

•We propose SVM ensemble based transfer learning method for large data analysis.•We validate the effective substitution of homolog GO knowledge for target GO knowledge.•We reduce the risk of model performance overestimation.

Membrane proteins play important roles in molecular trans-membrane transport, ligand–receptor recognition, cell–cell interaction, enzyme catalysis, host immune defense response and infectious disease pathways. Up to present, discriminating membrane proteins remains a challenging problem from the viewpoints of biological experimental determination and computational modeling. This work presents SVM ensemble based transfer learning model for membrane proteins discrimination (SVM-TLM). To reduce the data constraints on computational modeling, this method investigates the effectiveness of transferring the homolog knowledge to the target membrane proteins under the framework of probability weighted ensemble learning. As compared to multiple kernel learning based transfer learning model, the method takes the advantages of sparseness based SVM optimization on large data, thus more computationally efficient for large protein data analysis. The experiments on large membrane protein benchmark dataset show that SVM-TLM achieves significantly better cross validation performance than the baseline model.

Related Topics
Life Sciences Agricultural and Biological Sciences Agricultural and Biological Sciences (General)
Authors
,