Article ID Journal Published Year Pages File Type
8876534 Journal of Theoretical Biology 2018 28 Pages PDF
Abstract
Gram-negative bacterial secreted proteins are crucial for bacterial pathogenesis by making bacteria interact with their environments. Therefore, identification of bacterial secreted proteins becomes a significant process for the research of various diseases and the corresponding drugs. In this paper, we develop a feature design model named ACCP-KL-NMF by fusing PSSM-based auto-cross correlation analysis for features extraction and nonnegative matrix factorization algorithm based on Kullback-Leibler divergence for dimensionality reduction. Hence, a 150-dimensional feature vector is constructed on the training set. Then support vector machine is adopted as the classifier, and the most objective jackknife test is chosen for evaluating the accuracy. The ACCP-KL-NMF model yields the approving performance of the overall accuracy on the test set, and also outperforms the other three existing models. The numerical experimental results show that our model is effective and reliable for identification of Gram-negative bacterial secreted protein types. Moreover, it is anticipated that the proposed model could be beneficial for other biology sequence in future research.
Related Topics
Life Sciences Agricultural and Biological Sciences Agricultural and Biological Sciences (General)
Authors
, ,