Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
6920417 | Computers in Biology and Medicine | 2018 | 8 Pages |
Abstract
In this paper, we first launched an integrated approach, known as the five-step prediction method (FSPM), to solve the problem effectively by (1) using one-sided selection (OSS) to deal with imbalanced data, (2) extracting binary features from protein sequences, (3) incorporating binary relevance, classifier chains and multi-class transformation methods to simplify multi-label problems, (4) constructing different classifiers, and (5) implementing cross-validation and evaluating these classifiers. In 10-fold cross-validation, FSPM achieved an accuracy of 61.49% and an absolute-true rate of 60.17%. The results showed that FSPM is accurate and could be used as a powerful engine in multi-label systems. We also conducted a variety of statistical analyses of the predicted results to discuss the biological functions of lysine acetylation and sumoylation.
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science Applications
Authors
Yingxi Yang, Hui Wang, Jun Ding, Yan Xu,