Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
8876932 | Journal of Theoretical Biology | 2018 | 31 Pages |
Abstract
Results of SVM experiments with and without oversampling gained by Jackknife tests show that oversampling methods have successfully decrease the imbalance of data sets, and the prediction accuracy of each class in each dataset is higher than 88.9%. With comparison with other protein subcellular localization methods, the method in this work reaches the best performance. The overall accuracies of ZD98, CL317 and ZW225 are 93.2%, 96.00% and 92.15% respectively, which are 2.4%, 8.0% and 8.2% higher than the best methods in the comparison. The excellent overall accuracy gained by the proposed method indicates that the feature representation and selection capture useful information of protein sequence and oversampling methods successfully solve the imbalance of sample numbers in SVM classification.
Keywords
Related Topics
Life Sciences
Agricultural and Biological Sciences
Agricultural and Biological Sciences (General)
Authors
Shengli Zhang, Xin Duan,