کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
2788195 1154390 2007 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Prediction of Subcellular Localization of Eukaryotic Proteins Using Position-Specific Profiles and Neural Network with Weighted Inputs
موضوعات مرتبط
علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی زیست شناسی تکاملی
پیش نمایش صفحه اول مقاله
Prediction of Subcellular Localization of Eukaryotic Proteins Using Position-Specific Profiles and Neural Network with Weighted Inputs
چکیده انگلیسی

Subcellular location is one of the key biological characteristics of proteins. Position-specific profiles (PSP) have been introduced as important characteristics of proteins in this article. In this study, to obtain position-specific profiles, the Position Specific Iterative-Basic Local Alignment Search Tool (PSI-BLAST) has been used to search for protein sequences in a database. Position-specific scoring matrices are extracted from the profiles as one class of characteristics. Four-part amino acid compositions and 1st–7th order dipeptide compositions have also been calculated as the other two classes of characteristics. Therefore, twelve characteristic vectors are extracted from each of the protein sequences. Next, the characteristic vectors are weighed by a simple weighing function and inputted into a BP neural network predictor named PSP-Weighted Neural Network (PSP-WNN). The Levenberg-Marquardt algorithm is employed to adjust the weight matrices and thresholds during the network training instead of the error back propagation algorithm. With a jackknife test on the RH2427 dataset, PSP-WNN has achieved a higher overall prediction accuracy of 88.4% rather than the prediction results by the general BP neural network, Markov model, and fuzzy k-nearest neighbors algorithm on this dataset. In addition, the prediction performance of PSP-WNN has been evaluated with a five-fold cross validation test on the PK7579 dataset and the prediction results have been consistently better than those of the previous method on the basis of several support vector machines, using compositions of both amino acids and amino acid pairs. These results indicate that PSP-WNN is a powerful tool for subcellular localization prediction. At the end of the article, influences on prediction accuracy using different weighting proportions among three characteristic vector categories have been discussed. An appropriate proportion is considered by increasing the prediction accuracy.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Genetics and Genomics - Volume 34, Issue 12, December 2007, Pages 1080–1087
نویسندگان
, , ,