کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
1931612 1050558 2010 4 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Prediction of protein subcellular localization by weighted gene ontology terms
موضوعات مرتبط
علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی زیست شیمی
پیش نمایش صفحه اول مقاله
Prediction of protein subcellular localization by weighted gene ontology terms
چکیده انگلیسی

We develop a new weighting approach of gene ontology (GO) terms for predicting protein subcellular localization. The weights of individual GO terms, corresponding to their contribution to the prediction algorithm, are determined by the term-weighting methods used in text categorization. We evaluate several term-weighting methods, which are based on inverse document frequency, information gain, gain ratio, odds ratio, and chi-square and its variants. Additionally, we propose a new term-weighting method based on the logarithmic transformation of chi-square. The proposed term-weighting method performs better than other term-weighting methods, and also outperforms state-of-the-art subcellular prediction methods. Our proposed method achieves 98.1%, 99.3%, 98.1%, 98.1%, and 95.9% overall accuracies for the animal BaCelLo independent dataset (IDS), fungal BaCelLo IDS, animal Höglund IDS, fungal Höglund IDS, and PLOC dataset, respectively. Furthermore, the close correlation between high-weighted GO terms and subcellular localizations suggests that our proposed method appropriately weights GO terms according to their relevance to the localizations.

Research highlights
► A significantly improved performance using weighted gene ontology terms.
► Weighting of gene ontology terms by utilizing their discriminative capability.
► Identifying discriminative features of a protein for subcellular location.
► Robustness test for gene ontology terms as an information source for prediction.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Biochemical and Biophysical Research Communications - Volume 399, Issue 3, 27 August 2010, Pages 402–405
نویسندگان
,