Article code: 1937918
Journal code: 1050728
Publication year: 2007
English article: 6 pages, PDF
English title of the ISI article
Hum-mPLoc: An ensemble classifier for large-scale human protein subcellular location prediction by incorporating samples with multiple sites
Related topics
Life Sciences and Biotechnology; Biochemistry, Genetics and Molecular Biology; Biochemistry
English abstract

Proteins may simultaneously exist at, or move between, two or more different subcellular locations. Proteins with multiple locations or dynamic features of this kind are particularly interesting because they may have special biological functions of interest to investigators in both basic research and drug discovery. For instance, among the 6408 human protein entries that have experimentally observed subcellular location annotations in the Swiss-Prot database (version 50.7, released 19-Sept-2006), 973 (≈15%) have multiple location sites. The total number of human protein entries in the same database (excluding those annotated as “fragment” and those with fewer than 50 amino acids) is 14,370, leaving a gap of 14,370 − 6408 = 7962 entries for which nothing is known about their subcellular locations. Although computational approaches can be used to predict the missing information, all existing methods for predicting human protein subcellular localization have so far been limited to the case of a single location site. To overcome this barrier, a new ensemble classifier, named Hum-mPLoc, was developed that can also handle proteins with multiple location sites. Hum-mPLoc is freely accessible to the public as a web server at http://202.120.37.186/bioinf/hum-multi. In addition, for the convenience of people working in the relevant areas, Hum-mPLoc has been used to predict the subcellular locations of all human protein entries in the Swiss-Prot database that either lack subcellular location annotations or are annotated as uncertain. The large-scale results thus obtained have been deposited in a downloadable Microsoft Excel file named “Tab_Hum-mPLoc.xls”. This file is available at the same website and will be updated twice a year to include new human protein entries and to reflect the continuing development of Hum-mPLoc.
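
The counts quoted in the abstract can be checked with a few lines of arithmetic. The snippet below is only an illustrative sketch: the variable names are hypothetical, and the numbers are copied from the abstract rather than recomputed from Swiss-Prot.

```python
# Counts as quoted in the abstract (Swiss-Prot version 50.7, human entries).
# Variable names are illustrative assumptions, not taken from the paper.
annotated = 6408        # entries with experimentally observed location annotations
multi_site = 973        # of those, entries with multiple location sites
total_entries = 14370   # all human entries, excluding fragments and sequences < 50 aa

# Share of annotated entries that have more than one location site.
multi_fraction = multi_site / annotated

# Entries with no experimentally observed location annotation.
gap = total_entries - annotated

print(f"multi-site fraction: {multi_fraction:.1%}")
print(f"entries without location annotation: {gap}")
```

The fraction rounds to the "≈15%" stated in the abstract, and the difference reproduces the gap of 7962 entries.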

Publisher
Database: Elsevier - ScienceDirect
Journal: Biochemical and Biophysical Research Communications - Volume 355, Issue 4, 20 April 2007, Pages 1006–1011