کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
9471092 1320066 2005 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Percolation of annotation errors through hierarchically structured protein sequence databases
موضوعات مرتبط
علوم زیستی و بیوفناوری علوم کشاورزی و بیولوژیک علوم کشاورزی و بیولوژیک (عمومی)
پیش نمایش صفحه اول مقاله
Percolation of annotation errors through hierarchically structured protein sequence databases
چکیده انگلیسی
Databases of protein sequences have grown rapidly in recent years as a result of genome sequencing projects. Annotating protein sequences with descriptions of their biological function ideally requires careful experimentation, but this work lags far behind. Instead, biological function is often imputed by copying annotations from similar protein sequences. This gives rise to annotation errors, and more seriously, to chains of misannotation. [Percolation of annotation errors in a database of protein sequences (2002)] developed a probabilistic framework for exploring the consequences of this percolation of errors through protein databases, and applied their theory to a simple database model. Here we apply the theory to hierarchically structured protein sequence databases, and draw conclusions about database quality at different levels of the hierarchy.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Mathematical Biosciences - Volume 193, Issue 2, February 2005, Pages 223-234
نویسندگان
, , , , ,