Article code: 1182990
Journal code: 1491806
Publication year: 2014
English article: 4 pages (PDF)
English title of the ISI article
The importance of recognizing and reporting sequence database contamination for proteomics
Persian translation of the title
The importance of detecting and reporting contamination in proteomics databases
Keywords
Database, proteomics, metaproteomics, contamination, BLAST analysis, curation
Related topics
Engineering and basic sciences, Chemistry, Analytical chemistry
English abstract


• Homology-based proteomics, proteogenomics, and metaproteomics rely on large protein sequence databases.
• Not all entries in a sequence database are of equal quality.
• We illustrate database contamination with two examples: the bacterium Enterococcus gallinarum EGD-AAK12 and the insect Ceratitis capitata.
• We encourage database users to contribute to the overall quality of databases.

Advances in genome sequencing have made proteomic experiments more successful than ever. However, not all entries in a sequence database are of equal quality. Genome sequences are contaminated more often than is generally acknowledged, and such contamination affects homology-based proteomic, proteogenomic, and metaproteomic results. We highlight two entries in the National Center for Biotechnology Information non-redundant database (NCBInr) that are likely contaminated: the bacterium Enterococcus gallinarum EGD-AAK12 and the insect Ceratitis capitata. We hope to encourage users of this and other databases to critically evaluate submitted sequences and to contribute to overall database quality by reporting potential errors whenever possible.


Publisher
Database: Elsevier - ScienceDirect
Journal: EuPA Open Proteomics - Volume 3, June 2014, Pages 246–249