کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
8409569 1545107 2018 17 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Redundancy in two major compound databases
ترجمه فارسی عنوان
افزونگی در دو پایگاه داده مرکب اصلی
موضوعات مرتبط
علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی بیوتکنولوژی یا زیست‌فناوری
چکیده انگلیسی
Public repositories of compounds and activity data are of prime importance for pharmaceutical research in academic and industrial settings. Major databases have evolved over the years. Their growth is accompanied by an increasing tendency toward data sharing. This is a positive development but not without potential problems. Using ChEMBL and PubChem as examples, we show that crosstalk between databases also leads to substantial data redundancy that might not be obvious. Redundancy is an important issue because it biases data analysis and knowledge extraction and leads to inflated views of available compounds, assays and activity data. Going forward it will be important to further refine data exchange and deposition criteria and make redundancy as transparent as possible.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Drug Discovery Today - Volume 23, Issue 6, June 2018, Pages 1183-1186
نویسندگان
, , , , ,