Balanced ROC analysis (BAROC) protocol for the evaluation of protein similarities

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
1988413	1540394	2008	5 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

ROC, Receiver operating characteristics - ROC، ویژگی های عملکرد گیرنده

موضوعات مرتبط

علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی زیست شیمی

پیش نمایش صفحه اول مقاله

Balanced ROC analysis (BAROC) protocol for the evaluation of protein similarities

چکیده انگلیسی

Identification of problematic protein classes (domain types, protein families) that are difficult to predict from sequence is a key issue in genome annotation. ROC (Receiver Operating Characteristic) analysis is routinely used for the evaluation of protein similarities, however its results – the area under curve (AUC) values – are differentially biased for the various protein classes that are highly different in size. We show the bias can be compensated for by adjusting the length of the top list in a class-dependent fashion, so that the number of negatives within the top list will be equal to (or proportional with) the size of the positive class. Using this balanced protocol the problematic classes can be identified by their AUC values, or by a scatter diagram in which the AUC values are plotted against positive/negative ratio of the top list. The use of likelihood-ratio scoring (Kaján et al, Bioinformatics,22, 2865–2869, 2007) the bias caused by class imbalance can be further decreased.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Biochemical and Biophysical Methods - Volume 70, Issue 6, 24 April 2008, Pages 1210–1214

نویسندگان

Róbert Busa-Fekete, Attila Kertész-Farkas, András Kocsor, Sándor Pongor,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Balanced ROC analysis (BAROC) protocol for the evaluation of protein similarities

دسترسی سریع

ارتباط

English Website