کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
388003 660915 2009 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Uncertainty sampling-based active learning for protein–protein interaction extraction from biomedical literature
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
Uncertainty sampling-based active learning for protein–protein interaction extraction from biomedical literature
چکیده انگلیسی

Protein–protein interaction (PPI) extraction from biomedical literature has become a research focus with the rapid growth of the number of biomedical literature. Many methods have been proposed for PPI extraction including natural language processing techniques and machine learning approaches. One problem of applying machine learning approaches to PPI extraction is that large amounts of data are available but the cost of correctly labeling it prohibits its use. To reduce the amount of human labeling effort while maintaining the PPI extraction performance, the paper presents an uncertainty sampling-based method of active learning (USAL) in a lexical feature-based SVM model to tag the most informative unlabeled samples. In addition, some specific samples are ignored to speed up learning process while maintaining desired accuracy. The experiment results on AIMED and CB corpora show that our method can reduce the labeling by 40% and 20%, respectively, without degrading the performance.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 36, Issue 7, September 2009, Pages 10344–10350
نویسندگان
, , ,