A comparative study of classifier combination applied to NLP tasks

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
528815	869611	2013	13 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Part-of-speech tagging - برچسب گذاری بخشی از گفتار Text analysis - تجزیه و تحلیل متن Classifier combination - ترکیبی طبقه بندی Natural Language Processing - پردازش زبان‌های طبیعی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو

پیش نمایش صفحه اول مقاله

A comparative study of classifier combination applied to NLP tasks

چکیده انگلیسی

The paper is devoted to a comparative study of classifier combination methods, which have been successfully applied to multiple tasks including Natural Language Processing (NLP) tasks. There is variety of classifier combination techniques and the major difficulty is to choose one that is the best fit for a particular task. In our study we explored the performance of a number of combination methods such as voting, Bayesian merging, behavior knowledge space, bagging, stacking, feature sub-spacing and cascading, for the part-of-speech tagging task using nine corpora in five languages. The results show that some methods that, currently, are not very popular could demonstrate much better performance. In addition, we learned how the corpus size and quality influence the combination methods performance. We also provide the results of applying the classifier combination methods to the other NLP tasks, such as name entity recognition and chunking. We believe that our study is the most exhaustive comparison made with combination methods applied to NLP tasks so far.

► Our NLP bibliographical analysis shows low balance in combination methods usage.
► We experiment with eight combination methods, 13 corpora, four classifiers, and four NLP tasks.
► Scenarios with low quality data or base classifiers can obtain major improvements.
► Classification can also be improved combining different versions of only one classifier.
► Stacking can combine integrating heterogeneous information from multiple sources.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Fusion - Volume 14, Issue 3, July 2013, Pages 255–267

نویسندگان

Fernando Enríquez, Fermín L. Cruz, F. Javier Ortega, Carlos G. Vallejo, José A. Troyano,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

A comparative study of classifier combination applied to NLP tasks

دسترسی سریع

ارتباط

English Website