کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
402486 676950 2012 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A comparative study on feature reduction approaches in Hindi and Bengali named entity recognition
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
A comparative study on feature reduction approaches in Hindi and Bengali named entity recognition
چکیده انگلیسی

Features used for named entity recognition (NER) are often high dimensional in nature. These cause overfitting when training data is not sufficient. Dimensionality reduction leads to performance enhancement in such situations. There are a number of approaches for dimensionality reduction based on feature selection and feature extraction. In this paper we perform a comprehensive and comparative study on different dimensionality reduction approaches applied to the NER task. To compare the performance of the various approaches we consider two Indian languages namely Hindi and Bengali. NER accuracies achieved in these languages are comparatively poor as yet, primarily due to scarcity of annotated corpus. For both the languages dimensionality reduction is found to improve performance of the classifiers. A Comparative study of the effectiveness of several dimensionality reduction techniques is presented in detail in this paper.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Knowledge-Based Systems - Volume 27, March 2012, Pages 322–332
نویسندگان
, , ,