Article code: 6853179
Journal code: 658315
Publication year: 2016
English article: 19 pages, PDF
Full-text version: free download
English title of the ISI article
Exploiting meta features for dependency parsing and part-of-speech tagging
Persian translation of the title
بهره برداری از ویژگی های متا برای تجزیه وابستگی و برچسب زدن از بخشی از گفتار
Keywords
Dependency parsing; natural language processing; meta features; part-of-speech tagging; semi-supervised approach
Related topics
Engineering and Basic Sciences; Computer Engineering; Artificial Intelligence
English abstract
In recent years, discriminative methods have achieved much progress in natural language processing tasks, such as parsing, part-of-speech tagging, and word segmentation. For these methods, conventional features in a relatively high dimensional feature space may suffer from sparseness and thus exhibit less discriminative power on unseen data. This article presents a learning framework of feature transformation, addressing the sparseness problem by transforming sparse conventional base features into less sparse high-level features (i.e. meta features) with the help of a large amount of automatically annotated data. The meta features are derived by bucketing similar base features according to the frequency in large data, and used together with base features in our final system. We apply the framework to part-of-speech tagging and dependency parsing. Experimental results show that our systems perform better than the baseline systems in both tasks on standard evaluation. For the dependency parsing task, our parsers achieve state-of-the-art accuracy on the Chinese data and comparable accuracy with the best known systems on the English data. Further analysis indicates that our proposed approach is effective in processing unseen data and features.
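The bucketing step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract only states that similar base features are bucketed by their frequency in large automatically annotated data. The bucket fractions (`high_frac`, `mid_frac`), the labels `HIGH`, `MID`, `LOW`, and `UNSEEN`, the function names, and the template name used below are all assumptions made for illustration.

```python
from collections import Counter


def build_bucket_map(feature_counts, high_frac=0.1, mid_frac=0.2):
    """Assign each base feature a frequency bucket.

    feature_counts: Counter of base features collected from a large,
    automatically annotated corpus. The fractions are hypothetical
    defaults, not values taken from the abstract.
    """
    # Rank features from most to least frequent; ties broken by name
    # so the result is deterministic.
    ranked = sorted(feature_counts, key=lambda f: (-feature_counts[f], f))
    n = len(ranked)
    high_end = int(n * high_frac)
    mid_end = high_end + int(n * mid_frac)

    bucket_map = {}
    for rank, feat in enumerate(ranked):
        if rank < high_end:
            bucket_map[feat] = "HIGH"
        elif rank < mid_end:
            bucket_map[feat] = "MID"
        else:
            bucket_map[feat] = "LOW"
    return bucket_map


def meta_feature(base_feature, template, bucket_map):
    """Turn a sparse base feature into a coarser meta feature.

    Base features never observed in the large corpus fall into a
    separate (assumed) UNSEEN bucket.
    """
    label = bucket_map.get(base_feature, "UNSEEN")
    return f"{template}:{label}"


def extract_features(base_features, template, bucket_map):
    """Return base features together with their meta features,
    mirroring the abstract's statement that both are used jointly."""
    metas = [meta_feature(f, template, bucket_map) for f in base_features]
    return list(base_features) + metas


# Toy counts standing in for statistics from auto-annotated data.
counts = Counter({f"f{i}": 100 - 10 * i for i in range(10)})
bucket_map = build_bucket_map(counts)
features = extract_features(["f0", "f5", "never_seen"], "head+mod", bucket_map)
```

Because many distinct base features collapse into a handful of bucket labels per template, the meta features are far less sparse than the base features, which is the effect the abstract attributes to the transformation.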
Publisher
Database: Elsevier - ScienceDirect
Journal: Artificial Intelligence - Volume 230, January 2016, Pages 173-191