کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4944882 1438010 2017 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Multi-granularity sequence labeling model for acronym expansion identification
ترجمه فارسی عنوان
مدل برچسب زدن چند منظوره برای شناسایی گسترش مخفف
کلمات کلیدی
زمینه های تصادفی محض، شبکه عصبی، لغو، برچسب زدن، چند دانه بودن،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
Identifying expansion forms for acronyms is beneficial to many natural language processing and information retrieval tasks. In this work, we study the problem of finding expansions in texts for given acronym queries by modeling the problem as a sequence labeling task. However, it is challenging for traditional sequence labeling models like Conditional Random Fields (CRF) due to the complexity of the input sentences and the substructure of the categories. In this paper, we propose a Latent-state Neural Conditional Random Fields model (LNCRF) to deal with the challenges. On one hand, we extend CRF by coupling it with nonlinear hidden layers to learn multi-granularity hierarchical representations of the input data under the framework of Conditional Random Fields. On the other hand, we introduce latent variables to capture the fine granular information from the intrinsic substructures within the structured output labels implicitly. The experimental results on real data show that our model achieves the best performance against the state-of-the-art baselines.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volume 378, 1 February 2017, Pages 462-474
نویسندگان
, , ,