کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4960935 1446507 2017 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Resolving Entity Morphs based on Character-Word Embedding
ترجمه فارسی عنوان
حل مؤلفه متنی بر مبنای رمزگذاری کاراکتر ورد
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
چکیده انگلیسی

Morph is a special type of fake alternative names. Internet users use morphs to achieve certain goals such as expressing special sentiment or avoiding censorship. For example, Chinese internet users often replace “马景涛” (Ma Jingtao) with “咆哮教主” (Roar Bishop)1. “咆哮教主” (Roar Bishop) is a morph and “马景涛” (Ma Jingtao) is the target entity of "咆哮教主" Roar Bishop . This paper focuses on morph resolution: given a morph, figure out the entity that it really refers to After analyse the common characteristic of morphs and target entities from cross-source corpora, we exploit temporal and semantic constraints to collect target candidates. We propose a framework based on character-word embeddings and radical-character-word embeddings to rank target candidates. Our method does not need any human-annotated data. Experimental results demonstrate our approaches outperforms the state-of-the-art method. The results also show that the performance is better when morphs share any character with target entities.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Procedia Computer Science - Volume 108, 2017, Pages 48-57
نویسندگان
, , , , ,