کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
383110 660802 2016 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Learning to extract domain-specific relations from complex sentences
ترجمه فارسی عنوان
یادگیری برای استخراج روابط دامنه خاص از جملات پیچیده
کلمات کلیدی
استخراج اطلاعات باز؛ نقشه برداری ساختار؛ راه اندازی نمونه های آموزشی؛ نقشه برداری حریص
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی


• We propose SemIE, Semantic-based Information Extraction and Mapping.
• Our approach identifies significant relations and maps them to a semantic structure.
• Our approach bootstraps training examples from a pair of structured documents.
• The results show our approach outperforms current state-of-the-art system.
• The results prove the effectiveness of our approach in handling complex sentences.

Open Information Extraction (OIE) systems focus on identifying and extracting general relations from text. Most OIE systems utilize simple linguistic structure, such as part-of-speech or dependency features, to extract relations and arguments from a sentence. These approaches are simple and fast to implement, but suffer from two main drawbacks: i) they are less effective to handle complex sentences with multiple relations and shared arguments, and ii) they tend to extract overly-specific relations.This paper proposes an approach to Information Extraction called SemIE, which addresses both drawbacks. SemIE identifies significant relations from domain-specific text by utilizing a semantic structure that describes the domain of discourse. SemIE exploits the predicate-argument structure of a text, which is able to handle complex sentences. The semantics of the arguments are explicitly specified by mapping them to relevant concepts in the semantic structure.SemIE uses a semi-supervised learning approach to bootstrap training examples that cover all relations expressed in the semantic structure. SemIE inputs pairs of structured documents and uses a Greedy Mapping module to bootstrap a full set of training examples. The training examples are then used to learn the extraction and mapping rules.We evaluated the performance of SemIE by comparing it with OLLIE, a state-of-the-art OIE system. We tested SemIE and OLLIE on the task of extracting relations from text in the “movie” domain and found that on average, SemIE outperforms OLLIE. Furthermore, we also examined how the performance varies with sentence complexity and sentence length. The results prove the effectiveness of SemIE in handling complex sentences.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 60, 30 October 2016, Pages 107–117
نویسندگان
, , , ,