کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
558974 1451688 2016 28 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Assessing sentence similarity through lexical, syntactic and semantic analysis
ترجمه فارسی عنوان
بررسی حکم شباهت از طریق تجزیه و تحلیل واژگانی و نحوی و معنایی
کلمات کلیدی
بر اساس نمودار مدل; جمله ساده; استخراج رابطه; برنامه نویسی منطقی استقراء
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
چکیده انگلیسی


• A new sentence similarity measure based on lexical, syntactic, semantic analysis.
• It combines statistical and semantic methods to measure similarity between words.
• The measure was evaluated using state-of-art datasets: Li et al., SemEval 2012, CNN.
• It presents an application to eliminate redundancy in multi-document summarization.

The degree of similarity between sentences is assessed by sentence similarity methods. Sentence similarity methods play an important role in areas such as summarization, search, and categorization of texts, machine translation, etc. The current methods for assessing sentence similarity are based only on the similarity between the words in the sentences. Such methods either represent sentences as bag of words vectors or are restricted to the syntactic information of the sentences. Two important problems in language understanding are not addressed by such strategies: the word order and the meaning of the sentence as a whole. The new sentence similarity assessment measure presented here largely improves and refines a recently published method that takes into account the lexical, syntactic and semantic components of sentences. The new method was benchmarked using Li–McLean, showing that it outperforms the state of the art systems and achieves results comparable to the evaluation made by humans. Besides that, the method proposed was extensively tested using the SemEval 2012 sentence similarity test set and in the evaluation of the degree of similarity between summaries using the CNN-corpus. In both cases, the measure proposed here was proved effective and useful.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 39, September 2016, Pages 1–28
نویسندگان
, , , , ,