Semantic similarity of short texts in languages with a deficient natural language processing support

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
552061	873171	2013	10 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر سیستم های اطلاعاتی

پیش نمایش صفحه اول مقاله

Semantic similarity of short texts in languages with a deficient natural language processing support

چکیده انگلیسی

Measuring the semantic similarity of short texts is a noteworthy problem since short texts are widely used on the Internet, in the form of product descriptions or captions, image and webpage tags, news headlines, etc. This paper describes a methodology which can be used to create a software system capable of determining the semantic similarity of two given short texts. The proposed LInSTSS approach is particularly suitable for application in situations when no large, publicly available, electronic linguistic resources can be found for the desired language. We describe the basic working principles of the system architecture we propose, as well as the stages of its construction and use. Also, we explain the procedure used to generate a paraphrase corpus which is then utilized in the evaluation process. Finally, we analyze the evaluation results obtained from a system created for the Serbian language, and we discuss possible improvements which would increase system accuracy.

► How to create software for determining the semantic similarity of short texts
► How to cope when no linguistic resources can be found for the desired language?
► The procedure of paraphrase corpus creation is also described.
► Results are evaluated on a system created for the Serbian language.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Decision Support Systems - Volume 55, Issue 3, June 2013, Pages 710–719

نویسندگان

Bojan Furlan, Vuk Batanović, Boško Nikolić,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Semantic similarity of short texts in languages with a deficient natural language processing support

دسترسی سریع

ارتباط

English Website