Article code: 395077
Journal code: 665927
Publication year: 2010
English article: 11 pages, PDF
Full-text version: free download
English title of the ISI article
A short text modeling method combining semantic and statistical information
Related topics
Engineering and Basic Sciences, Computer Engineering, Artificial Intelligence
English abstract

A novel modeling method for a collection of short text snippets is presented in this paper to measure the similarity between pairs of snippets. The method takes account of both the semantic and statistical information within the short text snippets, and consists of three steps. Given a set of raw short text snippets, it first establishes the initial similarity between words by using a lexical database. The method then iteratively calculates both word similarity and short text similarity. Finally, a proximity matrix is constructed based on word similarity and used to convert the raw text snippets into vectors. Word similarity and text clustering experiments show that the proposed short text modeling method improves the performance of existing text-related information retrieval (IR) techniques.
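The three steps named in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's formulas, so everything concrete here is an assumption made for illustration: the function names `cosine_rows` and `model_snippets`, the `mix` weight, the update rule that blends the lexical prior with co-occurrence statistics, and the hand-written `initial_word_sim` matrix that stands in for similarities looked up in a lexical database.

```python
import numpy as np


def cosine_rows(m):
    """Cosine similarity between every pair of rows of m."""
    norms = np.linalg.norm(m, axis=1, keepdims=True)
    norms[norms == 0] = 1.0  # leave all-zero rows as zero vectors
    unit = m / norms
    return unit @ unit.T


def model_snippets(counts, initial_word_sim, iterations=3, mix=0.5):
    """Hypothetical sketch of the three steps in the abstract.

    counts:           (n_snippets, n_words) term-frequency matrix
    initial_word_sim: (n_words, n_words) symmetric prior, standing in
                      for similarities taken from a lexical database
    mix:              assumed weight of the lexical prior in each update
    """
    prior = np.asarray(initial_word_sim, dtype=float)
    word_sim = prior.copy()  # step 1: initial word similarity

    # Step 2: alternate between text similarity and word similarity.
    for _ in range(iterations):
        text_sim = cosine_rows(counts @ word_sim)
        # Assumed statistical signal: two words are similar when they
        # occur in snippets that are themselves similar.
        statistical = cosine_rows(counts.T @ text_sim)
        word_sim = mix * prior + (1.0 - mix) * statistical

    # Step 3: use word similarity as a proximity matrix to turn the
    # raw term counts into snippet vectors.
    vectors = counts @ word_sim
    return vectors, word_sim, cosine_rows(vectors)


# Toy data: columns are "car", "automobile", "bank", "money".
counts = np.array([
    [1, 0, 0, 0],  # snippet 0: "car"
    [0, 1, 0, 0],  # snippet 1: "automobile"
    [0, 0, 1, 1],  # snippet 2: "bank money"
])
initial_word_sim = np.eye(4)
initial_word_sim[0, 1] = initial_word_sim[1, 0] = 0.9  # assumed lexical prior

vectors, word_sim, text_sim = model_snippets(counts, initial_word_sim)
print(np.round(text_sim, 3))
```

In this toy setting snippets 0 and 1 share no words, so plain bag-of-words cosine similarity would score them as unrelated, while the proximity-weighted vectors score them as close. The words "bank" and "money" have no lexical prior here and become related only through co-occurrence, which is the statistical half of the idea.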

Publisher
Database: Elsevier - ScienceDirect
Journal: Information Sciences - Volume 180, Issue 20, 15 October 2010, Pages 4031–4041