کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
490382 707462 2013 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Combining Lexical and Semantic Features for Short Text Classification
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
پیش نمایش صفحه اول مقاله
Combining Lexical and Semantic Features for Short Text Classification
چکیده انگلیسی

In this paper, we propose a novel approach to classify short texts by combining both their lexical and semantic features. We present an improved measurement method for lexical feature selection and furthermore obtain the semantic features with the background knowledge repository which covers target category domains. The combination of lexical and semantic features is achieved by mapping words to topics with different weights. In this way, the dimensionality of feature space is reduced to the number of topics. We here use Wikipedia as background knowledge and employ Support Vector Machine (SVM) as classifier. The experiment results show that our approach has better effectiveness compared with existing methods for classifying short texts.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Procedia Computer Science - Volume 22, 2013, Pages 78-86