کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
385697 660869 2011 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A feature selection method based on improved fisher’s discriminant ratio for text sentiment classification
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
A feature selection method based on improved fisher’s discriminant ratio for text sentiment classification
چکیده انگلیسی

Owing to its openness, virtualization and sharing criterion, the Internet has been rapidly becoming a platform for people to express their opinion, attitude, feeling and emotion. As the subjectivity texts are often too many for people to go through, how to automatically classify them into different sentiment orientation categories (e.g. positive/negative) has become an important research problem. In this paper, based on Fisher’s discriminant ratio, an effective feature selection method is proposed for subjectivity text sentiment classification. In order to validate the proposed method, we compared it with the method based on Information Gain while Support Vector Machine is adopted as the classifier. Two experiments are conducted by combining different feature selection methods with two kinds of candidate feature sets. Under 2739 subjectivity documents of COAE2008s and 1006 car-related subjectivity documents, the experimental results indicate that the Fisher’s discriminant ratio based on word frequency estimation has the best performance respectively with accuracy 86.61% and 82.80% under two corpus while the candidate features are the words which appear in both positive and negative texts.

Research highlights
► Based on Fisher’s discriminant ratio, an effective feature selection method is proposed for subjectivity text sentiment classification.
► The proposed method is compared with the method based on Information Gain while Support Vector Machine is adopted as the classifier.
► Under 2739 subjectivity documents of COAE2008s and 1006 car-related subjectivity documents, the experimental results indicate that the Fisher’s discriminant ratio based on word frequency estimation has the best performance.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 38, Issue 7, July 2011, Pages 8696–8702
نویسندگان
, , , , ,