کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
10322655 660870 2011 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Comparison of metrics for feature selection in imbalanced text classification
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
Comparison of metrics for feature selection in imbalanced text classification
چکیده انگلیسی
► Feature selections using Type-I metrics (χP2 and Gini index) achieve the comparable classification performances with those of the combination framework using Type-III metrics (signed χ2 and signed information gain). ► The performances with Type-II metrics (χ2 and information gain) are significantly degraded with increasing the degree of class imbalance. ► Type-III metrics produced the best performance; however, the optimization with these metrics is not easy in real applications. ► Type-I metrics serve as more simplified alternative methods for the combination framework. ► The classification performances using Type-I and Type-II metrics have positive correlations with the number of negative features.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 38, Issue 5, May 2011, Pages 4978-4989
نویسندگان
, , ,