کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
533549 870128 2011 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A multi-objective optimisation approach for class imbalance learning
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
A multi-objective optimisation approach for class imbalance learning
چکیده انگلیسی

Class imbalance limits the performance of most learning algorithms since they cannot cope with large differences between the number of samples in each class, resulting in a low predictive accuracy over the minority class. In this respect, several papers proposed algorithms aiming at achieving more balanced performance. However, balancing the recognition accuracies for each class very often harms the global accuracy. Indeed, in these cases the accuracy over the minority class increases while the accuracy over the majority one decreases. This paper proposes an approach to overcome this limitation: for each classification act, it chooses between the output of a classifier trained on the original skewed distribution and the output of a classifier trained according to a learning method addressing the course of imbalanced data. This choice is driven by a parameter whose value maximizes, on a validation set, two objective functions, i.e. the global accuracy and the accuracies for each class. A series of experiments on ten public datasets with different proportions between the majority and minority classes show that the proposed approach provides more balanced recognition accuracies than classifiers trained according to traditional learning methods for imbalanced data as well as larger global accuracy than classifiers trained on the original skewed distribution.

Research highlights
► Class imbalance limits the performance of most learning algorithms.
► Balancing the recognition accuracies for each class often harms the global accuracy.
► Reliability takes into account issues influencing a correct classification.
► Multi-objective or Pareto optimisation provides more balanced recognition performance.
► Two objective functions maximization, i.e. global accuracy and accuracies of each class.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition - Volume 44, Issue 8, August 2011, Pages 1801–1810
نویسندگان
,