کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
416253 681315 2016 14 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
High dimensional classifiers in the imbalanced case
ترجمه فارسی عنوان
طبقه بندی های بعدی در مورد عدم تعادل
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
چکیده انگلیسی

A binary classification problem is imbalanced when the number of samples from the two groups differs. For the high dimensional case, where the number of variables is much larger than the number of samples, imbalance leads to a bias in the classification. The independence classifier is studied theoretically and based on the analysis two new classifiers are suggested that can handle any imbalance ratio. The analytical results are supplemented by a simulation study, where the suggested classifiers in some aspects outperform multiple undersampling. For correlated data the ROAD classifier is considered and a suggestion is given for how to modify the classifier to handle the bias from imbalanced group sizes.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computational Statistics & Data Analysis - Volume 98, June 2016, Pages 46–59
نویسندگان
, ,