Article code: 406530
Journal code: 678092
Publication year: 2014
English article: 12-page PDF
Full-text version: free download
English title of the ISI article
PDFOS: PDF estimation based over-sampling for imbalanced two-class problems
Related topics
Engineering and basic sciences, Computer engineering, Artificial intelligence
English abstract

This contribution proposes a novel probability density function (PDF) estimation based over-sampling (PDFOS) approach for two-class imbalanced classification problems. The classical Parzen-window kernel function is adopted to estimate the PDF of the positive class. Synthetic instances are then generated according to the estimated PDF and used as additional training data. The essential concept is to re-balance the class distribution of the original imbalanced data set under the principle that the synthetic data samples follow the same statistical properties as the observed positive-class data. Based on the over-sampled training data, a radial basis function (RBF) classifier is constructed by applying the orthogonal forward selection procedure, in which the classifier's structure and the parameters of the RBF kernels are determined using a particle swarm optimisation algorithm based on the criterion of minimising the leave-one-out misclassification rate. The effectiveness of the proposed PDFOS approach is demonstrated by an empirical study on several imbalanced data sets.
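
The over-sampling step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a Gaussian Parzen-window kernel whose covariance is the positive-class sample covariance scaled by a hand-picked bandwidth; the abstract does not specify the kernel shape or how the bandwidth is chosen. The names `parzen_oversample`, `X_pos`, `n_new` and `bandwidth` are hypothetical. Drawing from a Parzen-window estimate amounts to picking one observed positive sample at random and adding noise drawn from the kernel centred on it.

```python
import numpy as np


def parzen_oversample(X_pos, n_new, bandwidth, seed=None):
    """Draw n_new synthetic points from a Gaussian Parzen-window estimate.

    Assumption (not from the abstract): each kernel is Gaussian with
    covariance bandwidth**2 * S, where S is the sample covariance of X_pos.
    """
    rng = np.random.default_rng(seed)
    X_pos = np.asarray(X_pos, dtype=float)
    n, d = X_pos.shape

    # Sample covariance of the positive class (kept 2-D even when d == 1).
    cov = np.atleast_2d(np.cov(X_pos, rowvar=False))

    # The estimate is an equal-weight mixture of n kernels, so first choose
    # a kernel centre uniformly at random for every synthetic point ...
    centres = X_pos[rng.integers(0, n, size=n_new)]

    # ... then perturb it with noise drawn from that kernel.
    noise = rng.multivariate_normal(np.zeros(d), bandwidth**2 * cov, size=n_new)
    return centres + noise


# Usage: top up a small positive class so that it matches the negative class.
rng = np.random.default_rng(0)
X_neg = rng.normal(loc=0.0, scale=1.0, size=(200, 2))
X_pos = rng.normal(loc=2.0, scale=0.5, size=(20, 2))

X_syn = parzen_oversample(X_pos, n_new=len(X_neg) - len(X_pos), bandwidth=0.5, seed=1)
X_pos_balanced = np.vstack([X_pos, X_syn])
```

The balanced positive set would then be passed, together with the negative class, to whichever classifier is being trained. The RBF classifier construction mentioned in the abstract is outside the scope of this sketch.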

Publisher
Database: Elsevier - ScienceDirect
Journal: Neurocomputing - Volume 138, 22 August 2014, Pages 248–259