Article code | Journal code | Publication year | English article | Full-text version |
---|---|---|---|---|
563050 | 875467 | 2013 | 13-page PDF | Free download |
![First page of the article: Resampling methods for quality assessment of classifier performance and optimal number of features](/preview/png/563050.png)
• Novel resampling-based method.
• Choice of the best classifier among a set of candidates.
• Estimation of the optimal feature set dimensionality.
• Algorithm tested on synthetic and real data.
We address two fundamental design issues of a classification system: the choice of the classifier and the dimensionality of the optimal feature subset. Resampling techniques are applied to estimate both the probability distribution of the misclassification rate (or any other figure of merit of a classifier) subject to the size of the feature set, and the probability distribution of the optimal dimensionality given a classification system and a misclassification rate. The latter allows for the estimation of confidence intervals for the optimal feature set size. Based on the former, a quality assessment for the classifier performance is proposed. Traditionally, the comparison of classification systems is accomplished for a fixed feature set. However, a different set may provide different results. The proposed method compares the classifiers independently of any pre-selected feature set. The algorithms are tested on 80 sets of synthetic examples and six standard databases of real data. The results on the simulated data are verified by an exhaustive search for the optimum, and the results on the real data sets by two feature selection algorithms.
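The two distributions described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not specify the resampling scheme, so this example assumes a bootstrap with out-of-bag testing, uses a nearest-centroid classifier as a stand-in, and assumes the features are already ranked so that the candidate subsets are the first k features. All function names and the synthetic data are hypothetical.

```python
import numpy as np

def nearest_centroid_error(X_train, y_train, X_test, y_test):
    """Misclassification rate of a nearest-centroid classifier (stand-in classifier)."""
    classes = np.unique(y_train)
    centroids = np.stack([X_train[y_train == c].mean(axis=0) for c in classes])
    # Squared Euclidean distance from every test point to every class centroid.
    d2 = ((X_test[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    y_pred = classes[d2.argmin(axis=1)]
    return float(np.mean(y_pred != y_test))

def bootstrap_error_by_dimension(X, y, n_boot=100, seed=0):
    """Return an (n_boot, n_features) array of out-of-bag error rates.

    Column k-1 holds the error when the first k features are used.
    Assumption: features are pre-ranked, so subsets are nested.
    """
    rng = np.random.default_rng(seed)
    n, p = X.shape
    n_classes = np.unique(y).size
    errors = np.empty((n_boot, p))
    for b in range(n_boot):
        while True:
            idx = rng.integers(0, n, size=n)       # bootstrap sample, with replacement
            oob = np.setdiff1d(np.arange(n), idx)  # out-of-bag points used for testing
            # Redraw in the rare case of an unusable replicate.
            if oob.size > 0 and np.unique(y[idx]).size == n_classes:
                break
        for k in range(1, p + 1):
            errors[b, k - 1] = nearest_centroid_error(
                X[idx, :k], y[idx], X[oob, :k], y[oob])
    return errors

def summarize_optimal_dimension(errors, alpha=0.05):
    """Empirical distribution and percentile interval of the optimal feature count."""
    best_k = errors.argmin(axis=1) + 1  # best subset size in each replicate
    lo, hi = np.percentile(best_k, [100 * alpha / 2, 100 * (1 - alpha / 2)])
    return best_k, (float(lo), float(hi))

# Hypothetical synthetic data: 3 informative features followed by 5 noise features.
rng = np.random.default_rng(1)
n, p = 200, 8
y = rng.integers(0, 2, size=n)
X = rng.normal(size=(n, p))
X[:, :3] += 1.0 * y[:, None]  # shift the informative features for class 1

errors = bootstrap_error_by_dimension(X, y, n_boot=100)
best_k, ci = summarize_optimal_dimension(errors)
print("mean error per subset size:", errors.mean(axis=0).round(3))
print("95% percentile interval for the optimal size:", ci)
```

Each column of `errors` approximates the distribution of the misclassification rate for one feature set size, and `best_k` approximates the distribution of the optimal dimensionality. Running the same procedure for several classifiers and comparing their error distributions across all subset sizes is one way to compare them without fixing a feature set in advance.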
Journal: Signal Processing - Volume 93, Issue 11, November 2013, Pages 2956–2968