Estimation of the conditional risk in classification: The swapping method

Article ID	Journal	Published Year	Pages	File Type
417298	Computational Statistics & Data Analysis	2008	13 Pages	PDF

Abstract

The bias of the empirical error rate in supervised classification is studied. It is shown that this bias can be understood as a covariance between the classification rule and the labeling of the training data. From this result, a new penalized criterion is proposed to perform model selection in classification. Applications of the resulting algorithm to simulated and real data are presented.
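The covariance reading of the bias can be checked numerically on a toy problem. The sketch below is not the swapping method itself. It assumes a fixed design with binary labels in {0, 1} and uses the standard identity that the expected gap between the in-sample error and the empirical error equals (2/n) Σ Cov(ŷ_i, y_i). The 3-nearest-neighbour rule on an integer grid, the function names, and the probabilities `p` are hypothetical choices made only for illustration.

```python
from itertools import product


def knn_predict(labels, k=3):
    """Majority vote among the k nearest design points on the grid x_j = j (self included)."""
    n = len(labels)
    preds = []
    for i in range(n):
        # Neighbours ordered by distance to x_i; ties are broken by index.
        nearest = sorted(range(n), key=lambda j: (abs(j - i), j))[:k]
        votes = sum(labels[j] for j in nearest)
        preds.append(1 if 2 * votes > k else 0)
    return preds


def optimism_and_covariance(p, k=3):
    """Exact expectations obtained by enumerating every labeling y in {0,1}^n.

    p[i] is the assumed probability that y_i = 1 at design point i.
    Returns (expected in-sample error - expected empirical error,
             (2/n) * sum_i Cov(yhat_i, y_i)).
    """
    n = len(p)
    exp_train_err = 0.0
    exp_insample_err = 0.0
    e_pred = [0.0] * n      # E[yhat_i]
    e_pred_y = [0.0] * n    # E[yhat_i * y_i]
    for y in product([0, 1], repeat=n):
        # Probability of this labeling under independent Bernoulli(p_i) labels.
        w = 1.0
        for yi, pi in zip(y, p):
            w *= pi if yi else 1.0 - pi
        preds = knn_predict(y, k)
        exp_train_err += w * sum(a != b for a, b in zip(preds, y)) / n
        # Error against an independent relabeling drawn at the same points.
        exp_insample_err += w * sum(pi if a == 0 else 1.0 - pi
                                    for a, pi in zip(preds, p)) / n
        for i in range(n):
            e_pred[i] += w * preds[i]
            e_pred_y[i] += w * preds[i] * y[i]
    cov_sum = sum(e_pred_y[i] - e_pred[i] * p[i] for i in range(n))
    return exp_insample_err - exp_train_err, 2.0 * cov_sum / n


p = [0.2, 0.3, 0.6, 0.7, 0.8, 0.4]  # hypothetical label probabilities
optimism, penalty = optimism_and_covariance(p)
print(optimism, penalty)
```

Under these assumptions the two printed values agree up to floating-point rounding, and both are positive: the rule that uses a point's own label in its vote looks better on the training labels than on fresh labels at the same points.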

Keywords

Model selection; Classification

Authors

Jean-Jacques Daudin, Tristan Mary-Huard