Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
417845 | Computational Statistics & Data Analysis | 2009 | 10 Pages |
Abstract
It has been recognized that Classification trees (CART) are unstable; a small perturbation in the input variables or a fresh sample can lead to a very different classification tree. Some approaches exist that try to correct this instability. However, their benefits can, at present, be appreciated only qualitatively. A similarity measure between two classification trees is introduced that can measure their closeness. Its usefulness is illustrated with synthetic data on the impact of radioactivity deposit through the environment. In this context, a modified node level stabilizing technique, referred to as the NLS–REP method, is introduced and shown to be more stable than the classical CART method.
Related Topics
Physical Sciences and Engineering
Computer Science
Computational Theory and Mathematics
Authors
Bénédicte Briand, Gilles R. Ducharme, Vanessa Parache, Catherine Mercat-Rommens,