کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
403827 677360 2012 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Jmax-pruning: A facility for the information theoretic pruning of modular classification rules
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
Jmax-pruning: A facility for the information theoretic pruning of modular classification rules
چکیده انگلیسی

The Prism family of algorithms induces modular classification rules in contrast to the Top Down Induction of Decision Trees (TDIDT) approach which induces classification rules in the intermediate form of a tree structure. Both approaches achieve a comparable classification accuracy. However in some cases Prism outperforms TDIDT. For both approaches pre-pruning facilities have been developed in order to prevent the induced classifiers from overfitting on noisy datasets, by cutting rule terms or whole rules or by truncating decision trees according to certain metrics. There have been many pre-pruning mechanisms developed for the TDIDT approach, but for the Prism family the only existing pre-pruning facility is J-pruning. J-pruning not only works on Prism algorithms but also on TDIDT. Although it has been shown that J-pruning produces good results, this work points out that J-pruning does not use its full potential. The original J-pruning facility is examined and the use of a new pre-pruning facility, called Jmax-pruning, is proposed and evaluated empirically. A possible pre-pruning facility for TDIDT based on Jmax-pruning is also discussed.


► We improve a rule pruning method for modular classification rules.
► We examine the information theoretical shortcoming of the J-pruning approach.
► Our Jmax-pruning is based on the rule’s maximum theoretical information content.
► Empirical results show a significant improvement of Jmax-pruning to J-pruning.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Knowledge-Based Systems - Volume 29, May 2012, Pages 12–19
نویسندگان
, ,