کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
385437 660865 2011 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Decision tree induction using a fast splitting attribute selection for large datasets
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
Decision tree induction using a fast splitting attribute selection for large datasets
چکیده انگلیسی

Several algorithms have been proposed in the literature for building decision trees (DT) for large datasets, however almost all of them have memory restrictions because they need to keep in main memory the whole training set, or a big amount of it, and such algorithms that do not have memory restrictions, because they choose a subset of the training set, need extra time for doing this selection or have parameters that could be very difficult to determine. In this paper, we introduce a new algorithm that builds decision trees using a fast splitting attribute selection (DTFS) for large datasets. The proposed algorithm builds a DT without storing the whole training set in main memory and having only one parameter but being very stable regarding to it. Experimental results on both real and synthetic datasets show that our algorithm is faster than three of the most recent algorithms for building decision trees for large datasets, getting a competitive accuracy.


► This paper presents DTFS a new algorithm for building decision trees from large datasets.
► DTFS is faster than previous algorithms for building decision trees from large datasets.
► DTFS processes the instances incrementally and it does not store the whole training set in memory.
► If the number of attributes increases, DTFS has better behavior than previous algorithms.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 38, Issue 11, October 2011, Pages 14290–14300
نویسندگان
, , , ,