Article code: 4946162
Journal code: 1439281
Publication year: 2017
English article: 29-page PDF
Full-text version: Free download
English title of the ISI article
A less-greedy two-term Tsallis Entropy Information Metric approach for decision tree classification
Keywords
Decision trees, feature split criterion, tree construction, classification
Related topics
Engineering and Basic Sciences, Computer Engineering, Artificial Intelligence
English abstract
The construction of efficient and effective decision trees remains a key topic in machine learning because of their simplicity and flexibility. Many heuristic algorithms have been proposed to construct near-optimal decision trees. Most of them, however, are greedy algorithms that have the drawback of obtaining only local optima. Besides, the conventional split criteria they use, e.g. Shannon entropy, Gain Ratio and Gini index, are based on a single term and lack adaptability to different datasets. To address these issues, we propose a less-greedy two-term Tsallis Entropy Information Metric (TEIM) algorithm with a new split criterion and a new construction method for decision trees. First, the new split criterion is based on two-term Tsallis conditional entropy, which is better than conventional one-term split criteria. Second, the new tree construction is based on a two-stage approach that reduces greediness and avoids local optima to a certain extent. The TEIM algorithm takes advantage of the generalization ability of two-term Tsallis entropy and the low-greediness property of the two-stage approach. Experimental results on UCI datasets indicate that, compared with state-of-the-art decision tree algorithms, the TEIM algorithm yields statistically significantly better decision trees and is more robust to noise.
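As background for the split criteria named in the abstract, the following is a minimal sketch of the standard one-term Tsallis entropy, S_q(p) = (1 - sum_i p_i^q) / (q - 1). It is not the paper's two-term TEIM criterion, which the abstract does not define. The function names `tsallis_entropy` and `gini_index` and the sample class distribution are illustrative assumptions. The sketch shows the well-known special cases: the limit q -> 1 gives Shannon entropy (in nats), and q = 2 gives the Gini index.

```python
import math


def tsallis_entropy(probs, q):
    """One-term Tsallis entropy of a discrete distribution.

    S_q(p) = (1 - sum_i p_i**q) / (q - 1).
    At q == 1 the formula is undefined, so the Shannon limit
    (natural logarithm, i.e. nats) is returned instead.
    """
    probs = [p for p in probs if p > 0]  # zero-probability classes contribute nothing
    if math.isclose(q, 1.0):
        return -sum(p * math.log(p) for p in probs)
    return (1.0 - sum(p ** q for p in probs)) / (q - 1.0)


def gini_index(probs):
    """Gini impurity: 1 - sum_i p_i**2."""
    return 1.0 - sum(p * p for p in probs)


# Hypothetical class distribution at a tree node (illustration only).
class_probs = [0.5, 0.3, 0.2]

print("Shannon (q -> 1):", tsallis_entropy(class_probs, 1.0))
print("Tsallis (q = 2): ", tsallis_entropy(class_probs, 2.0))
print("Gini index:      ", gini_index(class_probs))
```

Because q is a free parameter, a criterion built on Tsallis entropy can be tuned per dataset, which is the kind of adaptability the abstract says fixed one-term criteria lack.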
Publisher
Database: Elsevier - ScienceDirect
Journal: Knowledge-Based Systems - Volume 120, 15 March 2017, Pages 34-42