کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
531807 869876 2016 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A hybrid classification algorithm by subspace partitioning through semi-supervised decision tree
ترجمه فارسی عنوان
یک الگوریتم طبقه بندی ترکیبی توسط پارتیشن بندی فضای فرعی از طریق درخت تصمیم گیری نیمه تحت نظارت
کلمات کلیدی
درخت تصمیم گیری؛ درخت تصمیمی نیمه نظارت؛ اندازه گیری غیرمنتظره؛ پارتیشن بندی فضای فرعی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
چکیده انگلیسی


• Propose the semi-supervised split criterion for decision trees.
• Combine the semi-supervised decision tree as subspace partitioning with other classifiers.
• Experiments on several datasets showed that the proposed method outperforms the existing ones.

Among data mining techniques, the decision tree is one of the more widely used methods for building classification models in the real world because of its simplicity and ease of interpretation. However, the method has some drawbacks, including instability, the nonsmooth nature of the decision boundary, and the possibility of overfitting. To overcome these problems, several works have utilized the relative advantages of other classifiers, such as logistic regression, support vector machine, and neural networks, in combination with a decision tree, in hybrid models which avoid the drawbacks of other models. Some hybrid models have used decision trees to quickly and efficiently partition the input space, and many studies have proved the effectiveness of the hybrid methods. However, there is room for further improvement by considering the topological properties of a dataset, because typical decision trees split nodes based only on the target variable. The proposed semi-supervised decision tree splits internal nodes by utilizing both labels and the structural characteristics of data for subspace partitioning, to improve the accuracy of classifiers applied to terminal nodes in the hybrid models. Experimental results confirm the superiority of the proposed algorithm and demonstrate the detailed characteristics of the algorithm.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition - Volume 60, December 2016, Pages 157–163
نویسندگان
,