کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4946795 1439418 2017 28 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Attribute-based Decision Graphs: A framework for multiclass data classification
ترجمه فارسی عنوان
نمودارهای تصمیمی مبتنی بر مشخصه: چارچوب طبقه بندی داده های چند طبقه
کلمات کلیدی
ساخت نمودار داده، طبقه بندی مبتنی بر نمودار، طبقه بندی چند طبقه، نمودارهای تصمیمی مبتنی بر مشخصه، ارزشهای ویژگی گمشده،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
Graph-based algorithms have been successfully applied in machine learning and data mining tasks. A simple but, widely used, approach to build graphs from vector-based data is to consider each data instance as a vertex and connecting pairs of it using a similarity measure. Although this abstraction presents some advantages, such as arbitrary shape representation of the original data, it is still tied to some drawbacks, for example, it is dependent on the choice of a pre-defined distance metric and is biased by the local information among data instances. Aiming at exploring alternative ways to build graphs from data, this paper proposes an algorithm for constructing a new type of graph, called Attribute-based Decision Graph-AbDG. Given a vector-based data set, an AbDG is built by partitioning each data attribute range into disjoint intervals and representing each interval as a vertex. The edges are then established between vertices from different attributes according to a pre-defined pattern. Classification is performed through a matching process among the attribute values of the new instance and AbDG. Moreover, AbDG provides an inner mechanism to handle missing attribute values, which contributes for expanding its applicability. Results of classification tasks have shown that AbDG is a competitive approach when compared to well-known multiclass algorithms. The main contribution of the proposed framework is the combination of the advantages of attribute-based and graph-based techniques to perform robust pattern matching data classification, while permitting the analysis the input data considering only a subset of its attributes.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neural Networks - Volume 85, January 2017, Pages 69-84
نویسندگان
, , ,