Article ID Journal Published Year Pages File Type
442601 Computers & Graphics 2014 15 Pages PDF
Abstract

•A novel approach to parameters space exploration in decision tree building.•Based on identified tasks, two new views are introduced.•An example using a real world data set is described, and used to justify new methodology.

Decision trees are an intuitive yet powerful tool for performing predictive data analysis in data mining. In order to generate an adequate predictive model from a data set, a data analyst has to assess the predictive quality of the decision trees derived from several combinations of working parameters. Except in very simple cases, this may be a tedious and error prone supervised task, since the parameter space is frequently huge. Analysts rely on their intuition and usually test just a few different parameter settings. In this work we present an interactive approach to facilitate the comprehension of the predictive power of large collections of decision trees by exploring large portions of the parameter space. For this, we developed novel views that allow us to visualize and analyze the predictive quality of hundreds of trees, working together with coordinated multiple views of tree representations (needed to understand the tree shapes and actual information herein), and aggregates of Receiver Operating Characteristic (ROC) and lift curves for assessing the predictive quality of the models. We developed a worked example using a data set from a Telecommunications company, showing how easy and natural it is to gain insight into the behavior of the data within our exploration tool, as compared with the traditional and widespread common practice of data analysts.

Graphical abstractFigure optionsDownload full-size imageDownload high-quality image (196 K)Download as PowerPoint slide

Related Topics
Physical Sciences and Engineering Computer Science Computer Graphics and Computer-Aided Design
Authors
, , , ,