Article ID Journal Published Year Pages File Type
4943434 Expert Systems with Applications 2017 33 Pages PDF
Abstract
The preprocessing stage in knowledge discovery projects is costly, normally taking between 50% and 80% of the total project time. It is in this stage that data in a relational database are transformed for applying a data mining technique. This stage is a complex task that demands from database designers a strong interaction with experts having a broad knowledge about the application domain. Frameworks aiming to systemize this stage have significant limitations when applied to Credit Behavioral Scoring solutions. This paper proposes a framework based on the Model Driven Development approach to systemize the mentioned stage. This work has three main contributions: 1) improving the discriminant power of data mining techniques by means of the construction of new input variables which embed temporal knowledge for the technique; 2) reducing the time of data transformation using automatic code generation, and 3) allowing artificial intelligence and statistics modelers to perform the data transformation without the help of database experts. In order to validate the proposed framework, two comparative studies were conducted. Experiments showed that the proposed framework delivers a performance equivalent or superior to those of existing frameworks and reduces the time of data transformation with a confidence level of 95%.
Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, , ,