Article ID: 481526
Journal: European Journal of Operational Research
Published Year: 2009
Pages: 9
File Type: PDF
Abstract

Data mining aims to find patterns in organizational databases. However, most mining techniques do not take into account knowledge of the quality of the database. In this work, we show how to incorporate into classification mining recent advances in the data quality field that view a database as the product of an imprecise manufacturing process, where the flaws or defects are captured in quality matrices. We develop a general-purpose method of incorporating data quality matrices into the data mining classification task. Our work differs from existing data preparation techniques: whereas other approaches detect and fix errors to ensure consistency with the entire data set, our work makes use of a priori knowledge of how the data is produced (manufactured).
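
As an illustration only, the sketch below shows one way a quality matrix could feed into the counting step of a classifier. It is not the method developed in the paper, whose details the abstract does not give. It assumes a hypothetical matrix `quality` in which `quality[i][j]` is the probability that true value `i` is recorded as value `j`, and a hypothetical `prior` over true values; the function names are likewise invented for this example. Bayes' rule turns recorded counts into expected counts of the true values, which a classifier could use in place of the raw counts.

```python
def posterior_true_given_recorded(quality, prior):
    """Return post[j][i] = P(true = i | recorded = j) via Bayes' rule.

    Assumptions (not from the paper):
      quality[i][j] = P(recorded = j | true = i)
      prior[i]      = P(true = i)
    """
    n = len(prior)
    post = []
    for j in range(n):
        # Joint probability of (true = i, recorded = j) for each i.
        joint = [prior[i] * quality[i][j] for i in range(n)]
        total = sum(joint)
        # Normalize; if value j is never recorded, return zeros.
        post.append([x / total if total > 0 else 0.0 for x in joint])
    return post


def expected_true_counts(recorded_counts, post):
    """Redistribute recorded counts over the true values they may stand for."""
    n = len(recorded_counts)
    return [
        sum(recorded_counts[j] * post[j][i] for j in range(n))
        for i in range(n)
    ]


# Hypothetical two-valued attribute: value 0 is recorded correctly 90% of
# the time, value 1 is recorded correctly 80% of the time.
quality = [[0.9, 0.1],
           [0.2, 0.8]]
prior = [0.5, 0.5]

post = posterior_true_given_recorded(quality, prior)
recorded = [110, 90]
adjusted = expected_true_counts(recorded, post)
print(adjusted)
```

The adjusted counts always sum to the number of recorded rows, so the correction changes how observations are distributed across values without changing the size of the data set.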
