کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
495228 | 862821 | 2015 | 10 صفحه PDF | دانلود رایگان |

• Imputation data for monotone patterns of missing values.
• An estimation model of missing data based on multilayer perceptron.
• Combination of neural network and k-nearest neighbour-based multiple imputation.
• Comparison of the performance of proposed models with three classic procedures.
• Three classic single imputation models: mean/mode, regression and hot-deck.
The knowledge discovery process is supported by data files information gathered from collected data sets, which often contain errors in the form of missing values. Data imputation is the activity aimed at estimating values for missing data items. This study focuses on the development of automated data imputation models, based on artificial neural networks for monotone patterns of missing values. The present work proposes a single imputation approach relying on a multilayer perceptron whose training is conducted with different learning rules, and a multiple imputation approach based on the combination of multilayer perceptron and k-nearest neighbours. Eighteen real and simulated databases were exposed to a perturbation experiment with random generation of monotone missing data pattern. An empirical test was accomplished on these data sets, including both approaches (single and multiple imputations), and three classical single imputation procedures – mean/mode imputation, regression and hot-deck – were also considered. Therefore, the experiments involved five imputation methods. The results, considering different performance measures, demonstrated that, in comparison with traditional tools, both proposals improve the automation level and data quality offering a satisfactory performance.
Figure optionsDownload as PowerPoint slide
Journal: Applied Soft Computing - Volume 29, April 2015, Pages 65–74