Article code: 495664
Journal code: 862833
Publication year: 2013
English article: 20 pages, PDF
Full-text version: free download
English title of the ISI article
Partial imputation of unseen records to improve classification using a hybrid multi-layered artificial immune system and genetic algorithm
Related topics
Engineering and Basic Sciences, Computer Engineering, Computer Science Software
English abstract


• Genetic algorithm optimization is effective for partial imputation using MAIS.
• The hybrid MAIS and genetic algorithm improves performance of classifiers.
• Increased strength and resilience in the presence of escalating missing data.

Missing data in large insurance datasets affects the learning and classification accuracies in predictive modelling. Insurance datasets will continue to increase in size as more variables are added to aid in managing client risk and will therefore be even more vulnerable to missing data. This paper proposes a hybrid multi-layered artificial immune system and genetic algorithm for partial imputation of missing data in datasets with numerous variables. The multi-layered artificial immune system creates and stores antibodies that bind to and annihilate an antigen. The genetic algorithm optimises the learning process of a stimulated antibody. The evaluation of the imputation is performed using the RIPPER, k-nearest neighbour, naïve Bayes and logistic discriminant classifiers. The effect of the imputation on the classifiers is compared with that of the mean/mode and hot deck imputation methods. The results demonstrate that when missing data imputation is performed using the proposed hybrid method, the classification improves and the robustness to the amount of missing data is increased relative to the mean/mode method for data missing completely at random (MCAR), missing at random (MAR), and not missing at random (NMAR). The imputation performance is similar to or marginally better than that of the hot deck imputation.
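
For readers unfamiliar with the mean/mode baseline mentioned in the abstract, the following is a minimal sketch of how such a baseline and an MCAR masking step might look. It is not the paper's hybrid method and is not taken from the paper; the function names `mask_mcar` and `mean_mode_impute`, the use of `None` to mark a missing cell, and the list-of-rows data layout are all assumptions made for illustration.

```python
import random
from collections import Counter
from statistics import mean


def mask_mcar(rows, rate, seed=0):
    """Hypothetical helper: blank each cell independently with probability
    `rate`, which is one simple way to simulate MCAR missingness."""
    rng = random.Random(seed)
    return [[None if rng.random() < rate else v for v in row] for row in rows]


def mean_mode_impute(rows):
    """Hypothetical helper: fill None cells with the column mean (numeric
    columns) or the column mode (all other columns)."""
    n_cols = len(rows[0])
    fills = []
    for j in range(n_cols):
        observed = [row[j] for row in rows if row[j] is not None]
        if not observed:
            fills.append(None)  # no observed values: leave the gap as-is
        elif all(isinstance(v, (int, float)) for v in observed):
            fills.append(mean(observed))
        else:
            fills.append(Counter(observed).most_common(1)[0][0])
    return [[fills[j] if v is None else v for j, v in enumerate(row)]
            for row in rows]


# Toy records (invented for illustration, not data from the paper).
records = [[1.0, "a"], [3.0, "a"], [None, "b"], [5.0, None]]
imputed = mean_mode_impute(records)
```

Because every missing cell in a column receives the same value, this baseline ignores relationships between variables, which may be one reason the abstract reports it as the weaker comparison method.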


Publisher
Database: Elsevier - ScienceDirect
Journal: Applied Soft Computing - Volume 13, Issue 12, December 2013, Pages 4461–4480