کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
384355 | 660846 | 2012 | 16 صفحه PDF | دانلود رایگان |
The purpose of response modeling for direct marketing is to identify those customers who are likely to purchase a campaigned product, based upon customers’ behavioral history and other information available. Contrary to mass marketing strategy, well-developed response models used for targeting specific customers can contribute profits to firms by not only increasing revenues, but also lowering marketing costs. Endemic in customer data used for response modeling is a class imbalance problem: the proportion of respondents is small relative to non-respondents. In this paper, we propose a novel data balancing method based on clustering, under-sampling, and ensemble to deal with the class imbalance problem, and thus improve response models. Using publicly available response modeling data sets, we compared the proposed method with other data balancing methods in terms of prediction accuracy and profitability. To investigate the usability of the proposed algorithm, we also employed various prediction algorithms when building the response models. Based on the response rate and profit analysis, we found that our proposed method (1) improved the response model by increasing response rate as well as reducing performance variation, and (2) increased total profit by significantly boosting revenue.
► A data balancing method with clustering, under-sampling, and ensemble is proposed.
► Information loss is minimized by under-sampling the majority class with clustering.
► Predictive accuracy is improved by an ensemble of multiple under-sampled data sets.
► The proposed method works well for two actual marketing tasks.
Journal: Expert Systems with Applications - Volume 39, Issue 8, 15 June 2012, Pages 6738–6753