Instance sampling in credit scoring: An empirical study of sample size and balancing

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
7408814	1481454	2012	15 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Sample size - اندازهی نمونه Over-sampling - بیش از نمونه برداری balancing - تعادل Data pre-processing - داده پیش پردازش under-sampling - زیر نمونه برداری Credit scoring - نمره اعتبار

موضوعات مرتبط

علوم انسانی و اجتماعی مدیریت، کسب و کار و حسابداری کسب و کار و مدیریت بین المللی

پیش نمایش صفحه اول مقاله

Instance sampling in credit scoring: An empirical study of sample size and balancing

چکیده انگلیسی

To date, best practice in sampling credit applicants has been established based largely on expert opinion, which generally recommends that small samples of 1500 instances each of both goods and bads are sufficient, and that the heavily biased datasets observed should be balanced by undersampling the majority class. Consequently, the topics of sample sizes and sample balance have not been subject to either formal study in credit scoring, or empirical evaluations across different data conditions and algorithms of varying efficiency. This paper describes an empirical study of instance sampling in predicting consumer repayment behaviour, evaluating the relative accuracies of logistic regression, discriminant analysis, decision trees and neural networks on two datasets across 20 samples of increasing size and 29 rebalanced sample distributions created by gradually under- and over-sampling the goods and bads respectively. The paper makes a practical contribution to model building on credit scoring datasets, and provides evidence that using samples larger than those recommended in credit scoring practice provides a significant increase in accuracy across algorithms.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: International Journal of Forecasting - Volume 28, Issue 1, JanuaryâMarch 2012, Pages 224-238

نویسندگان

Sven F. Crone, Steven Finlay,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Instance sampling in credit scoring: An empirical study of sample size and balancing

دسترسی سریع

ارتباط

English Website