On the impact of disproportional samples in credit scoring models: An application to a Brazilian bank data

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
384145	660841	2012	8 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Performance measures - اندازه گیری عملکرد Classification models - مدل های طبقه بندی Credit scoring - نمره اعتبار

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش صفحه اول مقاله

On the impact of disproportional samples in credit scoring models: An application to a Brazilian bank data

چکیده انگلیسی

Statistical methods have been widely employed to assess the capabilities of credit scoring classification models in order to reduce the risk of wrong decisions when granting credit facilities to clients. The predictive quality of a classification model can be evaluated based on measures such as sensitivity, specificity, predictive values, accuracy, correlation coefficients and information theoretical measures, such as relative entropy and mutual information. In this paper we analyze the performance of a naive logistic regression model (Hosmer & Lemeshow, 1989) and a logistic regression with state-dependent sample selection model (Cramer, 2004) applied to simulated data. Also, as a case study, the methodology is illustrated on a data set extracted from a Brazilian bank portfolio. Our simulation results so far revealed that there is no statistically significant difference in terms of predictive capacity between the naive logistic regression models and the logistic regression with state-dependent sample selection models. However, there is strong difference between the distributions of the estimated default probabilities from these two statistical modeling techniques, with the naive logistic regression models always underestimating such probabilities, particularly in the presence of balanced samples.

► We compare the logistic model with state-dependent sample selection with the usual.
► A large simulation study is performed and a credit bank data is considered.
► The predictive quality of the models was evaluated based on several measures.
► For both models the predictive capacity is not statistically different.
► But naive logistic regression models always underestimate the default probabilities.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 39, Issue 9, July 2012, Pages 8071–8078

نویسندگان

Francisco Louzada, Paulo H. Ferreira-Silva, Carlos A.R. Diniz,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

On the impact of disproportional samples in credit scoring models: An application to a Brazilian bank data

دسترسی سریع

ارتباط

English Website