Article ID Journal Published Year Pages File Type
480718 European Journal of Operational Research 2016 13 Pages PDF
Abstract

•We evaluate default prediction performance of machine learning/regression models.•Including boosted trees, random forests, penalised linear/semi-parametric logistic regression.•Using data on over 300,000 residential mortgage loans.•The results indicate varying degrees of predictive power.•Statistical tests suggest boosted regression trees outperform penalised logistic regression.

This paper evaluates the performance of a number of modelling approaches for future mortgage default status. Boosted regression trees, random forests, penalised linear and semi-parametric logistic regression models are applied to four portfolios of over 300,000 Irish owner-occupier mortgages. The main findings are that the selected approaches have varying degrees of predictive power and that boosted regression trees significantly outperform logistic regression. This suggests that boosted regression trees can be a useful addition to the current toolkit for mortgage credit risk assessment by banks and regulators.

Related Topics
Physical Sciences and Engineering Computer Science Computer Science (General)
Authors
, ,