Application of Machine Learning in Credit Risk Assessment: A Prelude to Smart Banking

Abstract:

An accurate credit risk assessment system is vital to the sound and profitable operation of any financial institution. In an ever-changing economy where the rate of loan defaults is gradually rising, financial institutions find it increasingly difficult to assess loan requests correctly and to manage the risk posed by loan defaulters. In light of this, this paper proposes a machine learning model that assesses credit risk and predicts likely loan defaulters for credit lending institutions. A comparative analysis is carried out using tuned supervised learning algorithms such as Support Vector Machine, Random Forest, Extreme Gradient Boosting and Logistic Regression. Recursive Feature Elimination with Cross Validation and Principal Component Analysis are used for dimensionality reduction. Each model is evaluated with metrics such as F1 score, AUC score and prediction accuracy. Among all the models, the combination of a tuned Support Vector Machine and Recursive Feature Elimination with Cross Validation shows the most promise in identifying loan defaulters. The proposed model can thus assist financial institutions in predicting loan defaulters and help them avoid further losses.
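
As a rough illustration of the best-performing combination named above, the following Python sketch pairs Recursive Feature Elimination with Cross Validation with a Support Vector Machine and reports F1 score, AUC score and accuracy. It is a minimal sketch under stated assumptions, not the configuration used in this paper: it relies on scikit-learn, the data is synthetic (generated by make_classification as a stand-in for a loan dataset), the linear kernel is chosen only because feature elimination needs per-feature coefficients, and the function name evaluate_rfecv_svm and all hyperparameter values are hypothetical placeholders.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import RFECV
from sklearn.metrics import accuracy_score, f1_score, roc_auc_score
from sklearn.model_selection import StratifiedKFold, train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC


def evaluate_rfecv_svm(seed=0):
    """Fit RFECV + linear SVM on synthetic data and return test metrics."""
    # ASSUMPTION: synthetic, imbalanced data standing in for loan records
    # (class 1 plays the role of "defaulter").
    X, y = make_classification(
        n_samples=400, n_features=15, n_informative=5, n_redundant=2,
        weights=[0.8, 0.2], random_state=seed,
    )
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, stratify=y, random_state=seed
    )

    # RFECV drops the weakest feature one at a time and uses cross-validated
    # F1 to choose how many features to keep. A linear kernel is assumed
    # because RFECV ranks features by the estimator's coef_.
    selector = RFECV(
        estimator=SVC(kernel="linear", C=1.0),  # placeholder hyperparameters
        step=1,
        cv=StratifiedKFold(n_splits=5),
        scoring="f1",
    )
    model = make_pipeline(StandardScaler(), selector)
    model.fit(X_train, y_train)

    y_pred = model.predict(X_test)
    # Signed distance to the separating hyperplane, used as the AUC score.
    margins = model.decision_function(X_test)

    return {
        "n_selected": int(selector.n_features_),
        "f1": f1_score(y_test, y_pred),
        "auc": roc_auc_score(y_test, margins),
        "accuracy": accuracy_score(y_test, y_pred),
    }


if __name__ == "__main__":
    for name, value in evaluate_rfecv_svm().items():
        print(name, value)
```

Reporting F1 and AUC alongside accuracy matters here because defaulters are the minority class, so accuracy alone can look high even when few defaulters are caught.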