

Overcoming the class imbalance in modeling the credit default

Roskoshenko V.Vl. Lomonosov Moscow State University (MSU), Moscow, Russian Federation (roskoshenkoeco@mail.ru)

Journal: Finance and Credit, #11, 2019

Subject. In modeling credit default, the banking sector faces class imbalance in its samples. Data pre-processing is traditionally the first choice in bank modeling, since it helps overcome the imbalance. Existing comparative studies of such approaches cover only a few methods or focus on very specific data. Moreover, previous research overlooks approaches that combine data pre-processing with ensemble-based solutions (stacking).
Objectives. The study aims to identify, within each group of approaches, the best option for overcoming class imbalance in bank data on retail lending.
Methods. The study employs mathematical modeling, statistical analysis, and content analysis of sources.
Results. Although mathematically rather complex, the EditedNearestNeighbours approach proved the most suitable for data pre-processing. It removes majority-class observations that are inconsistent with their surrounding environment, which is determined through clustering. Among combinations of data pre-processing and stacking, RandomOverSampler lived up to expectations. It randomly increases the share of the minority class and is the simplest of the approaches.
Conclusions and Relevance. The article presents an exhaustive comparison of approaches to class imbalance in samples. I selected the most appropriate data pre-processing approach and the best combination of data pre-processing with an ensemble-based solution. The findings can be used in credit scoring and statistical modeling whenever binary classification is required.
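
To make the two resampling ideas named in the Results concrete, here is a minimal pure-Python sketch. It is not the article's code and not the imbalanced-learn implementation. The function names, the toy data, and the choice of k = 3 are assumptions made for illustration. The editing step follows the usual nearest-neighbour formulation: a majority-class row is kept only if all of its k nearest neighbours also belong to the majority class.

```python
import random
from collections import Counter


def random_over_sample(X, y, seed=0):
    """Duplicate randomly chosen rows of smaller classes until all classes match the largest."""
    rng = random.Random(seed)
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = list(X), list(y)
    for label, n in counts.items():
        rows = [x for x, lab in zip(X, y) if lab == label]
        for _ in range(target - n):
            X_out.append(rng.choice(rows))
            y_out.append(label)
    return X_out, y_out


def edited_nearest_neighbours(X, y, majority_label, k=3):
    """Drop majority-class rows whose k nearest neighbours are not all majority-class."""
    def sq_dist(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b))

    keep = []
    for i, (xi, yi) in enumerate(zip(X, y)):
        if yi != majority_label:
            keep.append(i)  # minority rows are never removed
            continue
        others = [j for j in range(len(X)) if j != i]
        neighbours = sorted(others, key=lambda j: sq_dist(xi, X[j]))[:k]
        if all(y[j] == majority_label for j in neighbours):
            keep.append(i)
    return [X[i] for i in keep], [y[i] for i in keep]


# Hypothetical toy sample: label 0 = repaid (majority), label 1 = default (minority).
X = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (0.2, 0.1), (0.1, 0.2),
     (1.0, 1.05),                            # majority row sitting among defaults
     (1.0, 1.0), (1.1, 1.0), (1.0, 1.1)]
y = [0, 0, 0, 0, 0, 0, 0, 1, 1, 1]

X_enn, y_enn = edited_nearest_neighbours(X, y, majority_label=0)
X_ros, y_ros = random_over_sample(X, y)
print(Counter(y_enn))  # majority row at (1.0, 1.05) is removed
print(Counter(y_ros))  # minority rows are duplicated up to the majority count
```

On this toy sample the editing step removes only the one majority row that lies inside the minority group, while random oversampling leaves every original row in place and adds four duplicated minority rows. In a stacking setup, either resampler would be applied to the training folds only, so that the evaluation data keeps its original class proportions.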

