Paper
11 March 2022 A comparison of default prediction models trained on different datasets: model-based clustering
Yaozhi Yang
Author Affiliations +
Proceedings Volume 12160, International Conference on Computational Modeling, Simulation, and Data Analysis (CMSDA 2021); 121600I (2022) https://doi.org/10.1117/12.2627614
Event: International Conference on Computational Modeling, Simulation, and Data Analysis (CMSDA 2021), 2021, Sanya, China
Abstract
This paper compares the models trained on the different mortgage markets of different states in the USA. We use the measured performance (AUC) to generate a similarity matrix and then generate clusters. We regard AUC as a similarity between two models and use t-SNE method combined with k-means to construct clusters and visualize them. There are two main findings. One is that the model of MA (Massachusetts) is relatively similar to the model of CA (California) not NY (New York) although MA is next to NY and far from CA. The other is that the crisis in 2008 makes the differences between the model of OK (Oklahoma) and the model of TX (Texas) more obvious.
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Yaozhi Yang "A comparison of default prediction models trained on different datasets: model-based clustering", Proc. SPIE 12160, International Conference on Computational Modeling, Simulation, and Data Analysis (CMSDA 2021), 121600I (11 March 2022); https://doi.org/10.1117/12.2627614
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Data modeling

Model-based design

Performance modeling

Visualization

Machine learning

Statistical modeling

Back to Top