A Region-based Training Data Segmentation Strategy to Credit Scoring

Saia R.;Carta S.;Fenu G.;Pompianu L.
2022-01-01

Abstract

The rating of users requesting financial services is a growing task, especially in this historical period of the COVID-19 pandemic characterized by a dramatic increase in online activities, mainly related to e-commerce. This kind of assessment is a task manually performed in the past that today needs to be carried out by automatic credit scoring systems, due to the enormous number of requests to process. It follows that such systems play a crucial role for financial operators, as their effectiveness is directly related to gains and losses of money. Despite the huge investments in terms of financial and human resources devoted to the development of such systems, the state-of-the-art solutions are transversally affected by some well-known problems that make the development of credit scoring systems a challenging task, mainly related to the unbalance and heterogeneity of the involved data, problems to which it adds the scarcity of public datasets. The Region-based Training Data Segmentation (RTDS) strategy proposed in this work revolves around a divide-and-conquer approach, where the user classification depends on the results of several sub-classifications. In more detail, the training data is divided into regions that bound different users and features, which are used to train several classification models that will lead toward the final classification through a majority voting rule. Such a strategy relies on the consideration that the independent analysis of different users and features can lead to a more accurate classification than that offered by a single evaluation model trained on the entire dataset. The validation process carried out using three public real-world datasets with a different number of features. samples, and degree of data imbalance demonstrates the effectiveness of the proposed strategy. which outperforms the canonical training one in the context of all the datasets.
2022
Inglese
Proceedings of the 19th International Conference on Security and Cryptograph
978-989-758-590-6
SCITEPRESS
AV D MANUELL, 27A 2 ESQ, SETUBAL, 2910-595, PORTUGAL
1
275
282
8
https://www.scitepress.org/Link.aspx?doi=10.5220/0011137400003283
19th International Conference on Security and Cryptograph
Comitato scientifico
11-13 luglioo 2022
Lisbona
internazionale
scientifica
Business Intelligence
Decision Support System
Risk Assessment
Credit Scoring
Machine Learning
no
4 Contributo in Atti di Convegno (Proceeding)::4.1 Contributo in Atti di convegno
Saia, R.; Carta, S.; Fenu, G.; Pompianu, L.
273
4
4.1 Contributo in Atti di convegno
open
info:eu-repo/semantics/conferencePaper
Files in This Item:
File Size Format  
A Region-based Training Data Segmentation Strategy to Credit Scoring.pdf

open access

Type: versione editoriale
Size 343.13 kB
Format Adobe PDF
343.13 kB Adobe PDF View/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Questionnaire and social

Share on:
Impostazioni cookie