A discretized enriched technique to enhance machine learning performance in credit scoring

Saia R.;Carta S.;Reforgiato Recupero D.;Fenu G.;
2019-01-01

Abstract

The automated credit scoring tools play a crucial role in many financial environments, since they are able to perform a real-time evaluation of a user (e.g., a loan applicant) on the basis of several solvency criteria, without the aid of human operators. Such an automation allows who work and offer services in the financial area to take quick decisions with regard to different services, first and foremost those concerning the consumer credit, whose requests have exponentially increased over the last years. In order to face some well-known problems related to the state-of-the-art credit scoring approaches, this paper formalizes a novel data model that we called Discretized Enriched Data (DED), which operates by transforming the original feature space in order to improve the performance of the credit scoring machine learning algorithms. The idea behind the proposed DED model revolves around two processes, the first one aimed to reduce the number of feature patterns through a data discretization process, and the second one aimed to enrich the discretized data by adding several meta-features. The data discretization faces the problem of heterogeneity, which characterizes such a domain, whereas the data enrichment works on the related loss of information by adding meta-features that improve the data characterization. Our model has been evaluated in the context of real-world datasets with different sizes and levels of data unbalance, which are considered a benchmark in credit scoring literature. The obtained results indicate that it is able to improve the performance of one of the most performing machine learning algorithm largely used in this field, opening up new perspectives for the definition of more effective credit scoring solutions.
2019
Inglese
Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management
978-989-758-382-7
SciTePress
1
202
213
12
https://www.scitepress.org/ProceedingsDetails.aspx?ID=T4KTibRgTuo=&t=1
11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2019
Comitato scientifico
17-19 September 2019
Vienna, Austria
internazionale
scientifica
Algorithms; Business intelligence; Credit scoring; Decision support system; Machine learning
no
4 Contributo in Atti di Convegno (Proceeding)::4.1 Contributo in Atti di convegno
Saia, R.; Carta, S.; Reforgiato Recupero, D.; Fenu, G.; Saia, M.
273
5
4.1 Contributo in Atti di convegno
reserved
info:eu-repo/semantics/conferencePaper
Files in This Item:
File Size Format  
KDIR_2019_73_CR.pdf

Solo gestori archivio

Type: versione pre-print
Size 176.24 kB
Format Adobe PDF
176.24 kB Adobe PDF & nbsp; View / Open   Request a copy

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Questionnaire and social

Share on:
Impostazioni cookie