Feature Selection on Imbalanced Domains: A Stability-Based Analysis

Pes B.
2023-01-01

Abstract

A large body of literature has shown the beneficial impact of feature selection on the efficiency, interpretability, and generalization ability of machine learning models. Most of the existing studies, however, focus on the effectiveness of feature selection algorithms in identifying small subsets of predictive features, often neglecting the stability of the selection process, i.e., its robustness with respect to sample variation, which can be crucial for the actual exploitation of the results. In particular, little research has so far investigated the stability of feature selection methods in class-imbalanced domains, where some classes are underrepresented and any perturbation in the set of training records can strongly affect the final selection outcome. This work aims to investigate this important issue by studying the stability of different selection algorithms across high-dimensional datasets that present different levels of class imbalance. To this end, a methodological pipeline is discussed which allows a joint evaluation of the selection outcome both in terms of stability and final predictive performance. Although not exhaustive, our experiments provide very useful insight into which methods can be more stable on imbalanced data while still ensuring good generalization results.
2023
Inglese
Advances and Trends in Artificial Intelligence. Theory and Applications. IEA/AIE 2023.
9783031368189
9783031368196
Springer
13925 LNAI
14
27
14
https://link.springer.com/chapter/10.1007/978-3-031-36819-6_2
International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems (IEA-AIE 2023)
Esperti anonimi
July 19 - July 22, 2023
online conference
internazionale
scientifica
Machine Learning, Feature Selection, Selection Stability, High-dimensional Data, Genomic Data, Class Imbalance
no
4 Contributo in Atti di Convegno (Proceeding)::4.1 Contributo in Atti di convegno
Orru, P.; Pes, B.
273
2
4.1 Contributo in Atti di convegno
none
info:eu-repo/semantics/conferencePaper
Files in This Item:
There are no files associated with this item.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Questionnaire and social

Share on:
Impostazioni cookie