Ensemble feature selection for high-dimensional data: a stability analysis across multiple domains

Pes, Barbara
First
2020-01-01

Abstract

Selecting a subset of relevant features is crucial to the analysis of high-dimensional datasets coming from a number of application domains, such as biomedical data, document and image analysis. Since no single selection algorithm seems to be capable of ensuring optimal results in terms of both predictive performance and stability (i.e. robustness to changes in the input data), researchers have increasingly explored the effectiveness of “ensemble” approaches involving the combination of different selectors. While interesting proposals have been reported in the literature, most of them have been so far evaluated in a limited number of settings (e.g. with data from a single domain and in conjunction with specific selection approaches), leaving unanswered important questions about the large-scale applicability and utility of ensemble feature selection. To give a contribution to the field, this work presents an empirical study which encompasses different kinds of selection algorithms (filters and embedded methods, univariate and multivariate techniques) and different application domains. Specifically, we consider 18 classification tasks with heterogeneous characteristics (in terms of number of classes and instances-to-features ratio) and experimentally evaluate, for feature subsets of different cardinalities, the extent to which an ensemble approach turns out to be more robust than a single selector, thus providing useful insight for both researchers and practitioners.
2020
2019
Inglese
32
10
5951
5973
23
https://link.springer.com/article/10.1007/s00521-019-04082-3
Esperti anonimi
internazionale
scientifica
Ensemble approaches; Feature selection; High-dimensional data analysis; Stability of feature selection algorithms
no
Pes, Barbara
1.1 Articolo in rivista
info:eu-repo/semantics/article
1 Contributo su Rivista::1.1 Articolo in rivista
262
1
open
Files in This Item:
File Size Format  
Pes2020_Article_EnsembleFeatureSelectionForHig.pdf

open access

Description: Articolo principale
Type: versione editoriale
Size 4.29 MB
Format Adobe PDF
4.29 MB Adobe PDF View/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Questionnaire and social

Share on:
Impostazioni cookie