Evaluating feature selection robustness on high-dimensional data

Pes B.
First
2018-01-01

Abstract

With the explosive growth of high-dimensional data, feature selection has become a crucial step in machine learning tasks. Though most of the available works focus on devising selection strategies that are effective in identifying small subsets of predictive features, recent research has also highlighted the importance of investigating the robustness of the selection process with respect to sample variation. In the presence of a high number of features, indeed, the selection outcome can be very sensitive to any perturbation in the set of training records, which limits the interpretability of the results and their subsequent exploitation in real-world applications. This study aims to provide more insight into this critical issue by analysing the robustness of some state-of-the-art selection methods, for different levels of data perturbation and different cardinalities of the selected feature subsets. Furthermore, we explore the extent to which the adoption of an ensemble selection strategy can make these algorithms more robust, without compromising their predictive performance. The results on five high-dimensional datasets, which are representative of different domains, are presented and discussed.
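
As an illustration of what robustness to sample variation can mean in practice, the following minimal sketch selects the top-k features on several random subsamples of the training records and averages the pairwise overlap of the selected subsets. The abstract does not name the similarity measure or the selection methods used in the study, so the Jaccard index, the mean-difference ranking and all function and parameter names here are assumptions made purely for illustration.

```python
import random
from itertools import combinations


def top_k_by_mean_difference(rows, labels, k):
    """Hypothetical stand-in selector: rank features by the absolute
    difference between the class means (binary labels 0/1)."""
    n_features = len(rows[0])
    scores = []
    for j in range(n_features):
        pos = [r[j] for r, y in zip(rows, labels) if y == 1]
        neg = [r[j] for r, y in zip(rows, labels) if y == 0]
        if not pos or not neg:
            scores.append(0.0)  # one class missing: no evidence for this feature
        else:
            scores.append(abs(sum(pos) / len(pos) - sum(neg) / len(neg)))
    ranked = sorted(range(n_features), key=lambda j: scores[j], reverse=True)
    return frozenset(ranked[:k])


def jaccard(a, b):
    """Overlap of two feature subsets: |a & b| / |a | b|."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def selection_robustness(rows, labels, k, fraction=0.8, n_runs=10, seed=0):
    """Average pairwise Jaccard similarity of the top-k subsets selected
    on n_runs random subsamples, each keeping `fraction` of the records."""
    if n_runs < 2:
        raise ValueError("n_runs must be at least 2 to compare subsets")
    rng = random.Random(seed)
    n = len(rows)
    m = max(1, int(fraction * n))
    subsets = []
    for _ in range(n_runs):
        idx = rng.sample(range(n), m)  # perturb the training set
        subsets.append(
            top_k_by_mean_difference(
                [rows[i] for i in idx], [labels[i] for i in idx], k
            )
        )
    pairs = list(combinations(subsets, 2))
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)
```

Lowering `fraction` corresponds to a stronger perturbation of the data, and varying `k` corresponds to a different cardinality of the selected subset; a value near 1 indicates that the same features are selected regardless of which records are sampled.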
2018
English
Hybrid Artificial Intelligent Systems
978-3-319-92638-4
978-3-319-92639-1
Springer
Francisco Javier de Cos Juez, José Ramón Villar, Enrique A. de la Cal, Álvaro Herrero, Héctor Quintián, José António Sáez, Emilio Corchado
10870
235
247
13
https://link.springer.com/chapter/10.1007/978-3-319-92639-1_20
13th International Conference on Hybrid Artificial Intelligent Systems (HAIS 2018)
Contribution
Anonymous experts
20-22 June 2018
Oviedo, Spain
international
scientific
Feature selection robustness; Ensemble techniques; High-dimensional data
no
4 Contribution in Conference Proceedings (Proceeding)::4.1 Contribution in Conference Proceedings
Pes, B.
273
1
4.1 Contribution in Conference Proceedings
reserved
info:eu-repo/semantics/conferencePaper
Files in This Item:
File: HAIS2018.pdf
Access: Archive managers only
Type: post-print version
Size: 398.75 kB
Format: Adobe PDF

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
