
Feature Selection for high-dimensional data: the issue of stability

PES, BARBARA

2017-01-01

Abstract

Feature selection has become a necessary step in the analysis of high-dimensional datasets coming from several application domains (e.g., web data, document and image analysis, biological data). Though well-established methods exist to select highly discriminative features, discarding the ones that may be either redundant or irrelevant to the problem at hand, little attention has so far been given to the stability of these methods in cases where the composition of the original dataset is perturbed to some extent (e.g., by adding new records or by random sampling). In this work, we highlight the importance of jointly considering both stability and predictive performance when the selection results are used for knowledge discovery and domain understanding. As a case study, we consider five popular feature selection algorithms, representative of different selection approaches, and experimentally investigate their behaviour across three different domains: Internet advertisements, text categorization and biomedical data classification. Useful insight into the “intrinsic” stability of each algorithm seems to emerge, despite the peculiar characteristics of each dataset.
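To make the notion of stability under dataset perturbation more concrete, the following is a minimal sketch, not the procedure used in the paper. It assumes perturbation by random subsampling, a toy univariate filter (difference of class means) in place of the five algorithms studied, and the average pairwise Jaccard similarity of the selected subsets as the stability measure. The abstract does not name a measure, so that choice, like every function name and parameter below, is hypothetical.

```python
import random
from itertools import combinations


def top_k_features(rows, labels, k):
    """Toy univariate filter (an assumption, not one of the paper's
    algorithms): rank features by |mean(class 1) - mean(class 0)|."""
    n_features = len(rows[0])
    scores = []
    for j in range(n_features):
        pos = [r[j] for r, y in zip(rows, labels) if y == 1]
        neg = [r[j] for r, y in zip(rows, labels) if y == 0]
        if not pos or not neg:
            # A subsample with a single class carries no ranking signal.
            scores.append(0.0)
            continue
        scores.append(abs(sum(pos) / len(pos) - sum(neg) / len(neg)))
    ranked = sorted(range(n_features), key=lambda j: scores[j], reverse=True)
    return frozenset(ranked[:k])


def jaccard(a, b):
    """Jaccard similarity of two feature subsets (1.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def selection_stability(rows, labels, k, n_subsamples=10, fraction=0.8, seed=0):
    """Average pairwise Jaccard similarity of the top-k subsets selected
    on random subsamples (drawn without replacement) of the dataset."""
    if n_subsamples < 2:
        raise ValueError("at least two subsamples are needed to compare subsets")
    rng = random.Random(seed)
    n = len(rows)
    size = max(2, int(fraction * n))
    subsets = []
    for _ in range(n_subsamples):
        idx = rng.sample(range(n), size)
        subsets.append(
            top_k_features([rows[i] for i in idx], [labels[i] for i in idx], k)
        )
    pairs = list(combinations(subsets, 2))
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)
```

A value close to 1 means that nearly the same features are selected whichever subsample is drawn, while a value close to 0 means the selected subsets barely overlap. In the spirit of the abstract, such a score would be read alongside predictive performance rather than instead of it.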


Year: 2017

ISBN: 978-1-5386-1759-5

Keywords: High-dimensional data; Feature selection; Feature selection stability; Knowledge discovery

Type: 4.1 Contribution in conference proceedings

Files in this item:

File	Description	Type	Access	Size	Format
wetice2017.pdf	Main article	Post-print version	Archive managers only	338.98 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Università degli Studi di Cagliari
