Assessing similarity of feature selection techniques in high-dimensional domains

DESSI, NICOLETTA;PES, BARBARA
2013-01-01

Abstract

Recent research efforts attempt to combine multiple feature selection techniques instead of using a single one. However, this combination is often made on an “ad hoc” basis, depending on the specific problem at hand, without considering the degree of diversity/similarity of the involved methods. Moreover, though it is recognized that different techniques may return quite dissimilar outputs, especially in high dimensional/small sample size domains, few direct comparisons exist that quantify these differences and their implications on classification performance. This paper aims to provide a contribution in this direction by proposing a general methodology for assessing the similarity between the outputs of different feature selection methods in high dimensional classification problems. Using as benchmark the genomics domain, an empirical study has been conducted to compare some of the most popular feature selection methods, and useful insight has been obtained about their pattern of agreement.
2013
Inglese
34
12
1446
1453
8
http://www.sciencedirect.com/science/article/pii/S0167865513002018
Esperti anonimi
internazionale
scientifica
Feature Selection, Similarity Measures, High-Dimensional Data
no
Cannas, Lm; Dessi, Nicoletta; Pes, Barbara
1.1 Articolo in rivista
info:eu-repo/semantics/article
1 Contributo su Rivista::1.1 Articolo in rivista
262
3
open
Files in This Item:
File Size Format  
PRL2013_accepted_manuscript.pdf

open access

Description: Articolo principale
Type: versione post-print
Size 737.91 kB
Format Adobe PDF
737.91 kB Adobe PDF View/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Questionnaire and social

Share on:
Impostazioni cookie