A Progressive Filtering Approach to Hierarchical Text Categorization

ARMANO, GIULIANO;VARGIU, ELOISA
2008-01-01

Abstract

The continuous growth of available document collections makes text categorization a challenging task to be investigated. Real-world scenarios are typically characterized by a huge amount of non relevant documents with respect to the documents a user is really looking for. In this paper, we investigate how the ratio between relevant and non relevant documents affects the performances of a classifier system. In particular, to counterbalance the negative impact of imbalanced inputs, we propose a novel progressive filtering technique. Performed on the RCV1-v2 benchmark, experiments confirm the validity of the approach.
2008
5
28
32
5
Addis, A; Armano, Giuliano; Vargiu, Eloisa
1.1 Articolo in rivista
info:eu-repo/semantics/article
1 Contributo su Rivista::1.1 Articolo in rivista
262
3
none
Files in This Item:
There are no files associated with this item.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Questionnaire and social

Share on:
Impostazioni cookie