Manuela Sanguinetti

Long-term social media data collection at the University of Turin

Sanguinetti Manuela
2019-01-01

Abstract

We report on the collection of Italian-language social media messages, from Twitter in particular, that has been ongoing continuously at the University of Turin since 2012. A number of smaller datasets have been extracted from the main collection and enriched with different kinds of annotations for linguistic purposes. Moreover, a few additional datasets have been collected independently and are now being merged with the main collection. We aim to make the resource available to the community to the extent possible, in accordance with the Terms of Service of the platforms from which the data have been gathered.
2019
English
Proceedings of the Fifth Italian Conference on Computational Linguistics CLiC-it 2018
9788831978682
Accademia University Press
Torino
ITALY
[Ekaterina Abramova, et al.]
2253
40
45
6
5th Italian Conference on Computational Linguistics, CLiC-it 2018
Scientific committee
10-12 December 2018
Torino
national
scientific
no
4 Contribution in Conference Proceedings (Proceeding)::4.1 Contribution in conference proceedings
Basile, Valerio; Lai, Mirko; Sanguinetti, Manuela
273
3
4.1 Contribution in conference proceedings
open
info:eu-repo/semantics/conferencePaper
Files in this record:
File Size Format
clic2018_twita.pdf

open access

Description: paper online
Type: publisher's version (VoR)
Size: 234.45 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.
