$content.nome.text $content.cognome.text

Word-level and higher level annotation of the Sardinian Medieval Corpus

Nicoletta Puddu;STEIN, ACHIM

2018-01-01

Abstract

This paper is about the Sardinian Medieval Corpus (SMC), the first linguistically annotated digital resource of Medieval Sardinian. The first part presents the textual and linguistic characteristics and discusses them in the light of the problems they pose for both manual and automatic annotation. The second part describes the development of the first computational tools for the analysis of Medieval Sardinian, on the word level (lemmatization and part-of-speech tagging) and on the syntactic level (dependency parsing). It is shown how the manual and the automatic approach can be combined to build an annotated database effeciently, even for medieval texts.

Short Card

Tab complete

Full Sheet(DC)

         Anno 
       
        2018 
       
         Codice ISBN 
       
        9783901716430 
       
         Parole chiave 
       
        Historical corpora; Sardinian; Digital humanities 
       
         Type: 
       
        4.1 Contributo in Atti di convegno

Files in This Item:

File	Size	Format
Puddu_Stein_CRH2.pdf Solo gestori archivio Type: versione editoriale Size 222.17 kB Format Adobe PDF & nbsp; View / Open Request a copy	222.17 kB	Adobe PDF	& nbsp; View / Open Request a copy

University of Cagliari

University of Cagliari

Word-level and higher level annotation of the Sardinian Medieval Corpus

Nicoletta Puddu;STEIN, ACHIM

2018-01-01

Abstract

Short Card

Tab complete

Full Sheet(DC)

Word-level and higher level annotation of the Sardinian Medieval Corpus

Nicoletta Puddu;STEIN, ACHIM

2018-01-01

Abstract

Short Card Tab complete Full Sheet(DC)

Questionnaire and social

Short Card

Tab complete

Full Sheet(DC)