Sharing the TIGR corpus of spoken Italian: an ORD case study

Sharing the TIGR corpus of spoken Italian:
an ORD case study

For the study of spoken varieties of the Italian language, several corpora have been gathered from the 1990s onwards and have partially been made available on websites and DVDs. The TIGR corpus of spoken Italian, which was collected in Southern Switzerland in 2021 and 2022 within the InfinIta project (SNF grant no. 192771), is a unique language resource in this panorama because of the regional varieties of Italian it documents and because it includes not only audio data, transcripts and sociolinguistic data, but also video recordings. The goal of ShareTIGR is (a) to share this rather large dataset (23.5 hours of recordings, 115 speakers) for scientific use, respecting FAIR principles and data protection; (b) to discuss the various phases of this process as a case study of open research data practices in linguistics, engaging with potentially interested communities via scientific presentations and publications and via a lab blog and social media.

Network

Institute of Italian Studies
Università della Svizzera italiana
West Campus, Main Building
Via Buffi 13
6900 Lugano, Switzerland
tel +41 58 666 42 95
e-mail isi@usi.ch

Stay in touch

Team

Corpus

Blog

Publications

Contacts

Quicklinks

Share

Print

Stay in touch