OLAC Record
oai:dspace-clarin-it.ilc.cnr.it:000-c0-111/1049

Metadata
Title:KIParla - KIPasti transcripts
Bibliographic Citation:http://hdl.handle.net/20.500.11752/OPEN-1049
Creator:Mauri, Caterina
Ballarè, Silvia
Zucchini, Eleonora
Date (W3CDTF):2025-10-07T05:23:59Z
Date Available:2025-10-07T05:23:59Z
Description:The KIPasti corpus is part of the larger KIParla collection (www.kiparla.it), which can be freely queried through the NoSketch Engine interface. The ParlaBO corpus was compiled within the framework of the “DiverSIta – Diversity in spoken Italian” project, funded by the Italian Ministry of University and Research (MUR) (PRIN 2022 PNRR Call). It consists of over 40 hours of spoken data collected in thirteen different Italian regions (Abruzzo, Basilicata, Calabria, Campania, Emilia-Romagna, Lazio, Lombardy, Marche, Apulia, Sardinia, Tuscany, Umbria, Veneto) during mealtime conversations, generally within family settings. The interactions, recorded between 2020 and 2024, involved 145 speakers with different origins, ages, education levels, and occupations. Italian is predominantly used in all interactions, but most of them (78%) also contain passages in dialect. The transcriptions have been anonymized. Overall, the module is made up of 63 conversations.
This repository contains:
- metadata for both speakers (occupation, gender, age, origin, L1, educational achievement) and conversations (collection point, year, languages used), in the metadata subfolder
- descriptions of the set of transcription conventions used for this module
- for each conversation:
  - .eaf file in eaf/ folder (time-aligned Jefferson-style transcriptions)
  - .txt file in linear-jefferson/ folder (linearized Jefferson-style transcription)
  - .txt file in linear-orthographic/ folder (linearized transcription retaining only orthographic words)
  - .tsv file in tsv/ folder (tokenised version of the transcription)
More information can be found in the README.md file.
Due to GDPR restrictions, pseudo-anonymized audio files (MP3) are available under a restricted-access license. To request access, please contact the corpus coordinators through the KIParla website and follow the provided procedure.
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Identifier (URI):http://hdl.handle.net/20.500.11752/OPEN-1049
Language:Italian
Language (ISO639):ita
Publisher:Alma Mater Studiorum – Università di Bologna
Rights:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
http://creativecommons.org/licenses/by-nc-sa/4.0/
Subject:kitchen-table conversations
spontaneous speech
human-human spoken dialogues
spoken Italian
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  ILC-CNR for CLARIN-IT repository hosted at Institute for Computational Linguistics "A. Zampolli", National Research Council, in Pisa
Description:  http://www.language-archives.org/archive/dspace-clarin-it.ilc.cnr.it
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:dspace-clarin-it.ilc.cnr.it:000-c0-111/1049
DateStamp:  2025-10-07
GetRecord:  OAI-PMH request for simple DC format
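
The GetRecord entries in this record correspond to OAI-PMH requests for a single identifier. As a minimal sketch of how such a request URL is assembled, the Python snippet below builds the query string from the OaiIdentifier shown above. The base URL is a hypothetical placeholder, since this record does not state the repository's OAI-PMH endpoint, and the prefixes `olac` and `oai_dc` are the conventional names for the OLAC and simple DC formats, assumed here rather than confirmed by the record.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: this record does not give the repository's OAI-PMH base URL.
BASE_URL = "https://example.org/oai/request"

# Identifier copied from the OaiIdentifier field of this record.
OAI_IDENTIFIER = "oai:dspace-clarin-it.ilc.cnr.it:000-c0-111/1049"


def get_record_url(identifier: str, metadata_prefix: str, base_url: str = BASE_URL) -> str:
    """Build an OAI-PMH GetRecord request URL for one record."""
    query = urlencode({
        "verb": "GetRecord",            # OAI-PMH verb for fetching a single record
        "identifier": identifier,       # percent-encoded by urlencode
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


# Assumed prefixes: "olac" for OLAC format, "oai_dc" for simple Dublin Core.
olac_url = get_record_url(OAI_IDENTIFIER, "olac")
dc_url = get_record_url(OAI_IDENTIFIER, "oai_dc")
```

Replacing `BASE_URL` with the archive's actual endpoint would yield the two requests; the response is an XML document that can be parsed with any standard XML library.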

Search Info

Citation: Mauri, Caterina; Ballarè, Silvia; Zucchini, Eleonora. 2025. Alma Mater Studiorum – Università di Bologna.
Terms: area_Europe country_IT dcmi_Text iso639_ita olac_primary_text


http://www.language-archives.org/item.php/oai:dspace-clarin-it.ilc.cnr.it:000-c0-111/1049
Up-to-date as of: Wed Oct 8 0:33:10 EDT 2025