OLAC Record oai:dspace-clarin-it.ilc.cnr.it:20.500.11752/ILC-1001 |
Metadata | ||
Title: | Corpus Parole (3 milions words) | |
Bibliographic Citation: | http://hdl.handle.net/20.500.11752/ILC-1001 | |
Creator: | Sara Goggi, Sara Goggi remo Bindi, Lisa Biagini e Sergio Rossi | |
Date (W3CDTF): | 2023-07-24T12:45:54Z | |
Date Available: | 2023-07-24T12:45:54Z | |
Description: | The PAROLE project (Preparatory Action for Linguistic Resources Organization for Language Engineering) has produced a set of harmonized corpora and lexicons for a large number of European languages. Each corpus, made up of 20 million words, was built up as reference corpus for Human Language Technology applications, to provide full information about a large variety of text types in the language considered, to represent the use of contemporary language and to become the first nucleus of an electronic text library. The texts have been stored using a common format following the standards recommended in the CES (Corpus Encoding Standard), according to flexibility and multifunctionality criteria. The texts belong to a wide range of media and genres, selected in proportions aimed at reflecting their prominence within the society, classified according to medium, genre, topic and time of production. | |
Identifier (URI): | http://hdl.handle.net/20.500.11752/ILC-1001 | |
Language: | Italian | |
Language (ISO639): | ita | |
Publisher: | Istituto di Linguistica Computazionale “A. Zampolli” - Consiglio Nazionale delle Ricerche (ILC-CNR) | |
Rights: | Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) | |
http://creativecommons.org/licenses/by-nc-nd/4.0/ | ||
Subject: | Corpus | |
Corpus linguistics | ||
Databases | ||
Type: | corpus | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | ILC-CNR for CLARIN-IT repository hosted at Institute for Computational Linguistics "A. Zampolli", National Research Council, in Pisa | |
Description: | http://www.language-archives.org/archive/dspace-clarin-it.ilc.cnr.it | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:dspace-clarin-it.ilc.cnr.it:20.500.11752/ILC-1001 | |
DateStamp: | 2023-07-24 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Sara Goggi, Sara Goggi remo Bindi, Lisa Biagini e Sergio Rossi. 2023. Istituto di Linguistica Computazionale “A. Zampolli” - Consiglio Nazionale delle Ricerche (ILC-CNR). | |
Terms: | area_Europe country_IT dcmi_Text iso639_ita olac_primary_text |