OLAC Record
oai:catalogue.elra.info:ELRA-W0128

Metadata
Title:ECPC Corpus (European Comparable and Parallel Corpora of Parliamentary Speeches Archive) – set 1
Access Rights: Rights available for: attribution
Date Available (W3CDTF):2018-12-21
Date Issued (W3CDTF):2018-12-21
Description:The European Comparable and Parallel Corpora of Parliamentary Speeches Archive (ECPC), compiled at the Universitat Jaume I (Spain), is a collection of XML metatextually tagged corpora containing speeches from three European chambers (the European Parliament, the British House of Commons, and the Spanish Congreso de los Diputados). It is a bilingual, bidirectional written corpus in English and Spanish described by Zanettin (2012). This first set (ECPC_EP-05) consists of (1) a "clean" version in XML of European Parliament's 2005 daily sessions; (2) a POS-tagged version of the 2005 daily sessions; and (3) a sentence-based aligned version of 2005 daily sessions. In its raw format, ECPC_EP-05 contains 3,668,476 tokens/words (excluding tagging) in English distributed over 60 utf-8 files and 3,993,867 tokens/words (excluding tagging) in Spanish distributed over 60 utf-8 files.ECPC_EP-05 by MARÍA CALZADA PÉREZ (as coordinator of the ECPC Research Group, Universitat Jaume I, Spain) is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC-BY-NC-SA 4.0: http://creativecommons.org/licenses/by-nc-sa/4.0). All corpora in the ECPC Archive have been funded by: Universitat Jaume I (UJI-B2017-25 P1·1B2012-64); Generalitat Valenciana (AICO/2017/082): Ministerio de Educación, Cultura y Deporte (FFI2008-01610/FILO; HUM2005-03756/FILO).
Identifier:ELRA-W0128
ISLRN: 036-939-425-010-1
Identifier (URI):https://catalog.elra.info/en-us/repository/browse/ELRA-W0128/
Language:English
Spanish; Castilian
Language (ISO639):eng
spa
Medium:downloadable
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-W0128
DateStamp:  2018-12-21
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2018. ELRA (European Language Resources Association).
Terms: area_Europe country_ES country_GB dcmi_Text iso639_eng iso639_spa olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-W0128
Up-to-date as of: Fri Apr 19 6:29:53 EDT 2024