OLAC Record
oai:www.clarin.si:11356/1167

Metadata
Title:Slovenian parliamentary corpus SlovParl 2.0
Bibliographic Citation:http://hdl.handle.net/11356/1167
Creator:Pančur, Andrej
Šorn, Mojca
Erjavec, Tomaž
Date (W3CDTF):2017-11-24T20:28:49Z
Date Available:2017-11-24T20:28:49Z
Description:The SlovParl corpus contains minutes of the Assembly of the Republic of Slovenia for the legislative period 1990-1992, i.e. it covers the period before, during, and after Slovenia became an independent country in 1991. The corpus comprises 232 sessions, 58,813 speeches and 10.8 million words. The corpus contains extensive meta-data about the speakers, a typology of sessions etc. and structural and editorial annotations. This item comprises three datasets: - the corpus in TEI (module Transcriptions of speech); - the corpus in TEI with added automatic linguistic annotation: tokenisation, MSD tagging and lemmatisation; - the corpus in vertical format used by various concordancers, e.g. CWB and Sketch Engine; this format is simpler and smaller but does not contain all the information from the source TEI. The SlovParl data originally come from https://github.com/SIstory/SlovParl, but have been converted to use TEI elements for speech. The first version of this resource is presented in the paper: Pančur, Andrej. "Označevanje zbirke zapisnikov sej slovenskega parlamenta s smernicami TEI." In the Proceedings of the Conference on Language Technologies & Digital Humanities (Tomaž Erjavec and Darja Fišer, eds.) 142-148. Ljubljana: Znanstvena založba Filozofske fakultete v Ljubljani, 2016.
Identifier (URI):http://hdl.handle.net/11356/1167
Language:Slovenian
Language (ISO639):slv
Publisher:Institute of Contemporary History
Replaces (URI):http://hdl.handle.net/11356/1075
Rights:Creative Commons - Attribution 4.0 International (CC BY 4.0)
https://creativecommons.org/licenses/by/4.0/
Subject:Slovenian Parliament
parliamentary debates
TEI
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  Slovenian language resource repository CLARIN.SI
Description:  http://www.language-archives.org/archive/clarin.si
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.clarin.si:11356/1167
DateStamp:  2018-11-14
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Pančur, Andrej; Šorn, Mojca; Erjavec, Tomaž. 2017. Institute of Contemporary History.
Terms: area_Europe country_SI dcmi_Text iso639_slv olac_primary_text


http://www.language-archives.org/item.php/oai:www.clarin.si:11356/1167
Up-to-date as of: Thu Dec 5 9:50:20 EST 2019