OLAC Record

Title:Slovenian parliamentary corpus siParl 1.0 (1990-2018)
Bibliographic Citation:http://hdl.handle.net/11356/1236
Creator:Pančur, Andrej
Erjavec, Tomaž
Ojsteršek, Mihael
Šorn, Mojca
Blaj Hribar, Neja
Date (W3CDTF):2019-05-03T18:35:55Z
Date Available:2019-05-03T18:35:55Z
Description:The siParl corpus contains minutes of the Assembly of the Republic of Slovenia for 11th legislative period 1990-1992, minutes of the National Assembly of the Republic of Slovenia from the 1st to the 7th legislative period 1992-2018, minutes of the working bodies of the National Assembly of the Republic of Slovenia from the 2nd to the 7th legislative period 1996-2018, and minutes of the the Council of the President of the National Assembly from the 2nd to the 7th legislative period 1996-2018. The corpus comprises over a million speeches or 195 million words. The corpus contains basic meta-data about the speakers, a typology of sessions etc. and structural and editorial annotations. This item comprises three datasets: - the corpus in TEI (module Transcriptions of speech); - the corpus in TEI with added automatic linguistic annotation: tokenisation, MSD tagging and lemmatisation; - the linguisticaly annotated corpus in vertical format used by various concordancers, e.g. CWB and Sketch Engine; this format is simpler and smaller but does not contain all the information from the source TEI. A preliminary version of this resource is presented in the paper: Pančur, Andrej, Mojca Šorn and Tomaž Erjavec (2018). "SlovParl 2.0: The Collection of Slovene Parliamentary Debates from the Period of Secession." Darja Fišer and Maria Eskevich and Franciska de Jong (eds.), Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan, May 2018. http://lrec-conf.org/workshops/lrec2018/W2/summaries/4_W2.html
Identifier (URI):http://hdl.handle.net/11356/1236
Language (ISO639):slv
Publisher:Institute of Contemporary History
Rights:Creative Commons - Attribution 4.0 International (CC BY 4.0)
Subject:parliamentary debates
Slovenian Parliament
Type (DCMI):Text
Type (OLAC):primary_text


Archive:  Slovenian language resource repository CLARIN.SI
Description:  http://www.language-archives.org/archive/clarin.si
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.clarin.si:11356/1236
DateStamp:  2019-05-03
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Pančur, Andrej; Erjavec, Tomaž; Ojsteršek, Mihael; Šorn, Mojca; Blaj Hribar, Neja. 2019. Institute of Contemporary History.
Terms: area_Europe country_SI dcmi_Text iso639_slv olac_primary_text

Up-to-date as of: Tue Aug 20 10:27:25 EDT 2019