OLAC Record
oai:www.clarin.si:11356/1034

Metadata
Title:Written corpus ccKres 1.0
Bibliographic Citation:http://hdl.handle.net/11356/1034
Creator:Logar, Nataša
Erjavec, Tomaž
Krek, Simon
Grčar, Miha
Holozan, Peter
Date (W3CDTF):2015-06-01T08:57:14Z
Date Available:2015-06-01T08:57:14Z
Description:Corpus ccKres consists of 9,376 documents, each containing information about the source (e.g. newspapers, magazines), year of publication, text type (fiction, newspaper), the title and author if they are known. The corpus is POS-tagged and lemmatised, and encoded in XML TEI format (Text Encoding Initiative P5). The ccKres corpus contains approximately 9% of the Kres corpus, a balanced corpus of Slovene: http://eng.slovenscina.eu/korpusi/kres.
Identifier (URI):http://hdl.handle.net/11356/1034
Language:Slovenian
Language (ISO639):slv
Publisher:Centre for Language Resources and Technologies, University of Ljubljana
Rights:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
https://creativecommons.org/licenses/by-nc-sa/4.0/
Subject:TEI
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  Slovenian language resource repository CLARIN.SI
Description:  http://www.language-archives.org/archive/clarin.si
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.clarin.si:11356/1034
DateStamp:  2017-09-29
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Logar, Nataša; Erjavec, Tomaž; Krek, Simon; Grčar, Miha; Holozan, Peter. 2015. Centre for Language Resources and Technologies, University of Ljubljana.
Terms: area_Europe country_SI dcmi_Text iso639_slv olac_primary_text


http://www.language-archives.org/item.php/oai:www.clarin.si:11356/1034
Up-to-date as of: Wed Jul 17 9:50:17 EDT 2019