OLAC Record
oai:www.clarin.si:11356/1046

Metadata
Title:Gos corpus n-grams 1.0
Bibliographic Citation:http://hdl.handle.net/11356/1046
Creator:Dobrovoljc, Kaja
Date (W3CDTF):2015-08-01T13:55:47Z
Date Available:2015-08-01T13:55:47Z
Description:This is a collection of n-grams extracted from the Gos corpus of spoken Slovene (http://hdl.handle.net/11356/1040). In addition to the separate lists of n-grams for tokens and their attributes (normalized form, morphosyntactic tag, lemma), an adjusted frequency list with statistical substring reduction has also been added (as described in O'Donnell 2011). Only n-grams within sentences have been counted.
Identifier (URI):http://hdl.handle.net/11356/1046
Is Replaced By (URI):http://hdl.handle.net/11356/1195
Language:Slovenian
Language (ISO639):slv
Publisher:Trojina, Institute for Applied Slovene Studies
Faculty of Arts, University of Ljubljana
Rights:Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
https://creativecommons.org/licenses/by-sa/4.0/
Subject:n-grams
wordlist
multiword expressions
Slovenian language
Subject (ISO639):slv
Type:lexicalConceptualResource
Type (DCMI):Text
Type (OLAC):lexicon

OLAC Info

Archive:  Slovenian language resource repository CLARIN.SI
Description:  http://www.language-archives.org/archive/clarin.si
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.clarin.si:11356/1046
DateStamp:  2018-08-03
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Dobrovoljc, Kaja. 2015. Trojina, Institute for Applied Slovene Studies.
Terms: area_Europe country_SI dcmi_Text iso639_slv olac_lexicon

Inferred Metadata

Country: Slovenia
Area: Europe


http://www.language-archives.org/item.php/oai:www.clarin.si:11356/1046
Up-to-date as of: Thu Dec 5 9:50:04 EST 2019