OLAC Record
oai:www.clarin.si:11356/1227

Metadata
Title:  Corpus extraction tool LIST 1.0
Bibliographic Citation:  http://hdl.handle.net/11356/1227
Creator:  Krsnik, Luka; Arhar Holdt, Špela; Čibej, Jaka; Dobrovoljc, Kaja; Ključevšek, Aleksander; Krek, Simon; Robnik-Šikonja, Marko
Date (W3CDTF):  2019-03-27T12:41:31Z
Date Available:  2019-03-27T12:41:31Z
Description:  The LIST corpus extraction tool is a Java program that extracts lists from text corpora at the level of characters, word parts, words, and word sets. It reads corpora in the VERT and TEI P5 XML formats and writes CSV files that can be imported into Microsoft Excel or statistical processing software.
Identifier (URI):  http://hdl.handle.net/11356/1227
Language:  Slovenian; English
Language (ISO639):  slv; eng
Publisher:  Centre for Language Resources and Technologies, University of Ljubljana; Faculty of Computer and Information Science, University of Ljubljana; Jožef Stefan Institute
Rights:  Apache License 2.0; http://opensource.org/licenses/Apache-2.0
Subject:  corpus linguistics; text processing; extraction; characters; word parts; words; word sets; n-grams; morphology
Type:  toolService
Type (DCMI):  Software

OLAC Info

Archive:  Slovenian language resource repository CLARIN.SI
Description:  http://www.language-archives.org/archive/clarin.si
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.clarin.si:11356/1227
DateStamp:  2019-05-08
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Krsnik, Luka; Arhar Holdt, Špela; Čibej, Jaka; Dobrovoljc, Kaja; Ključevšek, Aleksander; Krek, Simon; Robnik-Šikonja, Marko. 2019. Centre for Language Resources and Technologies, University of Ljubljana.
Terms: area_Europe country_GB country_SI dcmi_Software iso639_eng iso639_slv


http://www.language-archives.org/item.php/oai:www.clarin.si:11356/1227
Up-to-date as of: Tue Aug 20 10:27:25 EDT 2019