OLAC Record
oai:lindat.mff.cuni.cz:11234/LRT-1483

Metadata
Title:Word representations for multiple languages
Bibliographic Citation:http://hdl.handle.net/11234/LRT-1483
Creator:Müller, Thomas
Schütze, Hinrich
Date (W3CDTF):2015-06-08T09:25:01Z
Date Available:2015-06-08T09:25:01Z
Description:Dictionaries with different representations for various languages. Representations include Brown clusters of different sizes and morphological dictionaries extracted using different morphological analyzers. All representations cover the 250,000 most frequent word types in the Wikipedia version of the respective language. Analyzers used: MAGYARLANC (Hungarian, Zsibrita et al. (2013)), FREELING (English and Spanish, Padro and Stanilovsky (2012)), SMOR (German, Schmid et al. (2004)), a morphological analyzer (MA) from Charles University (Czech, Hajic (2001)) and LATMOR (Latin, Springmann et al. (2014)).
Identifier (URI):http://hdl.handle.net/11234/LRT-1483
Language:English
German
Latin
Hungarian
Spanish
Czech
Language (ISO639):eng
deu
lat
hun
spa
ces
Publisher:Center for Information and Language Processing, University of Munich
Rights:Creative Commons - Attribution 3.0 Unported (CC BY 3.0)
http://creativecommons.org/licenses/by/3.0/
Subject:morphological dictionary
morphological analysis
PoS tagging
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11234/LRT-1483
DateStamp:  2016-04-06
GetRecord:  OAI-PMH request for simple DC format
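
The GetRecord entries above refer to OAI-PMH requests for this record. As a minimal sketch, such a request URL can be assembled from the OAI identifier and a metadata prefix. The base URL below is a hypothetical placeholder, since this record does not state the archive's OAI-PMH endpoint; `get_record_url` is an illustrative helper, not part of any library. The prefixes `olac` and `oai_dc` are the usual names for the OLAC and simple DC formats.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: this record does not give the archive's OAI-PMH base URL.
BASE_URL = "https://example.org/oai/request"

# Identifier taken from the OaiIdentifier field of this record.
IDENTIFIER = "oai:lindat.mff.cuni.cz:11234/LRT-1483"


def get_record_url(identifier: str, metadata_prefix: str,
                   base_url: str = BASE_URL) -> str:
    """Build an OAI-PMH GetRecord request URL (illustrative helper)."""
    query = urlencode({
        "verb": "GetRecord",            # OAI-PMH verb for fetching one record
        "identifier": identifier,       # percent-encoded by urlencode
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


olac_url = get_record_url(IDENTIFIER, "olac")    # OLAC format
dc_url = get_record_url(IDENTIFIER, "oai_dc")    # simple Dublin Core format
print(olac_url)
print(dc_url)
```

Replacing `BASE_URL` with the archive's actual endpoint would yield requests equivalent to the two GetRecord links listed in this record.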

Search Info

Citation: Müller, Thomas; Schütze, Hinrich. 2015. Center for Information and Language Processing, University of Munich.
Terms: area_Europe country_CZ country_DE country_ES country_GB country_HU country_VA dcmi_Text iso639_ces iso639_deu iso639_eng iso639_hun iso639_lat iso639_spa olac_primary_text


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11234/LRT-1483
Up-to-date as of: Sat Apr 13 8:52:30 EDT 2019