OLAC Record
oai:lindat.mff.cuni.cz:11858/00-097C-0000-0001-4904-2

Metadata
Title:Feature-based tagger
Bibliographic Citation:http://hdl.handle.net/11858/00-097C-0000-0001-4904-2
Creator:Hajič, Jan
Date (W3CDTF):2011-06-28T09:42:24Z
Date Available:2009-11-02T09:22:59Z
Description:The Feature-based (exponential model) Tagger is a fast implementation of the Czech tagger developed at UFAL and described in the PDT 1.0 documentation (Czech Language Tagging page). In order to get the best possible results, the tagger requires preprocessing by a Czech morphological module with a very high coverage. This module covers a superset of the Czech "FM" morphology. Both the morphological module and the tagger are supplied as binary executables, together with all necessary precompiled Czech data. Input must be in the ISO Latin 2 (iso-8859-2) code and follow the csts.dtd definition, and output is produced in the same way (ISO Latin 2 code, csts.dtd). (As is the case with many of the tools provided with PDT 1.0, both executables also accept - and then produce - a "simplified SGML", which is not a real, valid SGML, but simply contains at least the tags for words, punctuation, and sentence breaks, one item per line.)
Identifier (URI):http://hdl.handle.net/11858/00-097C-0000-0001-4904-2
Language:No linguistic content
Language (ISO639):zxx
Publisher:Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Rights:PDT 2.0 License
https://lindat.mff.cuni.cz/repository/xmlui/page/license-pdt2
Subject:morphology
tagger
Type:toolService
Type (DCMI):Software

OLAC Info

Archive:  LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11858/00-097C-0000-0001-4904-2
DateStamp:  2020-02-19
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Hajič, Jan. 2011. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL).
Terms: dcmi_Software iso639_zxx


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11858/00-097C-0000-0001-4904-2
Up-to-date as of: Thu Feb 20 8:39:30 EST 2020