OLAC Record
oai:lindat.mff.cuni.cz:11372/LRT-340

Metadata
Title:XCES parallel corpora generator
Bibliographic Citation:http://hdl.handle.net/11372/LRT-340
Contributor:Tufiş, Dan
Ion, Radu
Date (W3CDTF):2014-07-30T21:18:08Z
Date Available:2014-07-30T21:18:08Z
Description:XCESGen is a series of tools to generate parallel corpora in [[http://www.xces.org|XCES]] format: --- metacategories annotation: every word receives a category that defines a tag-set subset. --- chunking: adjacent word phrases are marked and named: noun phrases, verb phrases, prepositional phrases, etc. --- lemma/morpho-syntactic label annotation: it uses [[http://www.clarin.eu/tools/ttl-tokenizing-tagging-and-lemmatizing-free-running-texts|TTL]]. --- sense annotation: with [[http://www.clarin.eu/tools/word-sense-disambiguation-tool|WSDTool]]/[[http://www.clarin.eu/tools/synwsd|SynWSD]]. --- link annotation: with [[http://www.clarin.eu/tools/lexpar-word-linker|LexPar]].
Identifier (URI):http://hdl.handle.net/11372/LRT-340
Language:English
Romanian
Language (ISO639):eng
ron
Publisher:Research Institute for Artificial Intelligence, Romanian Academy of Sciences
Type:toolService
Type (DCMI):Software

OLAC Info

Archive:  LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11372/LRT-340
DateStamp:  2016-04-06
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Tufiş, Dan; Ion, Radu. 2014. Research Institute for Artificial Intelligence, Romanian Academy of Sciences.
Terms: area_Europe country_GB country_RO dcmi_Software iso639_eng iso639_ron


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11372/LRT-340
Up-to-date as of: Sun Nov 26 2:05:35 EST 2017