OLAC Record

Bibliographic Citation:http://hdl.handle.net/11372/LRT-1229
Contributor:Branco, António
Silva, João
Date (W3CDTF):2014-07-30T21:28:16Z
Date Available:2014-07-30T21:28:16Z
Description:Automatic part-of-speech tagger for Portuguese. It assigns a single morpho-syntactic tag, from the tagset documented at http://lxsuite.di.fc.ul.pt/lx-suite.html, to every token. The tag is attached to the token with a slash (/) as separator:
um exemplo → um/IA exemplo/CN
Each token in a multi-token expression receives the tag of that expression, prefixed by "L" and followed by the token's position within the expression:
de maneira a que → de/LCJ1 maneira/LCJ2 a/LCJ3 que/LCJ4
The tagger was developed with the TnT software on 90% of a small (260k-token), accurately hand-tagged corpus. It obtained an accuracy of 96.87%.
Identifier (URI):http://hdl.handle.net/11372/LRT-1229
Language (ISO639):por
Publisher:NLX-Natural Language and Speech Group, University of Lisbon
Type (DCMI):Software
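The output format in the Description field (token/TAG pairs, with multi-token expressions marked by an "L" prefix and a position number) can be read back with a few lines of Python. The following is a minimal sketch, not part of the tagger itself: the function names parse_tagged and group_expressions are hypothetical, and it assumes that pairs are whitespace-separated, that the tag follows the last slash in each item, and that a tag belongs to a multi-token expression only if it starts with "L" and ends in digits.

```python
import re
from typing import List, Tuple

# Assumption: a multi-token tag looks like "L" + base tag + position, e.g. "LCJ1".
MWE_TAG = re.compile(r"^L(?P<tag>.+?)(?P<pos>\d+)$")


def parse_tagged(line: str) -> List[Tuple[str, str]]:
    """Split 'um/IA exemplo/CN' into [('um', 'IA'), ('exemplo', 'CN')]."""
    pairs = []
    for item in line.split():
        # Split on the LAST slash so tokens that contain '/' stay intact.
        token, sep, tag = item.rpartition("/")
        if not sep:
            raise ValueError(f"no tag separator in {item!r}")
        pairs.append((token, tag))
    return pairs


def group_expressions(pairs: List[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Merge consecutive tokens of one multi-token expression into one entry."""
    grouped: List[Tuple[str, str]] = []
    open_tag = None  # base tag of the expression currently being built
    for token, tag in pairs:
        match = MWE_TAG.match(tag)
        if match is None:
            # Ordinary single-token tag: keep as is.
            grouped.append((token, tag))
            open_tag = None
        elif match["pos"] != "1" and open_tag == match["tag"]:
            # Continuation of the open expression: extend its token string.
            prev_token, _ = grouped[-1]
            grouped[-1] = (f"{prev_token} {token}", open_tag)
        else:
            # Position 1 (or an unexpected start): open a new expression.
            grouped.append((token, match["tag"]))
            open_tag = match["tag"]
    return grouped


if __name__ == "__main__":
    print(parse_tagged("um/IA exemplo/CN"))
    print(group_expressions(parse_tagged("de/LCJ1 maneira/LCJ2 a/LCJ3 que/LCJ4")))
```

Splitting on the last slash is a deliberate choice so that a token containing a slash is not cut in the wrong place; whether the real tagger output ever contains such tokens is not stated in this record.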


Archive:  LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11372/LRT-1229
DateStamp:  2016-04-06
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Branco, António; Silva, João. 2014. NLX-Natural Language and Speech Group, University of Lisbon.
Terms: area_Europe country_PT dcmi_Software iso639_por

Up-to-date as of: Sat Apr 13 8:52:14 EDT 2019