Contributor:Branco, António
Silva, João
Description:Automatic part-of-speech tagger for Portuguese. It assigns a single morpho-syntactic tag, from the tagset documented at http://lxsuite.di.fc.ul.pt/lx-suite.html, to every token. The tag is attached to the token with a slash (/) as separator: "um exemplo" → "um/IA exemplo/CN". Each token in a multi-token expression receives the tag of that expression, prefixed by "L" and followed by the token's position within the expression: "de maneira a que" → "de/LCJ1 maneira/LCJ2 a/LCJ3 que/LCJ4". The tagger was developed with the TnT software, using 90% of a small (260k-token), accurately hand-tagged corpus. An accuracy of 96.87% was obtained.
Language (ISO639):por
Publisher:NLX-Natural Language and Speech Group, University of Lisbon
Type (DCMI):Software
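
The output format in the description (token/TAG pairs, with multi-token expressions marked by an "L" prefix and a position suffix) can be read back with a few lines of Python. The following is only an illustrative sketch and not part of the tagger: the function names parse_tagged and group_expressions are hypothetical, and it assumes that tokens are separated by whitespace and that any tag of the form "L" + tag + number marks a multi-token expression.

```python
import re

# Assumption: a multi-token tag is "L" + expression tag + position, e.g. LCJ1.
MWE_TAG = re.compile(r"^L(?P<tag>\D+)(?P<pos>\d+)$")


def parse_tagged(text):
    """Split tagger output such as 'um/IA exemplo/CN' into (token, tag) pairs."""
    pairs = []
    for item in text.split():
        # Split on the last slash so that a token containing '/' stays whole.
        token, sep, tag = item.rpartition("/")
        if not sep:
            raise ValueError(f"no tag separator in {item!r}")
        pairs.append((token, tag))
    return pairs


def group_expressions(pairs):
    """Merge consecutive tokens of one multi-token expression into one entry."""
    groups = []  # each entry: [list of tokens, tag, is_multi_token]
    for token, tag in pairs:
        match = MWE_TAG.match(tag)
        if match is None:
            groups.append([[token], tag, False])
            continue
        base, pos = match.group("tag"), int(match.group("pos"))
        # Positions above 1 continue the expression opened just before.
        if pos > 1 and groups and groups[-1][2] and groups[-1][1] == base:
            groups[-1][0].append(token)
        else:
            groups.append([[token], base, True])
    return [(" ".join(tokens), tag) for tokens, tag, _ in groups]


if __name__ == "__main__":
    print(parse_tagged("um/IA exemplo/CN"))
    print(group_expressions(parse_tagged("de/LCJ1 maneira/LCJ2 a/LCJ3 que/LCJ4")))
```

Applied to the two examples from the description, the first call yields the pairs (um, IA) and (exemplo, CN), and the second collapses the four tokens into the single expression "de maneira a que" with tag CJ.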


Citation: Branco, António; Silva, João. 2014. NLX-Natural Language and Speech Group, University of Lisbon.
