OLAC Record oai:lindat.mff.cuni.cz:11372/LRT-1229 |
| Metadata | ||
| Title: | LX-Tagger | |
| Bibliographic Citation: | http://hdl.handle.net/11372/LRT-1229 | |
| Contributor: | Branco, António | |
| Silva, João | ||
| Date (W3CDTF): | 2014-07-30T21:28:16Z | |
| Date Available: | 2014-07-30T21:28:16Z | |
| Description: | Automatic part-of-speech tagger for Portuguese. It assigns a single morpho-syntactic tag, from the tagset documented at http://lxsuite.di.fc.ul.pt/lx-suite.html, to every token. The tag is attached to the token with a slash (/) as separator: "um exemplo" → "um/IA exemplo/CN". Each individual token in a multi-token expression gets the tag of that expression, prefixed by "L" and followed by the number of its position within the expression: "de maneira a que" → "de/LCJ1 maneira/LCJ2 a/LCJ3 que/LCJ4". The tagger was developed with the TnT software, using 90% of a small (260k-token), accurately hand-tagged corpus. It achieved an accuracy of 96.87%. | |
| Identifier (URI): | http://hdl.handle.net/11372/LRT-1229 | |
| Language: | Portuguese | |
| Language (ISO639): | por | |
| Publisher: | NLX-Natural Language and Speech Group, University of Lisbon | |
| Type: | toolService | |
| Type (DCMI): | Software | |
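
The output format given in the description (token/TAG pairs, with multi-token expressions marked by an "L" prefix and a position number) can be read back with a few lines of Python. This is only a minimal sketch: the function names `parse_tagged` and `group_expressions` are hypothetical, the regular expression assumes that expression tags consist of uppercase letters only, and none of it is part of LX-Tagger itself.

```python
import re

# Assumption: a multi-token tag is "L" + expression tag + position, e.g. LCJ2.
MULTI = re.compile(r"^L([A-Z]+)(\d+)$")


def parse_tagged(text):
    """Split whitespace-separated 'token/TAG' items into (token, tag) pairs."""
    pairs = []
    for item in text.split():
        # Split on the last slash so a token containing '/' keeps its own slashes.
        token, _, tag = item.rpartition("/")
        pairs.append((token, tag))
    return pairs


def group_expressions(pairs):
    """Merge consecutive tokens of one multi-token expression into a single pair."""
    grouped = []
    open_tag = None  # expression tag currently being extended, if any
    for token, tag in pairs:
        match = MULTI.match(tag)
        if match is None:
            # Ordinary single-token tag: keep as-is and close any open expression.
            grouped.append((token, tag))
            open_tag = None
            continue
        expr_tag, position = match.group(1), int(match.group(2))
        if position > 1 and open_tag == expr_tag:
            # Continuation of the expression started by an earlier token.
            prev_token, _ = grouped[-1]
            grouped[-1] = (prev_token + " " + token, expr_tag)
        else:
            # First token of a new expression.
            grouped.append((token, expr_tag))
            open_tag = expr_tag
    return grouped


if __name__ == "__main__":
    print(parse_tagged("um/IA exemplo/CN"))
    print(group_expressions(parse_tagged("de/LCJ1 maneira/LCJ2 a/LCJ3 que/LCJ4")))
```

Applied to the two examples from the description, the first call yields the pairs for "um" and "exemplo", and the second collapses the four tokens of "de maneira a que" into one entry tagged CJ.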
OLAC Info |
||
| Archive: | LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University | |
| Description: | http://www.language-archives.org/archive/lindat.mff.cuni.cz | |
| GetRecord: | OAI-PMH request for OLAC format | |
| GetRecord: | Pre-generated XML file | |
OAI Info |
||
| OaiIdentifier: | oai:lindat.mff.cuni.cz:11372/LRT-1229 | |
| DateStamp: | 2016-04-06 | |
| GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
| Citation: | Branco, António; Silva, João. 2014. LX-Tagger. NLX-Natural Language and Speech Group, University of Lisbon. | |
| Terms: | area_Europe country_PT dcmi_Software iso639_por | |