OLAC Record
oai:lindat.mff.cuni.cz:11372/LRT-1227

Metadata
Title:LX-Splitter
Bibliographic Citation:http://hdl.handle.net/11372/LRT-1227
Contributor:Branco, António
Silva, João
Date (W3CDTF):2014-07-30T21:28:15Z
Date Available:2014-07-30T21:28:15Z
Description:Automatic segmenter of paragraphs and sentences of Portuguese. Marks sentence boundaries with s…/s, and paragraph boundaries with p…/p. Unwraps sentences split over different lines. A f-score of 99.94% was obtained when testing on a 12,000 sentence corpus accurately hand tagged with respect to sentence and paragraph boundaries.
Identifier (URI):http://hdl.handle.net/11372/LRT-1227
Language:Portuguese
Language (ISO639):por
Publisher:NLX-Natural Language and Speech Group, University of Lisbon
Type:toolService
Type (DCMI):Software

OLAC Info

Archive:  LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11372/LRT-1227
DateStamp:  2016-04-06
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Branco, António; Silva, João. 2014. NLX-Natural Language and Speech Group, University of Lisbon.
Terms: area_Europe country_PT dcmi_Software iso639_por


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11372/LRT-1227
Up-to-date as of: Mon Feb 10 15:12:30 EST 2020