OLAC Record
oai:lindat.mff.cuni.cz:11372/LRT-292

Metadata
Title:repetitiveness checker
Bibliographic Citation:http://hdl.handle.net/11372/LRT-292
Contributor:Jongejan, Bart
Date (W3CDTF):2014-07-30T21:17:28Z
Date Available:2014-07-30T21:17:28Z
Description:1) Finds repeated sequences of words in documents (repetitiveness checker). 2) Finds common sequences of words in several documents (version comparison). A sequence consists of at least two words. There is no upper limit on the number of words in a sequence, but sequences do not cross sentence delimiters. There are several weight functions to choose from, each defining "good" sequences in a different way, based on word frequency, sequence length, and number of repetitions.
Identifier (URI):http://hdl.handle.net/11372/LRT-292
Language:No linguistic content
Language (ISO639):zxx
Publisher:Center for Sprogteknologi, University of Copenhagen
Type:toolService
Type (DCMI):Software
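
Illustration: the Description field above outlines the idea of collecting word sequences of at least two words that never cross a sentence delimiter, and then scoring them. The following is a minimal Python sketch of that idea only. It is not the tool's implementation: the function names `repeated_sequences` and `weight`, the naive sentence splitting on ".", "!" and "?", and the scoring formula are all assumptions made for this example, and word frequency is not modelled.

```python
import re
from collections import Counter


def repeated_sequences(text, min_len=2):
    """Return word sequences of at least min_len words that occur more than once.

    Sequences are collected sentence by sentence, so no sequence
    spans a sentence delimiter.
    """
    counts = Counter()
    # Naive sentence split on '.', '!' or '?' (an assumption of this sketch).
    for sentence in re.split(r"[.!?]+", text):
        words = re.findall(r"\w+", sentence.lower())
        # Enumerate every contiguous run of min_len or more words.
        for start in range(len(words)):
            for end in range(start + min_len, len(words) + 1):
                counts[tuple(words[start:end])] += 1
    return {seq: n for seq, n in counts.items() if n > 1}


def weight(seq, n):
    # Hypothetical weight function: longer sequences and more
    # repetitions score higher. The real tool offers several others.
    return len(seq) * (n - 1)


text = "The cat sat on the mat. The cat sat down. A dog ran."
repeats = repeated_sequences(text)
for seq, n in sorted(repeats.items(), key=lambda kv: -weight(*kv)):
    print(" ".join(seq), n, weight(seq, n))
```

Enumerating every contiguous run is quadratic in sentence length, which is acceptable for a short demonstration but would likely need a suffix-based index for large documents.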

OLAC Info

Archive:  LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11372/LRT-292
DateStamp:  2016-04-06
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Jongejan, Bart. 2014. repetitiveness checker. Center for Sprogteknologi, University of Copenhagen.
Terms: dcmi_Software iso639_zxx


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11372/LRT-292
Up-to-date as of: Mon Feb 10 15:10:32 EST 2020