Title:repetitiveness checker
Contributor:Jongejan, Bart
Date (W3CDTF):2014-07-30T21:17:28Z
Description:1) Finds repeated sequences of words within a document (repetitiveness checker). 2) Finds sequences of words shared by several documents (version comparison). A sequence consists of at least two words. There is no upper limit on the number of words in a sequence, but sequences do not cross sentence delimiters. There are several weight functions to choose from, each defining "good" sequences in a different way, based on word frequency, sequence length and number of repetitions.
Identifier (URI):http://hdl.handle.net/11372/LRT-292
Language:No linguistic content
Language (ISO639):zxx
Publisher:Center for Sprogteknologi, University of Copenhagen
Citation: Jongejan, Bart. 2014. repetitiveness checker. Center for Sprogteknologi, University of Copenhagen. http://hdl.handle.net/11372/LRT-292
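
Illustration: the behaviour outlined in the Description (sequences of at least two words, no upper length limit, never crossing a sentence delimiter) can be sketched roughly as follows. This is not the tool's own code. The function names, the delimiter set ".!?" and the length-times-repetitions weight are assumptions made for illustration only; the record does not specify the tool's actual weight functions.

```python
import re
from collections import Counter


def repeated_sequences(text, min_words=2):
    """Return word sequences (as tuples) that occur more than once in text.

    Sequences have at least `min_words` words and no upper length limit.
    They are collected per sentence, so none spans a sentence delimiter.
    """
    counts = Counter()
    # Assumption: '.', '!' and '?' are the sentence delimiters.
    for sentence in re.split(r"[.!?]+", text):
        words = re.findall(r"\w+", sentence.lower())
        for start in range(len(words)):
            for end in range(start + min_words, len(words) + 1):
                counts[tuple(words[start:end])] += 1
    return {seq: n for seq, n in counts.items() if n > 1}


def weight(seq, repetitions):
    """Hypothetical weight: longer, more often repeated sequences score higher."""
    return len(seq) * repetitions


if __name__ == "__main__":
    sample = "The cat sat on the mat. The cat sat by the door."
    found = repeated_sequences(sample)
    # List the repeated sequences, highest hypothetical weight first.
    for seq, n in sorted(found.items(), key=lambda item: -weight(*item)):
        print(" ".join(seq), n, weight(seq, n))
```

For version comparison, the same counting could be run per document and the resulting key sets intersected; that variant is likewise only a sketch of the idea, not a description of the tool.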
