OLAC Record
oai:catalogue.elra.info:ELRA-L0096

Metadata
Title:MCL - Multifunctional Computational Lexicon of Contemporary Portuguese
Abstract:MCL is a 26,443 lemma Frequency Lexicon with 140,315 tokens extracted from CORLEX, a contemporary Portuguese corpus (16,210,438 words). In order to extract the lexicon, all the different lexical forms occurring in the corpus were indexed and subsequently tagged morphosyntactically and lemmatised by PALAVROSO. Each lemma in MCL is followed by morphosyntactic and quantitative information.
Access Rights:Rights available for: Commercial Use, Research Use
Date Available (W3CDTF):2016-01-20
Date Issued (W3CDTF):2016-01-20
Date Modified (W3CDTF):2016-01-20
Description:Monolingual Lexicons
MCL is a 26,443 lemma Frequency Lexicon with 140,315 tokens, with the minimum lemma frequency of 6, extracted from CORLEX, a contemporary Portuguese corpus (16,210,438 words). CORLEX is a subcorpus of the Reference Corpus of Contemporary Portuguese and contains written and spoken texts of several types, being genre diversity a characteristic of this corpus. CORLEX contains mainly journalistic texts (56% of the written corpus and 53% of the whole corpus). In order to extract the lexicon, all the different lexical forms occurring in the corpus were indexed and subsequently tagged morphosyntactically and lemmatised by PALAVROSO. Each lemma in MCL is followed by morphosyntactic and quantitative information. The same information is given regarding each lemma token (inflected forms and some compounds). The lexicon indexations are listed in alphabetical order or decreasing frequency order.
Identifier:ELRA-L0096
http://catalog.elra.info/product_info.php?products_id=1254
Language:Portuguese
Language (ISO639):por
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-L0096
DateStamp:  2016-01-20
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2016. ELRA (European Language Resources Association).
Terms: area_Europe country_PT dcmi_Text iso639_por olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-L0096
Up-to-date as of: Wed Mar 27 8:17:27 EDT 2019