OLAC Record
oai:catalogue.elra.info:ELRA-L0094

Metadata
Title:CEPLEXicon
Abstract:CEPLEXicon results from the automatic tagging of two corpora, using a tagger and the POS tag set. The automatic tagging was followed by a partial manual revision. This lexicon covers all the speech produced by seven monolingual Portuguese children aged 1;02.00 to 3;11.12, in a total of 114 files, each corresponding to 40-50 minutes of child-adult interaction in a naturalistic setting. The lexicon is presented in .xls format and includes 2201 lemmas, the number of occurrences of each lemma in three different age periods, frequency of the lemma in each period and age of first occurrence for each child.
Access Rights:Rights available for: Research Use, Commercial Use
Date Available (W3CDTF):2015-04-15
Date Issued (W3CDTF):2015-04-15
Date Modified (W3CDTF):2015-04-15
Description:Monolingual Lexicons
CEPLEXicon is a lexicon based on two different corpora of child speech: the Santos corpus (Santos, 2006; Santos et al., 2014; see http://www.clul.ul.pt/resources/546?lang=en) and the Freitas corpus (Freitas, 1997; Freitas et al., 2012). This lexicon results from the automatic tagging of the two corpora, using a tagger and the POS tag set produced in the research unit ANAGRAMA (Centro de Linguística da Universidade de Lisboa - CLUL) (Généreux, Hendrickx & Mendes, 2012). The automatic tagging was followed by a partial manual revision (as described in the manual). This lexicon covers all the speech produced by seven monolingual Portuguese children aged 1;02.00 to 3;11.12, in a total of 114 files, each corresponding to 40-50 minutes of child-adult interaction in a naturalistic setting. The lexicon is presented in .xls format and includes 2201 lemmas, the number of occurrences of each lemma in three different age periods (< 2 years of age; ≥ 2 and < 3 years of age; ≥ 3 years of age), frequency of the lemma in each period and age of first occurrence for each child. CEPLEXicon was developed at ANAGRAMA (CLUL, Faculdade de Letras da Universidade de Lisboa), under the project Complement Clauses in the Acquisition of Portuguese (PTDC/CLE-LIN/120897/2010), funded by Fundação para a Ciência e Tecnologia.
Identifier:ELRA-L0094
http://catalog.elra.info/product_info.php?products_id=1244
Language:Portuguese
Language (ISO639):por
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text
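
Note: The ages in the description (1;02.00 to 3;11.12) appear to follow the years;months.days notation common in child language corpora, and the lexicon groups occurrences into three age periods. The sketch below shows one way such an age string could be parsed and assigned to a period. The names parse_age and age_period are hypothetical, and the assumption that ages in the .xls file use exactly this notation is not confirmed by the record.

```python
import re
from typing import NamedTuple


class Age(NamedTuple):
    years: int
    months: int
    days: int


# Assumed notation: years;months.days, e.g. "1;02.00"
AGE_PATTERN = re.compile(r"^(\d+);(\d{1,2})\.(\d{1,2})$")


def parse_age(text: str) -> Age:
    """Parse a years;months.days string into an Age tuple (hypothetical helper)."""
    match = AGE_PATTERN.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised age notation: {text!r}")
    years, months, days = (int(group) for group in match.groups())
    return Age(years, months, days)


def age_period(age: Age) -> str:
    """Map an age to one of the three periods named in the description."""
    if age.years < 2:
        return "< 2 years"
    if age.years < 3:
        return ">= 2 and < 3 years"
    return ">= 3 years"


if __name__ == "__main__":
    for raw in ("1;02.00", "2;06.15", "3;11.12"):
        print(raw, "->", age_period(parse_age(raw)))
```

Only the year component decides the period here, because the boundaries stated in the description fall on whole years.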

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-L0094
DateStamp:  2015-04-15
GetRecord:  OAI-PMH request for simple DC format
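
Note: The GetRecord entries above refer to OAI-PMH requests. As a minimal sketch, such a request is an HTTP query with the standard arguments verb, identifier and metadataPrefix. The base URL below is a placeholder (the record does not state the repository endpoint), build_get_record_url is a hypothetical helper, and the prefixes "olac" and "oai_dc" are assumed to correspond to the OLAC and simple DC formats listed.

```python
from urllib.parse import urlencode

# Placeholder endpoint: the actual OAI-PMH base URL is not given in this record.
BASE_URL = "https://example.org/oai"


def build_get_record_url(base_url: str, identifier: str, metadata_prefix: str) -> str:
    """Build an OAI-PMH GetRecord request URL (hypothetical helper)."""
    query = urlencode(
        {
            "verb": "GetRecord",
            "identifier": identifier,
            "metadataPrefix": metadata_prefix,
        }
    )
    return f"{base_url}?{query}"


if __name__ == "__main__":
    oai_id = "oai:catalogue.elra.info:ELRA-L0094"
    print(build_get_record_url(BASE_URL, oai_id, "olac"))    # OLAC format
    print(build_get_record_url(BASE_URL, oai_id, "oai_dc"))  # simple DC format
```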

Search Info

Citation: n.a. 2015. ELRA (European Language Resources Association).
Terms: area_Europe country_PT dcmi_Text iso639_por olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-L0094
Up-to-date as of: Wed Mar 27 8:17:25 EDT 2019