OLAC Record
oai:www.ldc.upenn.edu:LDC96L15

Metadata
Title:CALLHOME Mandarin Chinese Lexicon
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Huang, Shudong, et al. CALLHOME Mandarin Chinese Lexicon LDC96L15. Web Download. Philadelphia: Linguistic Data Consortium, 1996
Contributor:Huang, Shudong
Bian, Xuejun
Wu, Grace
McLemore, Cynthia
Date (W3CDTF):1996
Description:The CALLHOME Mandarin Chinese collection includes a lexical component. The CALLHOME Mandarin Lexicon consists of 44,405 words and contains separate information fields with phonological, morphological and frequency information for each word. The token coverage by the LDC Mandarin lexicon of words occurring in the 20 LDC Mandarin CALLHOME devtest transcripts (ten minutes of conversation each) is 98%. Orthographic Chinese characters are GB-encoded and are simplified in the Mainland style. A representation of the headword in tone pinyin with strictly lexical tone, i.e. not reflecting phonetic/phonological processes is also provided. Here is a sample page from the lexicon. The transcripts and documentation (LDC96T16) are available separately, as is a corpus of telephone speech (LDC96S34).
Extent:Corpus size: 1856 KB
Identifier:LDC96L15
https://catalog.ldc.upenn.edu/LDC96L15
ISBN: 1-58563-079-9
ISLRN: 969-490-893-990-1
DOI: 10.35111/ysmr-h820
Language:Mandarin Chinese
Language (ISO639):cmn
License:CALLHOME Lexicon Agreement (Commercial): https://catalog.ldc.upenn.edu/license/callhome-lexicon-commercial.pdf
CALLHOME Lexicon Agreement (Non-Commercial): https://catalog.ldc.upenn.edu/license/callhome-lexicon-non-commercial.pdf
CALLHOME Lexicon Agreement (Non-Member): https://catalog.ldc.upenn.edu/license/callhome-lexicon-non-member.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC96L15
Subject:Mandarin Chinese language
Subject (ISO639):cmn
Type (DCMI):Text
Type (OLAC):lexicon

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC96L15
DateStamp:  2020-11-30
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Huang, Shudong; Bian, Xuejun; Wu, Grace; McLemore, Cynthia. 1996. Linguistic Data Consortium.
Terms: area_Asia country_CN dcmi_Text iso639_cmn olac_lexicon

Inferred Metadata

Country: China
Area: Asia


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC96L15
Up-to-date as of: Mon Mar 25 7:19:55 EDT 2024