OLAC Record oai:www.ldc.upenn.edu:LDC96L15 |
Metadata | ||
Title: | CALLHOME Mandarin Chinese Lexicon | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Huang, Shudong, et al. CALLHOME Mandarin Chinese Lexicon LDC96L15. Web Download. Philadelphia: Linguistic Data Consortium, 1996 | |
Contributor: | Huang, Shudong | |
Bian, Xuejun | ||
Wu, Grace | ||
McLemore, Cynthia | ||
Date (W3CDTF): | 1996 | |
Description: | The CALLHOME Mandarin Chinese collection includes a lexical component. The CALLHOME Mandarin Lexicon consists of 44,405 words and contains separate information fields with phonological, morphological and frequency information for each word. The token coverage by the LDC Mandarin lexicon of words occurring in the 20 LDC Mandarin CALLHOME devtest transcripts (ten minutes of conversation each) is 98%. Orthographic Chinese characters are GB-encoded and are simplified in the Mainland style. A representation of the headword in tone pinyin with strictly lexical tone, i.e. not reflecting phonetic/phonological processes is also provided. Here is a sample page from the lexicon. The transcripts and documentation (LDC96T16) are available separately, as is a corpus of telephone speech (LDC96S34). | |
Extent: | Corpus size: 1856 KB | |
Identifier: | LDC96L15 | |
https://catalog.ldc.upenn.edu/LDC96L15 | ||
ISBN: 1-58563-079-9 | ||
ISLRN: 969-490-893-990-1 | ||
DOI: 10.35111/ysmr-h820 | ||
Language: | Mandarin Chinese | |
Language (ISO639): | cmn | |
License: | CALLHOME Lexicon Agreement (Commercial): https://catalog.ldc.upenn.edu/license/callhome-lexicon-commercial.pdf | |
CALLHOME Lexicon Agreement (Non-Commercial): https://catalog.ldc.upenn.edu/license/callhome-lexicon-non-commercial.pdf | ||
CALLHOME Lexicon Agreement (Non-Member): https://catalog.ldc.upenn.edu/license/callhome-lexicon-non-member.pdf | ||
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC96L15 | |
Subject: | Mandarin Chinese language | |
Subject (ISO639): | cmn | |
Type (DCMI): | Text | |
Type (OLAC): | lexicon | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC96L15 | |
DateStamp: | 2020-11-30 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Huang, Shudong; Bian, Xuejun; Wu, Grace; McLemore, Cynthia. 1996. Linguistic Data Consortium. | |
Terms: | area_Asia country_CN dcmi_Text iso639_cmn olac_lexicon | |
Inferred Metadata | ||
Country: | China | |
Area: | Asia |