OLAC Record
oai:www.ldc.upenn.edu:LDC96S55

Metadata
Title:CALLFRIEND Mandarin Chinese-Mainland Dialect
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Canavan, Alexandra, and George Zipperlen. CALLFRIEND Mandarin Chinese-Mainland Dialect LDC96S55. Web Download. Philadelphia: Linguistic Data Consortium, 1996
Contributor:Canavan, Alexandra
Zipperlen, George
Date (W3CDTF):1996
Description:*Introduction* CALLFRIEND Mandarin Chinese-Mainland Dialect was developed by the Linguistic Data Consortium (LDC) and consists of approximately 24 hours of unscripted telephone conversations between native speakers of the Mandarin Chinese dialect spoken in mainland China. The CALLFRIEND series is a collection of telephone conversations in several languages conducted by LDC in support of language identification technology development. Languages covered in the collection include American English, Canadian French, Egyptian Arabic, Farsi, German, Hindi, Japanese, Korean, Mandarin Chinese, Spanish, Tamil and Vietnamese.

An updated edition of this corpus is available as CALLFRIEND Mandarin Chinese-Mainland Dialect Second Edition (LDC2018S09). The second edition updates the audio files to wav format, simplifies the directory structure and adds documentation and metadata.

*Data* The corpus consists of 60 unscripted telephone conversations, each lasting between 5 and 30 minutes. The corpus also includes documentation describing speaker information (sex, age, education, callee telephone number) and call information (channel quality, number of speakers). For each conversation, both the caller and callee are native speakers of Mandarin Chinese from Mainland China. All calls are domestic and were placed inside the continental United States and Canada. Callers in the "Mainland" and "Taiwan" collections of CALLFRIEND Mandarin were identified primarily on the basis of specific attributes in their speech characteristic of geographic origin.

*Updates* There are no updates at this time.
Extent:Corpus size: 1379520 KB
Format:Sampling Rate: 8000
Sampling Format: 2-channel ulaw
Identifier:LDC96S55
https://catalog.ldc.upenn.edu/LDC96S55
ISBN: 1-58563-070-5
ISLRN: 608-636-717-091-6
Language:Mandarin Chinese
Language (ISO639):cmn
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/LDC%20User%20Agreement%20for%20Non-Members.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC96S55
Rights Holder:Portions © 1996 Trustees of the University of Pennsylvania
Type (DCMI):Sound
Type (OLAC):primary_text
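
The Extent and Format fields above can be cross-checked against the "approximately 24 hours" stated in the Description. The following is a rough sketch, not part of the record: it assumes that 1 KB means 1024 bytes, that each ulaw sample occupies one byte per channel, and that file headers and documentation are negligible. The function name `estimated_hours` is hypothetical.

```python
def estimated_hours(size_kb, sample_rate_hz, channels,
                    bytes_per_sample=1, bytes_per_kb=1024):
    """Estimate audio duration in hours from raw corpus size.

    Assumptions (not stated in the record): ulaw stores one byte per
    sample per channel, and header/documentation overhead is ignored.
    """
    total_bytes = size_kb * bytes_per_kb
    bytes_per_second = sample_rate_hz * channels * bytes_per_sample
    return total_bytes / bytes_per_second / 3600


# Values taken from the Extent and Format fields of this record.
hours = estimated_hours(size_kb=1379520, sample_rate_hz=8000, channels=2)
print(f"{hours:.1f} hours")  # prints "24.5 hours"
```

Under these assumptions the estimate is about 24.5 hours, or about 24.0 hours if 1 KB is taken as 1000 bytes. Either reading is consistent with the stated corpus length.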

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC96S55
DateStamp:  2019-12-12
GetRecord:  OAI-PMH request for simple DC format
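
The GetRecord entries above refer to OAI-PMH requests for this record. As a minimal sketch of how such a request URL could be assembled: the OAI-PMH protocol defines the `verb`, `identifier` and `metadataPrefix` query parameters, and `oai_dc` is the protocol's standard prefix for simple Dublin Core. The base URL below is a hypothetical placeholder, since the record does not give the repository endpoint, and the `olac` prefix for the OLAC format is an assumption.

```python
from urllib.parse import urlencode

# Hypothetical placeholder: the actual repository endpoint is not
# given in this record and must be substituted.
BASE_URL = "https://example.org/oai"


def get_record_url(base_url, identifier, metadata_prefix):
    """Build an OAI-PMH GetRecord request URL for one record."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


oai_id = "oai:www.ldc.upenn.edu:LDC96S55"  # OaiIdentifier from this record

# "oai_dc" is the simple DC prefix defined by OAI-PMH;
# "olac" is assumed to be the prefix for the OLAC format.
dc_url = get_record_url(BASE_URL, oai_id, "oai_dc")
olac_url = get_record_url(BASE_URL, oai_id, "olac")
print(dc_url)
print(olac_url)
```

The colons in the identifier are percent-encoded by `urlencode`, which keeps the query string valid regardless of the characters an identifier contains.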

Search Info

Citation: Canavan, Alexandra; Zipperlen, George. 1996. Linguistic Data Consortium.
Terms: area_Asia country_CN dcmi_Sound iso639_cmn olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC96S55
Up-to-date as of: Sat Jan 18 13:56:13 EST 2020