OLAC Record
oai:www.ldc.upenn.edu:LDC2019S23

Metadata
Title:Magic Data Chinese Mandarin Conversational Speech
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Beijing Magic Data Technology Co.. Magic Data Chinese Mandarin Conversational Speech LDC2019S23. Web Download. Philadelphia: Linguistic Data Consortium, 2019
Contributor:Beijing Magic Data Technology Co.
Date (W3CDTF):2019
Date Issued (W3CDTF):2019-12-05
Description:*Introduction* Magic Data Chinese Mandarin Conversational Speech was developed by Beijing Magic Data Technology Co., Ltd. and consists of approximately 10 hours of Mandarin conversational speech from 60 speakers. Each conversation was recorded on multiple devices and is presented in multiple forms, resulting in a total of approximately 60 hours of audio with corresponding transcripts. *Data* All participants were native speakers of Mandarin in Mainland China from accent regions across the country. Speakers were paired for conversations on a range of topics, including travel, fitness, games, sports and pets. Speech data was recorded on mobile devices and is presented as 16kHz, 16-bit flac compressed pcm wav. Most files are single channel; however, a stereo version of each conversation is also included. Transcript data is contained in UTF-8 encoded plain text TextGrids. Metadata such as topic, collection date, mobile device and speaker demographic information is found in the documentation accompanying this release. *Samples* Please view this stereo speech sample and transcript sample. *Updates* None at this time.
Extent:Corpus size: 3741458 KB
Format:Sampling Rate: 16000
Sampling Format: pcm
Identifier:LDC2019S23
https://catalog.ldc.upenn.edu/LDC2019S23
ISBN: 1-58563-911-7
ISLRN: 636-430-467-703-3
Language:Mandarin Chinese
Language (ISO639):cmn
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2019S23
Rights Holder:Portions © 2019 Beijing Magic Data Technology Co., Ltd., © 2019 Trustees of the University of Pennsylvania
Type (DCMI):Sound
Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2019S23
DateStamp:  2020-06-12
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Beijing Magic Data Technology Co. 2019. Linguistic Data Consortium.
Terms: area_Asia country_CN dcmi_Sound dcmi_Text iso639_cmn olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2019S23
Up-to-date as of: Mon Sep 7 10:38:32 EDT 2020