OLAC Record

Title: Mandarin Chinese Speech Synthesis Corpus (Basic Corpus)
Access Rights: Rights available for: nonCommercialUse, commercialUse
Date Available (W3CDTF): 2007-01-17
Date Issued (W3CDTF): 2007-01-17
Date Modified (W3CDTF): 2007-01-17
Description: This corpus contains the recordings of 1 native Chinese speaker (female). The corpus is composed of 20 texts with 109,227 words and has been proofread manually. The corpus contents include phrases, digit strings, letter strings, uncommon words, neutral tone, final retroflexion, Latin alphabet, interrogative sentences, and 282 English words. The speaker was recorded in a professional recording studio over 2 channels, microphone and glottis wave (fundamental frequency) signals, for a total of 18.2 hours. Speech samples are stored as sequences of 16-bit 44.1 kHz PCM on two channels. The total data size is 5.67 GB for a total of 12,679 files. The data is encoded in GB-2312 format. The transcriptions include labels for four-class pause boundaries. This database is intended for use in text-to-speech and speech synthesis applications.
ISLRN: 137-453-512-467-4
Identifier (URI): http://catalog.elra.info/en-us/repository/browse/ELRA-S0228_01/
Language (ISO639): zho
Publisher: ELRA (European Language Resources Association)
Type (DCMI): Sound
Type (OLAC): primary_text
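
The audio figures in the description (16-bit samples, 44.1 kHz, two channels, 18.2 hours) can be turned into a rough storage estimate. The sketch below is only a back-of-the-envelope check, assuming raw uncompressed PCM with no file headers; the function name `pcm_size_bytes` is hypothetical and not part of any corpus tooling. The record does not say whether the 18.2 hours counts each channel separately, so the result need not match the stated total size.

```python
def pcm_size_bytes(hours, sample_rate_hz, bits_per_sample, channels):
    """Size of raw, uncompressed PCM audio in bytes (no container headers)."""
    seconds = hours * 3600
    bytes_per_sample = bits_per_sample // 8
    return round(seconds * sample_rate_hz * bytes_per_sample * channels)


# Figures taken from the description; the interpretation is an assumption.
two_channel = pcm_size_bytes(18.2, 44_100, 16, 2)
one_channel = pcm_size_bytes(18.2, 44_100, 16, 1)

print(f"two channels: {two_channel / 1e9:.2f} GB")
print(f"one channel:  {one_channel / 1e9:.2f} GB")
```

Under these assumptions, 18.2 hours of two-channel audio comes to about 11.6 GB, and 18.2 hours of single-channel audio to about 5.8 GB. The second figure is closer to the stated 5.67 GB, which may mean the 18.2 hours is summed over both channels, but the record does not confirm this.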


Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0228-01
DateStamp:  2007-01-17
GetRecord:  OAI-PMH request for simple DC format
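
A GetRecord request in OAI-PMH is an HTTP query carrying a `verb`, an `identifier`, and a `metadataPrefix`. The sketch below builds such a URL for the identifier listed above. The base URL is a placeholder, since this record does not give the archive's OAI-PMH endpoint; `oai_dc` is the prefix OAI-PMH defines for simple Dublin Core, while `olac` is assumed here as the prefix for the OLAC format.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: the record does not state the real base URL.
BASE_URL = "https://example.org/oai"

# Identifier copied from the OaiIdentifier field of this record.
OAI_IDENTIFIER = "oai:catalogue.elra.info:ELRA-S0228-01"


def get_record_url(base_url, identifier, metadata_prefix):
    """Build an OAI-PMH GetRecord request URL with a percent-encoded query."""
    query = urlencode(
        {
            "verb": "GetRecord",
            "identifier": identifier,
            "metadataPrefix": metadata_prefix,
        }
    )
    return f"{base_url}?{query}"


# "olac" is an assumed prefix for the OLAC format; "oai_dc" is simple DC.
olac_url = get_record_url(BASE_URL, OAI_IDENTIFIER, "olac")
dc_url = get_record_url(BASE_URL, OAI_IDENTIFIER, "oai_dc")
print(olac_url)
print(dc_url)
```

Replacing `BASE_URL` with the archive's actual endpoint would give requests corresponding to the two GetRecord entries listed for this record.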

Search Info

Citation: n.a. 2007. ELRA (European Language Resources Association).
Terms: dcmi_Sound iso639_zho olac_primary_text

Up-to-date as of: Thu Apr 2 14:41:20 EDT 2020