OLAC Record
oai:catalogue.elra.info:ELRA-S0399

Metadata
Title:GlobalPhone Multilingual Model Package
Abstract:The GlobalPhone Multilingual Model Package contains about 22 hours of transcribed read speech from native speakers of 22 languages (Arabic, Bulgarian, Chinese-Mandarin, Chinese-Shanghai, Croatian, Czech, French, German, Hausa, Japanese, Korean, Polish, Portuguese (Brazilian), Russian, Spanish (Latin America), Swahili, Swedish, Tamil, Thai, Turkish, Ukrainian, and Vietnamese). For each of these languages, the package covers about 1 hour of transcribed speech from 10 speakers (5 male, 5 female).
Access Rights:Rights available for: Commercial Use, Research Use
Date Available (W3CDTF):2018-10-02
Date Issued (W3CDTF):2018-07-04
Date Modified (W3CDTF):2018-10-02
Description:Desktop/Microphone
The GlobalPhone Multilingual Model Package contains about 22 hours of transcribed read speech from native speakers of 22 languages. The data are sampled from the GlobalPhone Speech and Text Data resources available in the ELRA Catalogue: Arabic (ELRA-S0192), Bulgarian (ELRA-S0319), Chinese-Mandarin (ELRA-S0193), Chinese-Shanghai (ELRA-S0194), Croatian (ELRA-S0195), Czech (ELRA-S0196), French (ELRA-S0197), German (ELRA-S0198), Hausa (ELRA-S0347), Japanese (ELRA-S0199), Korean (ELRA-S0200), Polish (ELRA-S0320), Portuguese (Brazilian) (ELRA-S0201), Russian (ELRA-S0202), Spanish (Latin America) (ELRA-S0203), Swahili (ELRA-S0375), Swedish (ELRA-S0204), Tamil (ELRA-S0205), Thai (ELRA-S0321), Turkish (ELRA-S0206), Ukrainian (ELRA-S0377), and Vietnamese (ELRA-S0322). For each of the 22 languages, the package covers about 1 hour of transcribed speech from 10 speakers (5 male, 5 female), i.e. on average about 6 minutes, or about 41 utterances, per speaker across a total of 220 speakers. The package is designed for tasks in multilingual speech processing research and development, such as (1) multilingual acoustic modeling, (2) multilingual speech synthesis, (3) automatic dictionary generation in multiple languages, and (4) multilingual speech processing with low resources.
Identifier:ELRA-S0399
http://catalog.elra.info/product_info.php?products_id=1324
Language:Arabic
Bulgarian
Croatian
Czech
French
German
Hausa
Japanese
Korean
Polish
Portuguese
Russian
Spanish, Castilian
Swahili (macrolanguage); Swahili
Swedish
Tamil
Thai
Turkish
Ukrainian
Vietnamese
Chinese
Language (ISO639):ara
bul
ces
fra
deu
hau
jpn
kor
pol
por
rus
spa
swa
swe
tam
tha
tur
ukr
vie
zho
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Sound
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0399
DateStamp:  2018-10-02
GetRecord:  OAI-PMH request for simple DC format
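
The GetRecord entries above refer to OAI-PMH requests for this record. As a minimal sketch of what such a request might look like, the snippet below builds GetRecord URLs from the OaiIdentifier. The base URL is a hypothetical placeholder, since this record does not state the archive's OAI-PMH endpoint. The `oai_dc` prefix is the simple DC format defined by OAI-PMH; `olac` is assumed here as the prefix for the OLAC format.

```python
from urllib.parse import urlencode

# Hypothetical placeholder: the actual OAI-PMH endpoint is not given in this record.
BASE_URL = "https://example.org/oai"

# Identifier taken from the OaiIdentifier field of this record.
OAI_ID = "oai:catalogue.elra.info:ELRA-S0399"


def get_record_url(base_url: str, identifier: str, metadata_prefix: str) -> str:
    """Build an OAI-PMH GetRecord request URL for one record."""
    query = urlencode(
        {
            "verb": "GetRecord",
            "identifier": identifier,
            "metadataPrefix": metadata_prefix,
        }
    )
    return f"{base_url}?{query}"


# Simple DC format (prefix defined by the OAI-PMH specification).
dc_url = get_record_url(BASE_URL, OAI_ID, "oai_dc")

# OLAC format (prefix assumed to be "olac").
olac_url = get_record_url(BASE_URL, OAI_ID, "olac")

print(dc_url)
print(olac_url)
```

Replacing `BASE_URL` with the archive's real endpoint, if known, would yield requests that return the record as XML in the chosen format.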

Search Info

Citation: n.a. 2018. ELRA (European Language Resources Association).
Terms: area_Africa area_Asia area_Europe country_BG country_CZ country_DE country_ES country_FR country_IN country_JP country_KR country_NG country_PL country_PT country_RU country_SE country_TH country_TR country_UA country_VN dcmi_Sound iso639_ara iso639_bul iso639_ces iso639_deu iso639_fra iso639_hau iso639_jpn iso639_kor iso639_pol iso639_por iso639_rus iso639_spa iso639_swa iso639_swe iso639_tam iso639_tha iso639_tur iso639_ukr iso639_vie iso639_zho olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-S0399
Up-to-date as of: Wed Mar 27 8:17:38 EDT 2019