OLAC Record: GlobalPhone Multilingual Model Package

OLAC Record
oai:catalogue.elra.info:ELRA-S0399

Metadata

Title: GlobalPhone Multilingual Model Package

Access Rights: Rights available for: nonCommercialUse, commercialUse

Coverage: Brazil

Latin America and the Caribbean

Date Available (W3CDTF): 2018-10-02

Date Issued (W3CDTF): 2018-10-02

Description: The GlobalPhone Multilingual Model Package contains about 22 hours of transcribed read speech spoken by native speakers in 22 languages. The data are sampled from the GlobalPhone Speech and Text Data available in the ELRA Catalogue, i.e.: Arabic (ELRA-S0192), Bulgarian (ELRA-S0319), Chinese-Mandarin (ELRA-S0193), Chinese-Shanghai (ELRA-S0194), Croatian (ELRA-S0195), Czech (ELRA-S0196), French (ELRA-S0197), German (ELRA-S0198), Hausa (ELRA-S0347), Japanese (ELRA-S0199), Korean (ELRA-S0200), Polish (ELRA-S0320), Portuguese (Brazilian) (ELRA-S0201), Russian (ELRA-S0202), Spanish (Latin America) (ELRA-S0203), Swahili (ELRA-S0375), Swedish (ELRA-S0204), Tamil (ELRA-S0205), Thai (ELRA-S0321), Turkish (ELRA-S0206), Ukrainian (ELRA-S0377), and Vietnamese (ELRA-S0322). The GlobalPhone Multilingual Model Package covers about 1 hour of transcribed speech from 10 speakers (5 male, 5 female) from each of the above listed 22 languages, i.e. on average about 6 minutes or about 41 utterances per speaker from a total of 220 speakers. The package is designed for various tasks in multilingual speech processing research and development, such as (1) multilingual acoustic modeling, (2) multilingual speech synthesis, (3) automatic dictionary generation in multiple languages, and (4) multilingual speech processing with low resources.

Identifier: ELRA-S0399

ISLRN: 204-945-263-927-6

Identifier (URI): https://catalog.elra.info/en-us/repository/browse/ELRA-S0399/

Language: Tamil

Bulgarian

Czech

Chinese

Swahili (macrolanguage)

German

Korean

Arabic

Thai

Portuguese

Russian

Croatian

Ukrainian

Japanese

Spanish; Castilian

Hausa

Polish

French

Turkish

Swedish

Vietnamese

Language (ISO639): tam

bul

ces

zho

swa

deu

kor

ara

tha

por

rus

hrv

ukr

jpn

spa

hau

pol

fra

tur

swe

vie

Medium: Not specified

Publisher: ELRA (European Language Resources Association)

Type (DCMI): Sound

Type (OLAC): primary_text

OLAC Info

Archive: ELRA Catalogue of Language Resources

Description: http://www.language-archives.org/archive/catalogue.elra.info

GetRecord: OAI-PMH request for OLAC format

GetRecord: Pre-generated XML file

OAI Info

OaiIdentifier: oai:catalogue.elra.info:ELRA-S0399

DateStamp: 2018-10-02

GetRecord: OAI-PMH request for simple DC format

Search Info
Citation: n.a. 2018. ELRA (European Language Resources Association).
Terms: area_Africa area_Asia area_Europe country_BG country_CZ country_DE country_ES country_FR country_HR country_IN country_JP country_KR country_NG country_PL country_PT country_RU country_SE country_TH country_TR country_UA country_VN dcmi_Sound iso639_ara iso639_bul iso639_ces iso639_deu iso639_fra iso639_hau iso639_hrv iso639_jpn iso639_kor iso639_pol iso639_por iso639_rus iso639_spa iso639_swa iso639_swe iso639_tam iso639_tha iso639_tur iso639_ukr iso639_vie iso639_zho olac_primary_text

http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-S0399
Up-to-date as of: Wed Oct 1 0:56:26 EDT 2025

Metadata
Title:		GlobalPhone Multilingual Model Package
Access Rights:		Rights available for: nonCommercialUse, commercialUse
Coverage:		Brazil
Coverage:		Latin America and the Caribbean
Date Available (W3CDTF):		2018-10-02
Date Issued (W3CDTF):		2018-10-02
Description:		The GlobalPhone Multilingual Model Package contains about 22 hours of transcribed read speech spoken by native speakers in 22 languages. The data are sampled from the GlobalPhone Speech and Text Data available in the ELRA Catalogue, i.e.: Arabic (ELRA-S0192), Bulgarian (ELRA-S0319), Chinese-Mandarin (ELRA-S0193), Chinese-Shanghai (ELRA-S0194), Croatian (ELRA-S0195), Czech (ELRA-S0196), French (ELRA-S0197), German (ELRA-S0198), Hausa (ELRA-S0347), Japanese (ELRA-S0199), Korean (ELRA-S0200), Polish (ELRA-S0320), Portuguese (Brazilian) (ELRA-S0201), Russian (ELRA-S0202), Spanish (Latin America) (ELRA-S0203), Swahili (ELRA-S0375), Swedish (ELRA-S0204), Tamil (ELRA-S0205), Thai (ELRA-S0321), Turkish (ELRA-S0206), Ukrainian (ELRA-S0377), and Vietnamese (ELRA-S0322). The GlobalPhone Multilingual Model Package covers about 1 hour of transcribed speech from 10 speakers (5 male, 5 female) from each of the above listed 22 languages, i.e. on average about 6 minutes or about 41 utterances per speaker from a total of 220 speakers. The package is designed for various tasks in multilingual speech processing research and development, such as (1) multilingual acoustic modeling, (2) multilingual speech synthesis, (3) automatic dictionary generation in multiple languages, and (4) multilingual speech processing with low resources.
Identifier:		ELRA-S0399
Identifier:		ISLRN: 204-945-263-927-6
Identifier (URI):		https://catalog.elra.info/en-us/repository/browse/ELRA-S0399/
Language:		Tamil
		Bulgarian
		Czech
		Chinese
		Swahili (macrolanguage)
		German
		Korean
		Arabic
		Thai
		Portuguese
		Russian
		Croatian
		Ukrainian
		Japanese
		Spanish; Castilian
		Hausa
		Polish
		French
		Turkish
		Swedish
		Vietnamese
Language (ISO639):		tam
		bul
		ces
		zho
		swa
		deu
		kor
		ara
		tha
		por
		rus
		hrv
		ukr
		jpn
		spa
		hau
		pol
		fra
		tur
		swe
		vie
Medium:		Not specified
Publisher:		ELRA (European Language Resources Association)
Type (DCMI):		Sound
Type (OLAC):		primary_text
OLAC Info
Archive:		ELRA Catalogue of Language Resources
Description:		http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:		OAI-PMH request for OLAC format
GetRecord:		Pre-generated XML file
OAI Info
OaiIdentifier:		oai:catalogue.elra.info:ELRA-S0399
DateStamp:		2018-10-02
GetRecord:		OAI-PMH request for simple DC format
Search Info
Citation:		n.a. 2018. ELRA (European Language Resources Association).
Terms:		area_Africa area_Asia area_Europe country_BG country_CZ country_DE country_ES country_FR country_HR country_IN country_JP country_KR country_NG country_PL country_PT country_RU country_SE country_TH country_TR country_UA country_VN dcmi_Sound iso639_ara iso639_bul iso639_ces iso639_deu iso639_fra iso639_hau iso639_hrv iso639_jpn iso639_kor iso639_pol iso639_por iso639_rus iso639_spa iso639_swa iso639_swe iso639_tam iso639_tha iso639_tur iso639_ukr iso639_vie iso639_zho olac_primary_text