OLAC Record

Title:Danish SpeechDat-Car - Full database
Access Rights: Rights available for: nonCommercialUse, commercialUse
Date Available (W3CDTF):2002-10-09
Date Issued (W3CDTF):2002-10-09
Date Modified (W3CDTF):2007-02-22
Description:The Danish SpeechDat-Car comprises the recordings of 300 Danish speakers from 5 different regions (162 males, 138 females), recorded over the GSM telephone network, and in a car. This database is partitioned into 15 DVDs (53 GB), plus 1 CD-ROM for e.g. non-signal files and documentation. The speech databases made within the SpeechDat-Car project were validated by SPEX, the Netherlands, to assess their compliance with the SpeechDat-Car format and content specifications.The speech data files are in two formats. Four of the microphones were recorded on the computer in the boot of the car. The speech data are stored as sequences of 16 kHz, 16 bit and uncompressed. The fifth microphone was connected to the cell phone, and was recorded on a remote machine, with compressed data stored as sequences of 8 bit A-law 8.kHz. Each signal file is accompanied by an ASCII SAM label file which contains the relevant descriptive information.Each speaker uttered the following items:2 voice activation keywords1 sequence of 10 isolated digits7 connected digits : 1 sheet number (5+ digits), 1 spontaneous telephone number, 3 read telephone numbers, 1 credit card number (14-16 digits), 1 PIN code (6 digits)3 dates : 1 spontaneous date (e.g. birthday), 1 prompted date, 1 relative or general date expression2 word spotting phrases using an application word (embedded)4 isolated digits7 spelled words : 1 spontaneous (own forename or surname), 1 spelling of directory city name, 4 real word/name, 1 artificial name for coverage1 money amount1 natural number7 directory assistance names : 1 spontaneous (own forename or surname), 1 city of birth / growing up (spontaneous), 2 most frequent cities, 2 most frequent company/agency, 1 "forename surname"9 phonetically rich sentences2 time phrases : 1 time of day (spontaneous), 1 time phrase (word style)4 phonetically rich words67 application words: 13 mobile phone application words, 22 IVR function keywords, 32 car products keywords2 additional language dependent keywordsPrompts for spontaneous speech2 additional keywords from a list of 10The following age distribution has been obtained: 84 speakers are between 18 and 30, 99 speakers are between 31 and 45, 98 speakers are between 46 and 60, and 19 speakers are over 60.A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
ISLRN: 392-627-715-651-4
Identifier (URI):https://catalog.elra.info/en-us/repository/browse/ELRA-S0132_01/
Language (ISO639):dan
Medium:Not specified
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Sound
Type (OLAC):primary_text


Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0132-01
DateStamp:  2002-10-09
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2002. ELRA (European Language Resources Association).
Terms: area_Europe country_DK dcmi_Sound iso639_dan olac_primary_text

Up-to-date as of: Fri Apr 19 6:28:21 EDT 2024