OLAC Record: Speech Controlled Computing

OLAC Record
oai:www.ldc.upenn.edu:LDC2006S30

Metadata

Title: Speech Controlled Computing

Access Rights: Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining

Bibliographic Citation: Cieri, Christopher, et al. Speech Controlled Computing LDC2006S30. Web Download. Philadelphia: Linguistic Data Consortium, 2006

Contributor: Cieri, Christopher

Miller, David

Martey, Nii O.

Maeda, Kazuaki

Date (W3CDTF): 2006

Date Issued (W3CDTF): 2006-03-24

Description: *Introduction* Speech Controlled Computing was developed by the Linguistic Data Consortium (LDC) and consists of 261,535 files of American English utterances. The corpus was designed to support the development of small-footprint, embedded ASR applications in the domain of voice control for the home. It consists of audio files generated from recording sessions that took place between December 2003 and July 2004 with 125 speakers of American English from four dialect regions, three age groups and two gender groups, pronouncing isolated words. The four primary dialect regions covered by the corpus are North, South, West and Midland, as defined by William Labov's Atlas of North American English. The three primary age groups covered by the corpus are 18-29, 30-49 and 50+.

*Data* The recordings were conducted in a sound-attenuated room at LDC with the AKG C4000B studio condenser microphone, set to its omni-directional mode. Each speaker read a randomized word list consisting of 2,100 words (100 distinct words appearing 21 times each). Speech utterances were digitized and recorded to a DAT, as well as to a hard disk drive via the Townshend DATLINK+ digital audio interface. All of the audio files are single-channel 48 kHz 16-bit PCM wav files.

Speech utterances were audited as they were recorded, and any utterances detected by the recorder that were not spoken clearly or correctly were re-recorded. This included utterances that may have been corrupted by extraneous clicks, coughs, sighs, or breathing. Utterances that were spoken too softly or too loudly were also re-recorded. The digitized utterances were automatically segmented and aligned to the word list. Each utterance was then audited, and its segmentation was checked and, if necessary, corrected by an annotator using an auditing and segmenting tool developed by LDC.

Finally, sound files containing individual utterances were generated using the alignment and segmentation information. The sound files for this corpus were created with 100 msec of silence before and after each utterance. Any files that contained noticeable clipping were automatically removed.

*Samples* For an example of the data in this corpus, please listen to this sample (WAV)

*Updates* None at this time.
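The audio format stated in the description can be checked against a downloaded file. The following is a minimal sketch, assuming Python's standard `wave` module and a local copy of one utterance file; the function name `check_utterance` and its return shape are hypothetical and are not part of the corpus documentation. It tests for the single-channel, 48 kHz, 16-bit layout and reports how many samples the 100 msec padding corresponds to.

```python
import wave

EXPECTED_CHANNELS = 1        # single-channel, per the description
EXPECTED_RATE_HZ = 48_000    # 48 kHz sampling rate
EXPECTED_SAMPLE_BYTES = 2    # 16-bit PCM = 2 bytes per sample
PADDING_MS = 100             # silence before and after each utterance


def check_utterance(path):
    """Return (matches_format, duration_seconds, padding_samples) for a WAV file.

    Hypothetical helper: only the format values come from the corpus description.
    """
    with wave.open(path, "rb") as wav:
        matches = (
            wav.getnchannels() == EXPECTED_CHANNELS
            and wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_BYTES
        )
        # Duration in seconds is the frame count divided by the frame rate.
        duration = wav.getnframes() / wav.getframerate()
        # Number of samples that PADDING_MS of silence occupies at this rate.
        padding_samples = wav.getframerate() * PADDING_MS // 1000
    return matches, duration, padding_samples
```

At 48 kHz, 100 msec corresponds to 4,800 samples on each side of an utterance. As a side calculation from the figures above, 125 speakers reading 2,100 words each gives at most 262,500 utterances; the 261,535 published files are 965 fewer, which would be consistent with the removal of clipped files, although the record does not itemize the difference.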

Extent: Corpus size: 3984588 KB

Format: Sampling Rate: 48000

Sampling Format: pcm

Identifier: LDC2006S30

https://catalog.ldc.upenn.edu/LDC2006S30

ISBN: 1-58563-380-1

ISLRN: 185-835-412-868-8

DOI: 10.35111/6kce-vx52

Language: English

Language (ISO639): eng

License: Speech Controlled Computing (Non-Members): https://catalog.ldc.upenn.edu/license/speech-controlled-computing.pdf

Medium: Distribution: Web Download

Publisher: Linguistic Data Consortium

Publisher (URI): https://www.ldc.upenn.edu

Relation (URI): https://catalog.ldc.upenn.edu/docs/LDC2006S30

Rights Holder: © 2003-2006 Trustees of the University of Pennsylvania

Type (DCMI): Sound

Type (OLAC): primary_text

OLAC Info

Archive: The LDC Corpus Catalog

Description: http://www.language-archives.org/archive/www.ldc.upenn.edu

GetRecord: OAI-PMH request for OLAC format

GetRecord: Pre-generated XML file

OAI Info

OaiIdentifier: oai:www.ldc.upenn.edu:LDC2006S30

DateStamp: 2021-05-14

GetRecord: OAI-PMH request for simple DC format
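The GetRecord entries above refer to OAI-PMH requests whose link targets are not preserved in this record. As a rough sketch of what such a request looks like, the snippet below builds a GetRecord URL from the three arguments the protocol defines (`verb`, `identifier`, `metadataPrefix`). The base URL is a placeholder, since the repository endpoint is not stated here, and `get_record_url` is a hypothetical helper. `oai_dc` is the protocol's standard prefix for simple Dublin Core; `olac` is assumed to be the prefix this repository uses for the OLAC format.

```python
from urllib.parse import urlencode

# Placeholder endpoint: this record does not state the repository's base URL.
BASE_URL = "https://example.org/oai"

OAI_IDENTIFIER = "oai:www.ldc.upenn.edu:LDC2006S30"


def get_record_url(identifier, metadata_prefix, base_url=BASE_URL):
    """Build an OAI-PMH GetRecord request URL for a single record."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


# Simple Dublin Core (prefix defined by the OAI-PMH specification).
dc_url = get_record_url(OAI_IDENTIFIER, "oai_dc")
# OLAC format (prefix assumed, not confirmed by this record).
olac_url = get_record_url(OAI_IDENTIFIER, "olac")
```

`urlencode` percent-encodes the colons in the identifier, which is what a query string requires.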

Search Info
Citation: Cieri, Christopher; Miller, David; Martey, Nii O.; Maeda, Kazuaki. 2006. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Sound iso639_eng olac_primary_text

http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2006S30
Up-to-date as of: Wed Oct 29 7:00:54 EDT 2025
