OLAC Record: ATIS0 Pilot

OLAC Record
oai:www.ldc.upenn.edu:LDC93S4B

Metadata

Title: ATIS0 Pilot

Access Rights: Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining

Bibliographic Citation: Hemphill, Charles T., et al. ATIS0 Pilot LDC93S4B. Web Download. Philadelphia: Linguistic Data Consortium, 1993

Contributor: Hemphill, Charles T.

Godfrey, John J.

Doddington, George R.

Garofolo, John S.

Fiscus, Jonathan G.

Dahlgren, Nancy

Fisher, William

Tjaden, Brett

Pallett, David

Date (W3CDTF): 1993

Description: *Introduction* ATIS0 Pilot comprises about four hours of spontaneous speech, read speech and other material from participants in the ATIS collection. Other corpora in the collection are: ATIS0 Read (LDC93S4B-2) and ATIS0 SD-Read (LDC93S4B-3). ATIS0 Complete (LDC93S4A) contains all three corpora. The ATIS (Air Travel Information Services) collection was developed to support the research and development of speech understanding systems. Participants were presented with various hypothetical travel planning scenarios and asked to solve them by interacting with partially or completely automated ATIS systems. The resulting utterances were recorded and transcribed. Data was collected in the early 1990s at five US sites: Raytheon BBN, Carnegie Mellon University, MIT Laboratory for Computer Science, National Institute of Standards and Technology and SRI International. ATIS0 Pilot contains spontaneous utterances elicited in a "Wizard-of-Oz" simulation, along with a relational database containing travel information (excluding connecting flights). In that data set, 36 speakers (24 male, 12 female) produced a total of 912 utterances. *Data* ATIS speech data was recorded at a 16 kHz sample rate with 16-bit quantization, from two microphones: a close-talking model (Sennheiser HMD414) and a desk-top model (Crown PCC-160). Utterances were transcribed. *Samples* Please view this audio sample (wav) and transcript sample (txt). *Updates* None at this time.

Format: Sampling Rate: 16000

Sampling Format: 1-channel pcm
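
To make the format fields concrete, the following minimal Python sketch estimates the raw, uncompressed PCM payload implied by a 16000 Hz sampling rate, 16-bit quantization, and one channel. The function name `pcm_bytes` and its parameters are illustrative assumptions. The figure ignores any file headers or compression the distributed files may use, so it is an estimate, not the actual download size.

```python
SAMPLE_RATE_HZ = 16_000   # "Sampling Rate: 16000" from the record
BITS_PER_SAMPLE = 16      # "16-bit quantization" from the description
CHANNELS = 1              # "1-channel pcm" from the record


def pcm_bytes(duration_s, sample_rate=SAMPLE_RATE_HZ,
              bits=BITS_PER_SAMPLE, channels=CHANNELS):
    """Return the raw PCM payload size in bytes for a given duration.

    Hypothetical helper: counts sample data only, with no header
    and no compression.
    """
    samples = round(duration_s * sample_rate)
    return samples * channels * (bits // 8)


if __name__ == "__main__":
    # One second and one hour of single-channel audio.
    print(pcm_bytes(1.0))
    print(pcm_bytes(3600))
```

Under these assumptions one second of audio is 32000 bytes and one hour is 115200000 bytes (about 115 MB) per channel.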

Identifier: LDC93S4B

https://catalog.ldc.upenn.edu/LDC93S4B

ISBN: 1-58563-002-0

ISLRN: 477-521-980-972-9

DOI: 10.35111/4t8c-r397

Language: English

Language (ISO639): eng

License: LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf

Medium: Distribution: Web Download

Publisher: Linguistic Data Consortium

Publisher (URI): https://www.ldc.upenn.edu

Relation (URI): https://catalog.ldc.upenn.edu/docs/LDC93S4B

Rights Holder: Portions © 1993 Trustees of the University of Pennsylvania

Type (DCMI): Sound

Type (OLAC): primary_text

OLAC Info

Archive: The LDC Corpus Catalog

Description: http://www.language-archives.org/archive/www.ldc.upenn.edu

GetRecord: OAI-PMH request for OLAC format

GetRecord: Pre-generated XML file

OAI Info

OaiIdentifier: oai:www.ldc.upenn.edu:LDC93S4B

DateStamp: 2024-05-08

GetRecord: OAI-PMH request for simple DC format
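
As an illustration of what the GetRecord links stand for, the sketch below builds an OAI-PMH GetRecord request URL for this record. The `verb`, `identifier`, and `metadataPrefix` parameters come from the OAI-PMH protocol, and `oai_dc` is its standard prefix for simple Dublin Core. The base URL is a placeholder, not the archive's real endpoint, and the `olac` prefix for the OLAC format is assumed here. The function name `get_record_url` is hypothetical.

```python
from urllib.parse import urlencode

# Placeholder endpoint (assumption): substitute the repository's real base URL.
BASE_URL = "https://example.org/oai"

# Identifier taken from the OaiIdentifier field of this record.
OAI_IDENTIFIER = "oai:www.ldc.upenn.edu:LDC93S4B"


def get_record_url(base_url, identifier, metadata_prefix):
    """Build an OAI-PMH GetRecord request URL (hypothetical helper)."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


if __name__ == "__main__":
    # Simple Dublin Core, then the (assumed) OLAC metadata prefix.
    print(get_record_url(BASE_URL, OAI_IDENTIFIER, "oai_dc"))
    print(get_record_url(BASE_URL, OAI_IDENTIFIER, "olac"))
```

The response to such a request is an XML document. Parsing it is left out because the record does not show its structure.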

Search Info
Citation: Hemphill, Charles T.; Godfrey, John J.; Doddington, George R.; Garofolo, John S.; Fiscus, Jonathan G.; Dahlgren, Nancy; Fisher, William; Tjaden, Brett; Pallett, David. 1993. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Sound iso639_eng olac_primary_text

http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC93S4B
Up-to-date as of: Wed Oct 29 7:00:29 EDT 2025
