OLAC Record oai:www.ldc.upenn.edu:LDC93S4A |
Metadata | ||
Title: | ATIS0 Complete | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Hemphill, Charles T., et al. ATIS0 Complete LDC93S4A. Web Download. Philadelphia: Linguistic Data Consortium, 1993 | |
Contributor: | Hemphill, Charles T. | |
Godfrey, John J. | ||
Doddington, George R. | ||
Garofolo, John S. | ||
Fiscus, Jonathan G. | ||
Date (W3CDTF): | 1993 | |
Description: | *Introduction* ATIS0 Complete comprises about 20 hours of spontaneous speech, read speech and other material from participants in the ATIS collection, as contained in the following corpora: ATIS0 Pilot (LDC93S4B), ATIS0 Read (LDC93S4B-2) and ATIS0 SD-Read (LDC93S4B-3). The ATIS (Air Travel Information Services) collection was developed to support the research and development of speech understanding systems. Participants were presented with various hypothetical travel planning scenarios and asked to solve them by interacting with partially or completely automated ATIS systems. The resulting utterances were recorded and transcribed. Data was collected in the early 1990s at five US sites: Raytheon BBN, Carnegie Mellon University, MIT Laboratory for Computer Science, National Institute of Standards and Technology and SRI International. ATIS0 Pilot contains spontaneous utterances elicited in a "Wizard-of-Oz" simulation, along with a relational database containing travel information (excluding connecting flights). In that data set, 36 speakers (24 male, 12 female) produced a total of 912 utterances. ATIS0 Read contains "read" versions of the spontaneous utterances for 20 of the 36 speakers above (11 male, 9 female), for a total of 478 productions. This is supplemented by a set of 40 "adaptation" sentences read by each of the 20 speakers. ATIS0 SD-Read contains "read" speech in the ATIS domain from ten of the ATIS0 Pilot speakers (five male, five female), for a total of 3,171 utterances, or approximately 317 utterances per speaker. *Data* ATIS speech data was recorded at a 16 kHz sample rate with 16-bit quantization, from two microphones: a close-talking model (Sennheiser HMD414) and a desktop model (Crown PCC-160). Utterances were transcribed. *Samples* Please view this audio sample (wav) and transcript sample (txt). *Updates* None at this time. | |
Extent: | Corpus size: 2579704 KB | |
Format: | Sampling Rate: 16000 | |
Sampling Format: pcm | ||
Identifier: | LDC93S4A | |
https://catalog.ldc.upenn.edu/LDC93S4A | ||
ISBN: 1-58563-001-2 | ||
ISLRN: 101-041-175-695-3 | ||
DOI: 10.35111/g20s-2a57 | ||
Language: | English | |
Language (ISO639): | eng | |
License: | LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf | |
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC93S4A | |
Rights Holder: | Portions © 1993 Trustees of the University of Pennsylvania | |
Type (DCMI): | Sound | |
Text | ||
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC93S4A | |
DateStamp: | 2024-05-08 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Hemphill, Charles T.; Godfrey, John J.; Doddington, George R.; Garofolo, John S.; Fiscus, Jonathan G. 1993. Linguistic Data Consortium. | |
Terms: | area_Europe country_GB dcmi_Sound dcmi_Text iso639_eng olac_primary_text |