OLAC Record
oai:www.ldc.upenn.edu:LDC94S14B

Metadata
Title:Air Traffic Control BOS
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Godfrey, John J.. Air Traffic Control BOS LDC94S14B. Web Download. Philadelphia: Linguistic Data Consortium, 1994
Contributor:Godfrey, John J.
Date (W3CDTF):1994
Description:LDC94S14A - Complete ATC0 corpus LDC94S14B - ATC0 Logan International LDC94S14C - ATC0 Washington National LDC94S14D - ATC0 Dallas Fort Worth *Introduction* The Air Traffic Control Corpus (ATC0) is a set of recorded speech for use in supporting research and development activities in the area of robust speech recognition in domains similar to air traffic control (several speakers, noisy channels, relatively small vocabulary, constrained languaged, etc.) The audio data is composed of voice communication traffic between various controllers and pilots. *Data* The audio files are 8 KHz, 16-bit linear sampled data, representing continuous monitoring, without squelch or silence elimination, of a single FAA frequency for one to two hours. There are also files which indicate the amplitude of the received AM carrier signal at 10 msec. intervals. Full transcripts, including the start and end times of each transmission, are provided for each audio file. Each flight is identified by its flight number. ATC0 consists of three subcorpora, one for each airport in which the transmissions were collected -- Dallas Fort Worth (DFW), Logan International (BOS) and Washington National (DCA). The complete set contains approximately 70 hours of controller and pilot transmissions collected via antennas and radio receivers which were located in the vicinity of the respective airports. Detailed information regarding the collection process and the equipment used can be found on in the file, "atc.doc" in the "doc" directory. The ATC0 Corpus was collected by Texas Instruments under contract to DARPA. It was produced on CD-ROM by the National Institute of Standards and Technology for distribution by the Linguistic Data Consortium. *Updates* This corpus is now available as a downloadable file, some documentation may still refer to the original CD-ROMs. Relative to the CD-ROMs produced in 1994 by NIST, the sphere files were renamed with the .sph extension, instead of the .wav extension.
Format:Sampling Rate: 8000
Sampling Format: 1-channel pcm
Identifier:LDC94S14B
https://catalog.ldc.upenn.edu/LDC94S14B
ISBN: 1-58563-025X
ISLRN: 303-675-958-561-9
DOI: 10.35111/92e3-w996
Language:English
Language (ISO639):eng
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Rights Holder:Portions © 1994 Trustees of the University of Pennsylvania
Type (DCMI):Sound
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC94S14B
DateStamp:  2020-11-30
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Godfrey, John J. 1994. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Sound iso639_eng olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC94S14B
Up-to-date as of: Fri Dec 6 7:47:06 EST 2024