OLAC Record oai:www.ldc.upenn.edu:LDC93S8 |
Metadata | ||
Title: | Switchboard Credit Card | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Godfrey, John J., and Ed Holliman. Switchboard Credit Card LDC93S8. Web Download. Philadelphia: Linguistic Data Consortium, 1993 | |
Contributor: | Godfrey, John J. | |
Holliman, Ed | ||
Date (W3CDTF): | 1993 | |
Description: | *Introduction* Switchboard Credit Card was developed by NIST and contains approximately eight hours of audio on the topic of "Credit Card Use" from the Switchboard corpus. This publication also includes transcriptions, time-alignments, and wordspotting targets. The full Switchboard corpus (LDC97S62) is a collection of about 2,400 two-sided telephone conversations among 543 speakers (302 male, 241 female) from all areas of the United States. A computer-driven robot operator system handled the calls: it gave the caller appropriate recorded prompts, selected and dialed another person (the callee) to take part in a conversation, introduced a topic for discussion, and recorded the speech from the two subjects into separate channels until the conversation was finished. About 70 topics were provided, of which about 50 were used frequently. Selection of topics and callees was constrained so that (1) no two speakers would converse together more than once and (2) no one spoke more than once on a given topic. The corpus was originally collected by Texas Instruments in 1990-91 under DARPA sponsorship. The first release of the corpus was published by NIST. *Data* Audio files are presented as two-channel, 8-bit, 8 kHz u-law encoded audio waveform files with standard NIST SPHERE headers. Transcriptions are text files containing interleaved transcriptions of both channels. Their headers describe various parameters of the conversation. Time-alignment files list each transcribed word with its start time and duration. Wordspotting target files contain a reference list of occurrences for a keyword. *Samples* * Audio sample (wav) * Transcription (txt) * Time alignment (mrk) * Wordspotting (ref) *Updates* None at this time. | |
Extent: | Corpus size: 215356 KB | |
Format: | Sampling Rate: 8000 | |
Sampling Format: 2-channel ulaw | ||
Identifier: | LDC93S8 | |
https://catalog.ldc.upenn.edu/LDC93S8 | ||
ISBN: 1-58563-016-0 | ||
ISLRN: 427-743-343-017-3 | ||
DOI: 10.35111/cmtf-v363 | ||
Language: | English | |
Language (ISO639): | eng | |
License: | LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf | |
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC93S8 | |
Rights Holder: | Portions © 1993 Trustees of the University of Pennsylvania | |
Type (DCMI): | Sound | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC93S8 | |
DateStamp: | 2024-05-16 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Godfrey, John J.; Holliman, Ed. 1993. Linguistic Data Consortium. | |
Terms: | area_Europe country_GB dcmi_Sound iso639_eng olac_primary_text |
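The Description above gives the audio encoding as two-channel, 8-bit, 8 kHz u-law. As a rough illustration of what that means for anyone reading the sample data, the following Python sketch expands u-law bytes to 16-bit linear samples using the standard G.711 expansion and separates the two speakers. It is a minimal sketch under stated assumptions, not part of the LDC distribution: the function names `ulaw_to_linear` and `split_channels` are hypothetical, the SPHERE header is assumed to have been stripped already, the payload is assumed to be uncompressed, and the two channels are assumed to be interleaved sample by sample.

```python
def ulaw_to_linear(code: int) -> int:
    """Expand one 8-bit u-law code to a signed 16-bit linear sample (G.711)."""
    u = ~code & 0xFF                 # u-law bytes are stored bit-inverted
    sign = u & 0x80                  # top bit: set means negative
    exponent = (u >> 4) & 0x07       # 3-bit segment number
    mantissa = u & 0x0F              # 4-bit step within the segment
    magnitude = (((mantissa << 3) + 0x84) << exponent) - 0x84
    return -magnitude if sign else magnitude


def split_channels(payload: bytes) -> tuple[list[int], list[int]]:
    """Decode an interleaved two-channel u-law payload into two sample lists.

    Assumption: samples alternate channel 1, channel 2, channel 1, ...
    and the file header has already been removed from `payload`.
    """
    first = [ulaw_to_linear(b) for b in payload[0::2]]
    second = [ulaw_to_linear(b) for b in payload[1::2]]
    return first, second


if __name__ == "__main__":
    # Four bytes = two sample frames of a hypothetical two-channel payload.
    side_a, side_b = split_channels(bytes([0xFF, 0x80, 0x7F, 0x00]))
    print(side_a, side_b)
```

At the stated rate of 8000 samples per second, each channel list holds 8000 decoded values for every second of conversation.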