OLAC Record: Switchboard Credit Card

OLAC Record
oai:www.ldc.upenn.edu:LDC93S8

Metadata

Title: Switchboard Credit Card

Access Rights: Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining

Bibliographic Citation: Godfrey, John J., and Ed Holliman. Switchboard Credit Card LDC93S8. Web Download. Philadelphia: Linguistic Data Consortium, 1993

Contributor: Godfrey, John J.

Contributor: Holliman, Ed

Date (W3CDTF): 1993

Description:

*Introduction*

Switchboard Credit Card was developed by NIST and contains approximately eight hours of audio on the topic of "Credit Card Use" from the Switchboard corpus. This publication also includes transcriptions, time alignments, and wordspotting targets.

The full Switchboard corpus (LDC97S62) is a collection of about 2,400 two-sided telephone conversations among 543 speakers (302 male, 241 female) from all areas of the United States. A computer-driven robot operator system handled the calls: it gave the caller appropriate recorded prompts, selected and dialed another person (the callee) to take part in a conversation, introduced a topic for discussion, and recorded the speech from the two subjects into separate channels until the conversation was finished. About 70 topics were provided, of which about 50 were used frequently. Selection of topics and callees was constrained so that (1) no two speakers would converse together more than once and (2) no one spoke more than once on a given topic. The full corpus was originally collected by Texas Instruments in 1990-91 under DARPA sponsorship, and its first release was published by NIST.

*Data*

Audio files are two-channel, 8-bit, 8 kHz u-law encoded waveform files with standard NIST SPHERE headers. Transcriptions are text files containing interleaved transcriptions of both channels; their headers describe various parameters of the conversation. Time-alignment files list the individually transcribed words with their start times and durations. Wordspotting target files contain a reference list of occurrences of a key word.

*Samples*

* Audio sample (wav)
* Transcription (txt)
* Time alignment (mrk)
* Wordspotting (ref)

*Updates*

None at this time.
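The Description says the audio files carry standard NIST SPHERE headers. As a rough illustration of what that means in practice, the sketch below reads the plain-text header block that opens a SPHERE file. The function name `parse_sphere_header` and the sample header bytes are hypothetical; the field values in the sample simply mirror this record (8000 Hz, two channels, u-law) and were not taken from an actual corpus file.

```python
def parse_sphere_header(data: bytes) -> dict:
    """Parse the ASCII header at the start of a NIST SPHERE file.

    The header opens with the line "NIST_1A", then a line giving the header
    size in bytes, then "name -type value" lines, and ends with "end_head".
    """
    parts = data.split(b"\n", 2)
    if len(parts) < 3 or parts[0].strip() != b"NIST_1A":
        raise ValueError("not a NIST SPHERE header")
    header_size = int(parts[1].decode("ascii").strip())
    text = data[:header_size].decode("ascii", errors="replace")

    fields = {}
    for line in text.split("\n")[2:]:
        line = line.strip()
        if line == "end_head":
            break
        tokens = line.split(None, 2)
        if len(tokens) != 3:
            continue  # skip blank or malformed lines
        name, ftype, value = tokens
        if ftype == "-i":        # integer field
            fields[name] = int(value)
        elif ftype == "-r":      # real-valued field
            fields[name] = float(value)
        else:                    # "-sN": string field of length N
            fields[name] = value
    return fields


# Illustrative header only; values mirror this record, not a real file.
example_header = (
    b"NIST_1A\n"
    b"   1024\n"
    b"sample_rate -i 8000\n"
    b"channel_count -i 2\n"
    b"sample_coding -s4 ulaw\n"
    b"sample_n_bytes -i 1\n"
    b"end_head\n"
).ljust(1024, b" ")

info = parse_sphere_header(example_header)
print(info)
```

For a real file, the first 1024 bytes (or however many the second header line declares) could be read from disk and passed to the same function.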

Extent: Corpus size: 215356 KB

Format: Sampling Rate: 8000

Format: Sampling Format: 2-channel ulaw
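To make the "2-channel ulaw" format concrete, here is a minimal sketch of the standard G.711 u-law expansion applied to a two-channel byte stream. It assumes one byte per sample, with the two conversation sides interleaved sample by sample and the payload stored uncompressed; the record does not confirm either point, so check the file header before relying on it. The names `ulaw_to_linear` and `split_channels` and the four payload bytes are illustrative only.

```python
def ulaw_to_linear(code: int) -> int:
    """Expand one 8-bit G.711 u-law code to a signed linear sample."""
    u = ~code & 0xFF                      # u-law bytes are stored inverted
    exponent = (u >> 4) & 0x07            # 3-bit segment number
    mantissa = u & 0x0F                   # 4-bit step within the segment
    magnitude = (((mantissa << 3) + 0x84) << exponent) - 0x84
    return -magnitude if u & 0x80 else magnitude


def split_channels(payload: bytes) -> tuple[list[int], list[int]]:
    """Split an interleaved two-channel u-law payload into two sample lists.

    Assumption: bytes alternate side A, side B, side A, side B, ...
    """
    side_a = [ulaw_to_linear(b) for b in payload[0::2]]
    side_b = [ulaw_to_linear(b) for b in payload[1::2]]
    return side_a, side_b


# Four made-up bytes: two samples for each side of the conversation.
payload = bytes([0xFF, 0x80, 0x00, 0x7F])
side_a, side_b = split_channels(payload)
print(side_a, side_b)
```

At 8000 samples per second and one byte per sample per channel, each second of a two-sided conversation occupies 16,000 bytes of uncompressed payload under these assumptions.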

Identifier: LDC93S8

Identifier: https://catalog.ldc.upenn.edu/LDC93S8

Identifier: ISBN: 1-58563-016-0

Identifier: ISLRN: 427-743-343-017-3

Identifier: DOI: 10.35111/cmtf-v363

Language: English

Language (ISO639): eng

License: LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf

Medium: Distribution: Web Download

Publisher: Linguistic Data Consortium

Publisher (URI): https://www.ldc.upenn.edu

Relation (URI): https://catalog.ldc.upenn.edu/docs/LDC93S8

Rights Holder: Portions © 1993 Trustees of the University of Pennsylvania

Type (DCMI): Sound

Type (OLAC): primary_text

OLAC Info

Archive: The LDC Corpus Catalog

Description: http://www.language-archives.org/archive/www.ldc.upenn.edu

GetRecord: OAI-PMH request for OLAC format

GetRecord: Pre-generated XML file

OAI Info

OaiIdentifier: oai:www.ldc.upenn.edu:LDC93S8

DateStamp: 2024-05-16

GetRecord: OAI-PMH request for simple DC format
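The GetRecord entries above refer to OAI-PMH requests whose link targets are not preserved in this listing. As a sketch of how such a request is typically formed, the snippet below builds a GetRecord URL from the OaiIdentifier. The base URL is a placeholder, since the archive's actual endpoint is not stated here; `olac` and `oai_dc` are the metadata prefixes conventionally used for the OLAC and simple Dublin Core formats.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: this record does not state the archive's base URL.
BASE_URL = "https://example.org/oai"


def get_record_url(identifier: str, metadata_prefix: str = "olac",
                   base_url: str = BASE_URL) -> str:
    """Build an OAI-PMH GetRecord request URL for one record."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


olac_url = get_record_url("oai:www.ldc.upenn.edu:LDC93S8")
dc_url = get_record_url("oai:www.ldc.upenn.edu:LDC93S8", "oai_dc")
print(olac_url)
print(dc_url)
```

Fetching either URL from a real endpoint would return an XML response; parsing that response is outside the scope of this sketch.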

Search Info
Citation: Godfrey, John J.; Holliman, Ed. 1993. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Sound iso639_eng olac_primary_text

http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC93S8
Up-to-date as of: Wed Oct 29 7:00:30 EDT 2025
