OLAC Record oai:www.ldc.upenn.edu:LDC2001S15 |
Metadata | ||
Title: | Switchboard Cellular Part 1 Transcribed Audio | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Graff, David, Kevin Walker, and David Miller. Switchboard Cellular Part 1 Transcribed Audio LDC2001S15. Web Download. Philadelphia: Linguistic Data Consortium, 2001 | |
Contributor: | Graff, David | |
Walker, Kevin | ||
Miller, David | ||
Date (W3CDTF): | 2001 | |
Description: | *Introduction* Switchboard Cellular Part 1 Transcribed Audio was developed by the Linguistic Data Consortium (LDC) and consists of approximately 24 hours of English telephone conversations collected by LDC in 1999 and 2000. This release contains the speech data files that correspond to Switchboard Cellular Part 1 Transcription (LDC2001T14). The full set of conversations (approximately 109 hours) from the Switchboard Part 1 study is available in Switchboard Cellular Part 1 Audio (LDC2001S13). Switchboard Cellular Part 2 Audio (LDC2004S07) contains approximately 200 hours of English telephone conversations collected by LDC in the Switchboard Part 2 study. The Switchboard Part 1 cellular collection focused primarily on GSM cellular phone technology. The project aimed to recruit 190 subjects, balanced by gender, each taking part in ten or more five- to six-minute conversations on GSM cellular phones under varied environmental conditions. The speech data was collected for research, development, and evaluation of automatic systems for speech-to-text conversion, talker identification, language identification, and speech signal detection. *Data* Each speech file consists of a 1,024-byte ASCII-formatted Sphere header, followed by two-channel interleaved mu-law sample data. The mu-law samples represent the actual digital data transmission from the telephone service provider (MCI), as captured separately for each side of the telephone conversation by LDC's telephone collection platform. The header also indicates the caller_pin, callee_pin, topic_id, cellular service/handset information, and speaker demographic information. The documentation also contains reports on clipped files. *Updates* There are no updates at this time. | |
Format: | Sampling Rate: 8000 | |
Sampling Format: 2-channel ulaw | ||
Identifier: | LDC2001S15 | |
https://catalog.ldc.upenn.edu/LDC2001S15 | ||
ISBN: 1-58563-215-5 | ||
ISLRN: 183-382-664-496-3 | ||
DOI: 10.35111/3wcn-6c29 | ||
Language: | English | |
Language (ISO639): | eng | |
License: | LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf | |
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC2001S15 | |
Rights Holder: | Portions © 1999-2001 Trustees of the University of Pennsylvania | |
Type (DCMI): | Sound | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC2001S15 | |
DateStamp: | 2020-11-30 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Graff, David; Walker, Kevin; Miller, David. 2001. Linguistic Data Consortium. | |
Terms: | area_Europe country_GB dcmi_Sound iso639_eng olac_primary_text |
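Reading note: the Data description above (a 1,024-byte ASCII Sphere header followed by two-channel interleaved mu-law samples) can be illustrated with a minimal Python sketch. It is not LDC tooling. It assumes the usual NIST SPHERE header layout (a `NIST_1A` first line, `name -type value` fields, and a closing `end_head` line) and the standard G.711 mu-law expansion. The function names and the field names `channel_count`, `sample_rate`, and `sample_coding` used in the usage example are illustrative assumptions; the record itself names only `caller_pin`, `callee_pin`, and `topic_id`.

```python
HEADER_SIZE = 1024  # header length stated in the record's Data description


def parse_sphere_header(raw: bytes) -> dict:
    """Parse 'name -type value' fields from an ASCII SPHERE header (assumed layout)."""
    lines = raw[:HEADER_SIZE].decode("ascii", errors="replace").split("\n")
    if lines[0].strip() != "NIST_1A":
        raise ValueError("not a SPHERE header")
    fields = {}
    for line in lines[2:]:  # lines[1] holds the header size
        line = line.strip()
        if line == "end_head":
            break
        parts = line.split(None, 2)
        if len(parts) != 3:
            continue  # skip blank or malformed lines
        name, kind, value = parts
        if kind == "-i":
            fields[name] = int(value)
        elif kind == "-r":
            fields[name] = float(value)
        else:  # "-sN" string fields
            fields[name] = value
    return fields


def ulaw_to_linear(byte: int) -> int:
    """Expand one G.711 mu-law byte to a signed 16-bit linear sample."""
    u = ~byte & 0xFF
    exponent = (u >> 4) & 0x07
    mantissa = u & 0x0F
    magnitude = (((mantissa << 3) + 0x84) << exponent) - 0x84
    return -magnitude if u & 0x80 else magnitude


def split_channels(data: bytes):
    """Return (header fields, channel A samples, channel B samples)."""
    fields = parse_sphere_header(data)
    body = data[HEADER_SIZE:]
    # Interleaved layout: A0 B0 A1 B1 ... one byte per mu-law sample.
    side_a = [ulaw_to_linear(b) for b in body[0::2]]
    side_b = [ulaw_to_linear(b) for b in body[1::2]]
    return fields, side_a, side_b


if __name__ == "__main__":
    # Synthetic file built in memory; no corpus data is used here.
    text = (
        "NIST_1A\n   1024\n"
        "channel_count -i 2\n"
        "sample_rate -i 8000\n"
        "sample_coding -s4 ulaw\n"
        "end_head\n"
    )
    demo = text.encode("ascii").ljust(HEADER_SIZE, b" ") + bytes([0xFF, 0x80, 0x00, 0x7F])
    fields, side_a, side_b = split_channels(demo)
    print(fields["sample_rate"], side_a, side_b)
```

Each channel corresponds to one side of the conversation. With real corpus files, the header fields and their exact names should be checked against the documentation distributed with the release.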