OLAC Record oai:www.ldc.upenn.edu:LDC94S21 |
Metadata | ||
Title: | MACROPHONE | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Bernstein, Jared, Kelsey Taussig, and Jack Godfrey. MACROPHONE LDC94S21. Web Download. Philadelphia: Linguistic Data Consortium, 1994 | |
Contributor: | Bernstein, Jared | |
Taussig, Kelsey | ||
Godfrey, Jack | ||
Date (W3CDTF): | 1994 | |
Description: | *Introduction* MACROPHONE consists of approximately 200,000 utterances by 5,000 speakers. It is designed to provide material sufficient and suitable for research, development and evaluation of automatic speech recognition technology for common telephone applications, such as shopping, transportation, database access and autodialing. In addition to application-oriented phrases and numerous digit strings, seven sentences are spoken by each talker to provide ensemble phoneme, diphone and triphone coverage of the language. The spoken material also refers to times, locations, monetary amounts, spellings and interactive operations. *Data* The utterances were collected automatically over the telephone network by recording directly from a T1 connection in 8 kHz, 8-bit mu-law format. The participants, roughly equal numbers of males and females, were solicited by a marketing firm from all regions of the United States. They ranged in age from the teens to the seventies and represented a broad range of educations and incomes as well. Each recorded utterance is accompanied by an orthographic transcription which also notes any unusual acoustic events or anomalies. Macrophone is the American English contribution to an international database of telephone speech corpora called POLYPHONE. Similar data sets are expected for major languages of the world and at least some of these will be made available through LDC. Prospects are currently good for American Spanish (by early 1995), Dutch, Standard French, Standard German, Japanese, Mandarin Chinese, Swiss French and Danish versions of POLYPHONE, all with basically the same structure and methods of collection. MACROPHONE was collected at SRI under LDC sponsorship. A paper describing it was presented at ICASSP-94: "Macrophone: An American English Telephone Speech Corpus for the POLYPHONE Project," by Jared Bernstein, Kelsey Taussig and Jack Godfrey. *Samples* Please listen to this audio sample. *Updates* None at this time. | |
Extent: | Corpus size: 4910733 KB | |
Format: | Sampling Rate: 8000 | |
Sampling Format: 1-channel ulaw compressed | ||
Identifier: | LDC94S21 | |
https://catalog.ldc.upenn.edu/LDC94S21 | ||
ISBN: 1-58563-034-9 | ||
ISLRN: 593-364-872-062-5 | ||
DOI: 10.35111/0mh7-c698 | ||
Language: | English | |
Language (ISO639): | eng | |
License: | LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf | |
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC94S21 | |
Rights Holder: | Portions © 1994 Trustees of the University of Pennsylvania | |
Type (DCMI): | Sound | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC94S21 | |
DateStamp: | 2020-11-30 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Bernstein, Jared; Taussig, Kelsey; Godfrey, Jack. 1994. Linguistic Data Consortium. | |
Terms: | area_Europe country_GB dcmi_Sound iso639_eng olac_primary_text |