OLAC Record
oai:www.ldc.upenn.edu:LDC98S67

Metadata
Title:HTIMIT
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Reynolds, Douglas. HTIMIT LDC98S67. Web Download. Philadelphia: Linguistic Data Consortium, 1998
Contributor:Reynolds, Douglas
Date (W3CDTF):1998
Description:*Introduction* The HTIMIT corpus is a re-recording of a subset of the TIMIT corpus through different telephone handsets. The aim was to create a corpus for the study of telephone transducer effects on speech which minimized confounding factors, such as variable telephone channels and background noise. HTIMIT was created by playing ten TIMIT sentences from 192 male and 192 females through a stereo loudspeaker into different transducers positioned directly in front of the loudspeaker and digitizing the output from the transducers. Ten (10) transducers (telephone handsets) were used. Most of these were not new; handsets with obvious damage were not used, but in order to obtain some diversity with a limited number of handsets, handsets were selected to have variable sound characteristics, transducer designs or, in the case of electrets, different grill designs. Further information about the handsets is provided in the corpus documentation. *Data* The collection procedure was not ideal with respect to realism of sound transduction, but it does allow for the collection of speech from a large number of speakers repeating identical speech on each instance. Furthermore, coupled with the phonetic markings from the original TIMIT corpus, HTIMIT offers the ability to study handset transducer effects on speech recognition systems. To address the realism of the sound transduction in HTIMIT, a second corpus using the same handsets but with live people speaking into the handsets is also available. This corpus is called the Lincoln Laboratory Handset Database (LLHDB) LDC98S68. *Updates* There are no updates at this time.
Format:Sampling Rate: 8000
Sampling Format: 1-channel pcm
Identifier:LDC98S67
https://catalog.ldc.upenn.edu/LDC98S67
ISBN: 1-58563-130-2
ISLRN: 866-042-083-505-7
DOI: 10.35111/xk0c-xj95
Language:English
Language (ISO639):eng
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC98S67
Type (DCMI):Sound
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC98S67
DateStamp:  2020-11-30
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Reynolds, Douglas. 1998. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Sound iso639_eng olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC98S67
Up-to-date as of: Mon Mar 25 7:20:02 EDT 2024