OLAC Record
oai:www.ldc.upenn.edu:LDC93S10

Metadata
Title:TIDIGITS
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:R. Gary Leonard, and George Doddington. TIDIGITS LDC93S10. Web Download. Philadelphia: Linguistic Data Consortium, 1993
Contributor:R. Gary Leonard
Doddington, George R.
Date (W3CDTF):1993
Description:*Introduction* TIDIGITS was developed by Texas Instruments, Inc. (TI) and consists of approximately 13 hours of digit sequences in English spoken by over 300 men, women, and children. This corpus contains speech which was originally designed and collected at TI for the purpose of designing and evaluating algorithms for speaker-independent recognition of connected digit sequences. *Data* The corpus was collected at TI in 1982 in a quiet acoustic enclosure using an Electro-Voice RE-16 Dynamic Cardiod microphone, digitized at 20kHz. The waveform files are single channel, 16-bit files in the NIST SPHERE format. There are 326 speakers (111 men, 114 women, 50 boys and 51 girls) each pronouncing 77 digit sequences. Each speaker group is partitioned into test and training subsets. Speaker metadata includes gender, age, and dialect. *Samples* * Audio (sphere) *Updates* As of April, 2015, TIDIGITS is also available in flac compressed wav. This package is available to licensees as an additional download. Not included in this version are the folders relating to handling the shortened sphere files of the original corpus.
Extent:Corpus size: 1597000 KB
Format:Sampling Rate: 20000
Sampling Format: pcm
Identifier:LDC93S10
https://catalog.ldc.upenn.edu/LDC93S10
ISBN: 1-58563-018-7
ISLRN: 177-353-807-744-3
DOI: 10.35111/72xz-6x59
Language:English
Language (ISO639):eng
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC93S10
Rights Holder:Portions © 1993 Trustees of the University of Pennsylvania
Type (DCMI):Sound
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC93S10
DateStamp:  2024-04-11
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: R. Gary Leonard; Doddington, George R. 1993. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Sound iso639_eng olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC93S10
Up-to-date as of: Fri Dec 6 7:47:01 EST 2024