OLAC Record oai:www.ldc.upenn.edu:LDC96S38 |
Metadata | ||
Title: | DCIEM/HCRC | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Taylor, Martin, et al. DCIEM/HCRC LDC96S38. Web Download. Philadelphia: Linguistic Data Consortium, 1996 | |
Contributor: | Taylor, Martin | |
Bard, Ellen Gurman | ||
Sotillo, Cathy | ||
McKelvie, David | ||
Anderson, Anne | ||
Date (W3CDTF): | 1996 | |
Description: | *Introduction* DCIEM/HCRC was developed by the Defence and Civil Institute of Environmental Medicine in Canada and the Human Communication Research Centre at the University of Edinburgh and the University of Glasgow. It contains approximately 23 hours of English speech data along with corresponding transcripts from 36 participants, 34 male and 2 female. This release contains the materials used to collect all 216 spoken dialogues digital audio, orthographic transcriptions, documentation and source code for tools. The dialogues were selected to provide balanced representation at different points in a sleep deprivation experiment. *Data* The top-level directory contains the following files: * 0dir.txt: A complete listing of all files, giving the CD on which each can be found. * 0direye.txt: A complete listing of all dialogues, giving the CD on which each can be found, in a form more convenient for visual scanning. * read.me: A readme file, with the part and CD number changing from one CD to the next. The top-level directory contains the following directories: * doc/ ASCII and/or PostScript(TM) versions of various documents on the corpus: START HERE * lib/ Resources for included tools * trn_all/ All the transcripts * etc/ Information about participants and maps * src/ UNIX(TM) scripts and C sources for useful tools, emacs interface, world wide web interface and a Microsoft Windows(tm) sound playing program. In addition to the common directories, each also contains * run1/ * run2/ Any run/ directory contains sampled audio, transcripts, and maps for one of the six runs of the sleep deprivation experiment. Each conversation directory has the following files: * NIST header (.nst) * sampled speech (.ses) * annotated orthographic transcription(.trn) * giver's map (.gmp) * follower's map (.fmp) * TEI entry-point (.sgm) Audio data is presented as 2-channel, 16-bit, 20 kHz ses files. Metadata including participant age, gender, and birthplace are included. The materials have been designed to be easily accessible to users with different equipment and a variety of needs from those who merely wish to generate hardcopies of the orthographic transcriptions to those who require computational analyses of the speech material. All the text files (transcriptions and documentation) should be readable and printable via most systems. The maps are intended for printing via POSTSCRIPT printers and the speech files are provided with human-readable standard headers, enabling them to be played by a wide range of environments for processing sampled speech. *Samples* Please view this speech sample and transcript sample. *Updates* There are no updates at this time. | |
Extent: | Corpus size: 6551869 KB | |
Format: | Sampling Rate: 20000 | |
Sampling Format: 2-channel pcm | ||
Identifier: | LDC96S38 | |
https://catalog.ldc.upenn.edu/LDC96S38 | ||
ISBN: 1-58563-089-6 | ||
ISLRN: 139-466-600-760-1 | ||
DOI: 10.35111/4540-j072 | ||
Language: | English | |
Language (ISO639): | eng | |
License: | LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf | |
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC96S38 | |
Rights Holder: | Portions © 1995 Defence and Civil Institute of Environmental Medicine, © 1996 Trustees of the University of Pennsylvania | |
Type (DCMI): | Sound | |
Text | ||
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC96S38 | |
DateStamp: | 2024-09-25 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Taylor, Martin; Bard, Ellen Gurman; Sotillo, Cathy; McKelvie, David; Anderson, Anne. 1996. Linguistic Data Consortium. | |
Terms: | area_Europe country_GB dcmi_Sound dcmi_Text iso639_eng olac_primary_text |