OLAC Record
oai:www.ldc.upenn.edu:LDC96S38

Metadata
Title:DCIEM/HCRC
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Taylor, Martin, et al. DCIEM/HCRC LDC96S38. Web Download. Philadelphia: Linguistic Data Consortium, 1996
Contributor:Taylor, Martin
Bard, Ellen Gurman
Sotillo, Cathy
McKelvie, David
Anderson, Anne
Date (W3CDTF):1996
Description:*Introduction* DCIEM/HCRC was developed by the Defence and Civil Institute of Environmental Medicine in Canada and the Human Communication Research Centre at the University of Edinburgh and the University of Glasgow. It contains approximately 23 hours of English speech data along with corresponding transcripts from 36 participants, 34 male and 2 female. This release contains the materials used to collect all 216 spoken dialogues digital audio, orthographic transcriptions, documentation and source code for tools. The dialogues were selected to provide balanced representation at different points in a sleep deprivation experiment. *Data* The top-level directory contains the following files: * 0dir.txt: A complete listing of all files, giving the CD on which each can be found. * 0direye.txt: A complete listing of all dialogues, giving the CD on which each can be found, in a form more convenient for visual scanning. * read.me: A readme file, with the part and CD number changing from one CD to the next. The top-level directory contains the following directories: * doc/ ASCII and/or PostScript(TM) versions of various documents on the corpus: START HERE * lib/ Resources for included tools * trn_all/ All the transcripts * etc/ Information about participants and maps * src/ UNIX(TM) scripts and C sources for useful tools, emacs interface, world wide web interface and a Microsoft Windows(tm) sound playing program. In addition to the common directories, each also contains * run1/ * run2/ Any run/ directory contains sampled audio, transcripts, and maps for one of the six runs of the sleep deprivation experiment. Each conversation directory has the following files: * NIST header (.nst) * sampled speech (.ses) * annotated orthographic transcription(.trn) * giver's map (.gmp) * follower's map (.fmp) * TEI entry-point (.sgm) Audio data is presented as 2-channel, 16-bit, 20 kHz ses files. Metadata including participant age, gender, and birthplace are included. The materials have been designed to be easily accessible to users with different equipment and a variety of needs from those who merely wish to generate hardcopies of the orthographic transcriptions to those who require computational analyses of the speech material. All the text files (transcriptions and documentation) should be readable and printable via most systems. The maps are intended for printing via POSTSCRIPT printers and the speech files are provided with human-readable standard headers, enabling them to be played by a wide range of environments for processing sampled speech. *Samples* Please view this speech sample and transcript sample. *Updates* There are no updates at this time.
Extent:Corpus size: 6551869 KB
Format:Sampling Rate: 20000
Sampling Format: 2-channel pcm
Identifier:LDC96S38
https://catalog.ldc.upenn.edu/LDC96S38
ISBN: 1-58563-089-6
ISLRN: 139-466-600-760-1
DOI: 10.35111/4540-j072
Language:English
Language (ISO639):eng
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC96S38
Rights Holder:Portions © 1995 Defence and Civil Institute of Environmental Medicine, © 1996 Trustees of the University of Pennsylvania
Type (DCMI):Sound
Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC96S38
DateStamp:  2024-09-25
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Taylor, Martin; Bard, Ellen Gurman; Sotillo, Cathy; McKelvie, David; Anderson, Anne. 1996. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Sound dcmi_Text iso639_eng olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC96S38
Up-to-date as of: Fri Dec 6 7:47:14 EST 2024