OLAC Record: DIRHA English WSJ Audio

OLAC Record
oai:www.ldc.upenn.edu:LDC2018S01

Metadata

Title: DIRHA English WSJ Audio

Access Rights: Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining

Bibliographic Citation: Ravanelli, Mirco, Luca Cristoforetti, and Maurizio Omologo. DIRHA English WSJ Audio LDC2018S01. Web Download. Philadelphia: Linguistic Data Consortium, 2018

Contributor: Ravanelli, Mirco

Cristoforetti, Luca

Omologo, Maurizio

Date (W3CDTF): 2018

Date Issued (W3CDTF): 2018-01-16

Description: *Introduction* DIRHA English WSJ Audio was developed as part of the Distant-Speech Interaction for Robust Home Applications (DIRHA) Project which addressed natural spontaneous speech interaction with distant microphones in a domestic environment. It is comprised of approximately 85 hours of real and simulated read speech by six native American English speakers. The target utterances were taken from CSR-I (WSJ0) Complete (LDC93S6A), specifically, the 5,000 word subset of read speech from Wall Street Journal news text. This release contains signals of different characteristics in terms of noise and reverberation making it suitable for various multi-microphone signal processing and distant speech recognition tasks. The corpus can be coupled with related Kaldi baselines and tools that are available here. *Data* Speech was collected in a real apartment setting with typical domestic background noise and inter/intra-room reverberation effects. A total of 32 microphones were placed in the living-room (26 microphones) and in the kitchen (6 microphones). The original recordings were made at a sampling frequency of 48 kHz. However, for the sake of compactness, the released signals in this publication are in wav format with 16 kHz sampling frequency and 16 bit resolution. Annotations for each acoustic sequence are included in xml format, such as microphone positions, speaker id, speaker gender and speaker position. Additional metadata about the speakers and images of the apartment setting are also provided. Consult the documentation accompanying this release for more information about the collection. *Samples* Please view this audio sample and annotation sample. *Updates* None at this time.

Extent: Corpus size: 9686640 KB

Format: Sampling Rate: 16000

Sampling Format: pcm

Identifier: LDC2018S01

https://catalog.ldc.upenn.edu/LDC2018S01

ISBN: 1-58563-831-5

ISLRN: 112-363-425-685-7

DOI: 10.35111/2j6c-6z19

Language: English

Language (ISO639): eng

License: DIRHA English WSJ Audio Agreement: https://catalog.ldc.upenn.edu/license/dirha-english-wsj-audio-agreement.pdf

Medium: Distribution: Web Download

Publisher: Linguistic Data Consortium

Publisher (URI): https://www.ldc.upenn.edu

Relation (URI): https://catalog.ldc.upenn.edu/docs/LDC2018S01

Rights Holder: Portions © 1987-1989 Dow Jones & Company, Inc., © 2018 Fondazione Bruno Kessler, © 1996, 2018 Trustees of the University of Pennsylvania

Type (DCMI): Sound

Text

Type (OLAC): primary_text

OLAC Info

Archive: The LDC Corpus Catalog

Description: http://www.language-archives.org/archive/www.ldc.upenn.edu

GetRecord: OAI-PMH request for OLAC format

GetRecord: Pre-generated XML file

OAI Info

OaiIdentifier: oai:www.ldc.upenn.edu:LDC2018S01

DateStamp: 2020-11-30

GetRecord: OAI-PMH request for simple DC format

Search Info
Citation: Ravanelli, Mirco; Cristoforetti, Luca; Omologo, Maurizio. 2018. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Sound dcmi_Text iso639_eng olac_primary_text

http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2018S01
Up-to-date as of: Wed Oct 29 7:01:46 EDT 2025

Metadata
Title:		DIRHA English WSJ Audio
Access Rights:		Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:		Ravanelli, Mirco, Luca Cristoforetti, and Maurizio Omologo. DIRHA English WSJ Audio LDC2018S01. Web Download. Philadelphia: Linguistic Data Consortium, 2018
Contributor:		Ravanelli, Mirco
		Cristoforetti, Luca
		Omologo, Maurizio
Date (W3CDTF):		2018
Date Issued (W3CDTF):		2018-01-16
Description:		Introduction DIRHA English WSJ Audio was developed as part of the Distant-Speech Interaction for Robust Home Applications (DIRHA) Project which addressed natural spontaneous speech interaction with distant microphones in a domestic environment. It is comprised of approximately 85 hours of real and simulated read speech by six native American English speakers. The target utterances were taken from CSR-I (WSJ0) Complete (LDC93S6A), specifically, the 5,000 word subset of read speech from Wall Street Journal news text. This release contains signals of different characteristics in terms of noise and reverberation making it suitable for various multi-microphone signal processing and distant speech recognition tasks. The corpus can be coupled with related Kaldi baselines and tools that are available here. Data Speech was collected in a real apartment setting with typical domestic background noise and inter/intra-room reverberation effects. A total of 32 microphones were placed in the living-room (26 microphones) and in the kitchen (6 microphones). The original recordings were made at a sampling frequency of 48 kHz. However, for the sake of compactness, the released signals in this publication are in wav format with 16 kHz sampling frequency and 16 bit resolution. Annotations for each acoustic sequence are included in xml format, such as microphone positions, speaker id, speaker gender and speaker position. Additional metadata about the speakers and images of the apartment setting are also provided. Consult the documentation accompanying this release for more information about the collection. Samples Please view this audio sample and annotation sample. Updates None at this time.
Extent:		Corpus size: 9686640 KB
Format:		Sampling Rate: 16000
Format:		Sampling Format: pcm
Identifier:		LDC2018S01
		https://catalog.ldc.upenn.edu/LDC2018S01
		ISBN: 1-58563-831-5
		ISLRN: 112-363-425-685-7
		DOI: 10.35111/2j6c-6z19
Language:		English
Language (ISO639):		eng
License:		DIRHA English WSJ Audio Agreement: https://catalog.ldc.upenn.edu/license/dirha-english-wsj-audio-agreement.pdf
Medium:		Distribution: Web Download
Publisher:		Linguistic Data Consortium
Publisher (URI):		https://www.ldc.upenn.edu
Relation (URI):		https://catalog.ldc.upenn.edu/docs/LDC2018S01
Rights Holder:		Portions © 1987-1989 Dow Jones & Company, Inc., © 2018 Fondazione Bruno Kessler, © 1996, 2018 Trustees of the University of Pennsylvania
Type (DCMI):		Sound
Type (DCMI):		Text
Type (OLAC):		primary_text
OLAC Info
Archive:		The LDC Corpus Catalog
Description:		http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:		OAI-PMH request for OLAC format
GetRecord:		Pre-generated XML file
OAI Info
OaiIdentifier:		oai:www.ldc.upenn.edu:LDC2018S01
DateStamp:		2020-11-30
GetRecord:		OAI-PMH request for simple DC format
Search Info
Citation:		Ravanelli, Mirco; Cristoforetti, Luca; Omologo, Maurizio. 2018. Linguistic Data Consortium.
Terms:		area_Europe country_GB dcmi_Sound dcmi_Text iso639_eng olac_primary_text