OLAC Record: Mixer 4 and 5 Speech

OLAC Record
oai:www.ldc.upenn.edu:LDC2020S03

Metadata

Title: Mixer 4 and 5 Speech

Access Rights: Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining

Bibliographic Citation: Brandschain, Linda, et al. Mixer 4 and 5 Speech LDC2020S03. Web Download. Philadelphia: Linguistic Data Consortium, 2020

Contributor: Brandschain, Linda

Walker, Kevin

Graff, David

Cieri, Christopher

Neely, Abby

Mirghafori, Nikki

Peskin, Barbara

Godfrey, Jack

Strassel, Stephanie

Goodman, Fred

Doddington, George R.

King, Mike

Date (W3CDTF): 2020

Date Issued (W3CDTF): 2020-03-13

Description: *Description* *Introduction* Mixer 4 and 5 Speech was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 14,185 hours of audio recordings of conversational telephone speech, interviews, elicitation exercises and transcript readings involving 616 distinct speakers. The material was collected in 2007 as part of the Mixer project and recordings in this corpus were used in the 2008 NIST Speaker Recognition Evaluation (SRE). The data in this release was collected in 2007 by LDC at its Human Subjects Data Collection Laboratories in Philadelphia and by the International Computer Science Institute (ICSI) at the University of California, Berkeley. The Mixer 4 and Mixer 5 collections were conducted simultaneously, as a collaborative, carefully coordinated activity at both recording sites. The telephone protocol connected recruited speakers through a robot operator to carry on casual conversations. In Mixer 4, 400 subjects made ten 10-minute calls; half of those subjects also visited one of the collection sites where they made two telephone calls while also being recorded on a cross-channel platform. In Mixer 5, 300 subjects each completed ten calls and six interview sessions at either LDC or ICSI; those sessions were conducted on a cross channel platform and included a telephone call in one of three vocal-effort conditions - normal, high and low. Mixer participants were nearly all native English speakers, the rest being bilingual English speakers. Researchers interested in applying NIST 2008 SRE benchmark test sets should consult the respective NIST Evaluation Plans for guidelines on allowable training data for those tests. Training, evaluation and supplemental data from 2008 SRE are available in the LDC Catalog: 2008 NIST Speaker Recognition Evaluation Training Set Part 1 (LDC2011S05), 2008 NIST Speaker Recognition Evaluation Training Set Part 2 (LDC2011S07), 2008 NIST Speaker Recognition Evaluation Test Set (LDC2011S08) and 2008 NIST Speaker Recognition Evaluation Supplemental Set (LDC2011S11). *Data* The Mixer 4 and 5 collection contains 2,568 recordings made via the public telephone network and 2,152 sessions of multiple microphone recordings in office-room settings. The telephone recordings are presented as 8-KHz 2-channel NIST SPHERE files, and the microphone recordings are 16-KHz 1-channel flac/ms-wav files. When the microphone recording flac files are uncompressed, they become ms-wav/RIFF files (flac compression does not presently support SPHERE file format). The telephone audio is presented in SPHERE format because this is consistent with other LDC telephone audio releases and because flac does not support ulaw sample encoding. The open-source SoX utility is able to handle both formats as input. Other utilities are available for flac and SPHERE formats. Metadata about the calls and speakers is also included in this release, along with time-aligned entries for many of the component portions of the recording sessions. *Samples* Please listen to this telephone sample (SPH) and microphone sample (FLAC). *Updates* None at this time. *Additional Licensing Instructions* This 'members-only' corpora is available to current members who can request the data at the listed reduced-license fee. Contact ldc@ldc.upenn.edu for information about becoming a member.

Extent: Corpus size: 725143108 KB

Format: Sampling Rate: 16000

Sampling Format: pcm

Identifier: LDC2020S03

https://catalog.ldc.upenn.edu/LDC2020S03

ISBN: 1-58563-922-2

ISLRN: 102-906-715-140-9

DOI: 10.35111/xq98-yj91

Language: English

Language (ISO639): eng

Medium: Distribution: Web Download

Publisher: Linguistic Data Consortium

Publisher (URI): https://www.ldc.upenn.edu

Relation (URI): https://catalog.ldc.upenn.edu/docs/LDC2020S03

Rights Holder: Portions © 2007, 2008, 2020 Trustees of the University of Pennsylvania

Type (DCMI): Sound

Type (OLAC): primary_text

OLAC Info

Archive: The LDC Corpus Catalog

Description: http://www.language-archives.org/archive/www.ldc.upenn.edu

GetRecord: OAI-PMH request for OLAC format

GetRecord: Pre-generated XML file

OAI Info

OaiIdentifier: oai:www.ldc.upenn.edu:LDC2020S03

DateStamp: 2021-12-06

GetRecord: OAI-PMH request for simple DC format

Search Info
Citation: Brandschain, Linda; Walker, Kevin; Graff, David; Cieri, Christopher; Neely, Abby; Mirghafori, Nikki; Peskin, Barbara; Godfrey, Jack; Strassel, Stephanie; Goodman, Fred; Doddington, George R.; King, Mike. 2020. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Sound iso639_eng olac_primary_text

http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2020S03
Up-to-date as of: Wed Oct 29 7:01:59 EDT 2025

Metadata
Title:		Mixer 4 and 5 Speech
Access Rights:		Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:		Brandschain, Linda, et al. Mixer 4 and 5 Speech LDC2020S03. Web Download. Philadelphia: Linguistic Data Consortium, 2020
Contributor:		Brandschain, Linda
		Walker, Kevin
		Graff, David
		Cieri, Christopher
		Neely, Abby
		Mirghafori, Nikki
		Peskin, Barbara
		Godfrey, Jack
		Strassel, Stephanie
		Goodman, Fred
		Doddington, George R.
		King, Mike
Date (W3CDTF):		2020
Date Issued (W3CDTF):		2020-03-13
Description:		Description Introduction Mixer 4 and 5 Speech was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 14,185 hours of audio recordings of conversational telephone speech, interviews, elicitation exercises and transcript readings involving 616 distinct speakers. The material was collected in 2007 as part of the Mixer project and recordings in this corpus were used in the 2008 NIST Speaker Recognition Evaluation (SRE). The data in this release was collected in 2007 by LDC at its Human Subjects Data Collection Laboratories in Philadelphia and by the International Computer Science Institute (ICSI) at the University of California, Berkeley. The Mixer 4 and Mixer 5 collections were conducted simultaneously, as a collaborative, carefully coordinated activity at both recording sites. The telephone protocol connected recruited speakers through a robot operator to carry on casual conversations. In Mixer 4, 400 subjects made ten 10-minute calls; half of those subjects also visited one of the collection sites where they made two telephone calls while also being recorded on a cross-channel platform. In Mixer 5, 300 subjects each completed ten calls and six interview sessions at either LDC or ICSI; those sessions were conducted on a cross channel platform and included a telephone call in one of three vocal-effort conditions - normal, high and low. Mixer participants were nearly all native English speakers, the rest being bilingual English speakers. Researchers interested in applying NIST 2008 SRE benchmark test sets should consult the respective NIST Evaluation Plans for guidelines on allowable training data for those tests. Training, evaluation and supplemental data from 2008 SRE are available in the LDC Catalog: 2008 NIST Speaker Recognition Evaluation Training Set Part 1 (LDC2011S05), 2008 NIST Speaker Recognition Evaluation Training Set Part 2 (LDC2011S07), 2008 NIST Speaker Recognition Evaluation Test Set (LDC2011S08) and 2008 NIST Speaker Recognition Evaluation Supplemental Set (LDC2011S11). Data The Mixer 4 and 5 collection contains 2,568 recordings made via the public telephone network and 2,152 sessions of multiple microphone recordings in office-room settings. The telephone recordings are presented as 8-KHz 2-channel NIST SPHERE files, and the microphone recordings are 16-KHz 1-channel flac/ms-wav files. When the microphone recording flac files are uncompressed, they become ms-wav/RIFF files (flac compression does not presently support SPHERE file format). The telephone audio is presented in SPHERE format because this is consistent with other LDC telephone audio releases and because flac does not support ulaw sample encoding. The open-source SoX utility is able to handle both formats as input. Other utilities are available for flac and SPHERE formats. Metadata about the calls and speakers is also included in this release, along with time-aligned entries for many of the component portions of the recording sessions. Samples Please listen to this telephone sample (SPH) and microphone sample (FLAC). Updates None at this time. Additional Licensing Instructions This 'members-only' corpora is available to current members who can request the data at the listed reduced-license fee. Contact ldc@ldc.upenn.edu for information about becoming a member.
Extent:		Corpus size: 725143108 KB
Format:		Sampling Rate: 16000
Format:		Sampling Format: pcm
Identifier:		LDC2020S03
		https://catalog.ldc.upenn.edu/LDC2020S03
		ISBN: 1-58563-922-2
		ISLRN: 102-906-715-140-9
		DOI: 10.35111/xq98-yj91
Language:		English
Language (ISO639):		eng
Medium:		Distribution: Web Download
Publisher:		Linguistic Data Consortium
Publisher (URI):		https://www.ldc.upenn.edu
Relation (URI):		https://catalog.ldc.upenn.edu/docs/LDC2020S03
Rights Holder:		Portions © 2007, 2008, 2020 Trustees of the University of Pennsylvania
Type (DCMI):		Sound
Type (OLAC):		primary_text
OLAC Info
Archive:		The LDC Corpus Catalog
Description:		http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:		OAI-PMH request for OLAC format
GetRecord:		Pre-generated XML file
OAI Info
OaiIdentifier:		oai:www.ldc.upenn.edu:LDC2020S03
DateStamp:		2021-12-06
GetRecord:		OAI-PMH request for simple DC format
Search Info
Citation:		Brandschain, Linda; Walker, Kevin; Graff, David; Cieri, Christopher; Neely, Abby; Mirghafori, Nikki; Peskin, Barbara; Godfrey, Jack; Strassel, Stephanie; Goodman, Fred; Doddington, George R.; King, Mike. 2020. Linguistic Data Consortium.
Terms:		area_Europe country_GB dcmi_Sound iso639_eng olac_primary_text