OLAC Record: The DKU-JNU-EMA Electromagnetic Articulography Database

OLAC Record
oai:www.ldc.upenn.edu:LDC2019S14

Metadata

Title: The DKU-JNU-EMA Electromagnetic Articulography Database

Access Rights: Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining

Bibliographic Citation: Qin, Xiaoyi, et al. The DKU-JNU-EMA Electromagnetic Articulography Database LDC2019S14. Web Download. Philadelphia: Linguistic Data Consortium, 2019

Contributor: Qin, Xiaoyi

Liu, Xinzhong

Cai, Zexin

Li, Ming

Date (W3CDTF): 2019

Date Issued (W3CDTF): 2019-07-15

Description: *Introduction* The DKU-JNU-EMA Electromagnetic Articulography Database was developed by Duke Kunshan University and Jinan University and contains approximately 10 hours of articulography and speech data in Mandarin, Cantonese, Hakka, and Teochew Chinese from two to seven native speakers for each dialect. Electromagnetic articulography (EMA) is a method of measuring the position of parts of the mouth and their movement over time during speech and swallowing. Measurements are made from sensors placed in the mouth to capture real-time vocal tract variable trajectories. EMA is used in linguistics and language-related research to study phonetics, in particular, articulation (how sounds are made). *Data* Articulatory measurements were made using the NDI electromagnetic articulography wave research system. Subjects had six sensors placed in various locations in their mouth and one reference sensor was placed on the bridge of their nose. For simultaneous recording of speech signals, subjects also wore a head-mounted close-talk microphone. Speakers engaged in four different types of recording sessions: one in which they read complete sentences or short texts, and three sessions in which they read related words of a specific common consonant, vowel or tone. Audio data is presented as single channel, 16kHz, 16-bit flac compressed wav files. Articulography data is stored as UTF-8 plain text files. *Samples* Please view the following samples: * Cantonese Speech * Cantonese Sensor Data * Cantonese Parameter Data * Mandarin Speech * Mandarin Sensor Data * Mandarin Parameter Data *Updates* Chinese and English README files were updated on April 22, 2024.

Extent: Corpus size: 3122232 KB

Format: Sampling Rate: 16000

Sampling Format: pcm

Identifier: LDC2019S14

https://catalog.ldc.upenn.edu/LDC2019S14

ISBN: 1-58563-894-3

ISLRN: 147-070-436-975-2

DOI: 10.35111/61f9-zg33

Language: Yue Chinese

Hakka Chinese

Min Nan Chinese

Mandarin Chinese

Language (ISO639): yue

hak

nan

cmn

License: LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf

Medium: Distribution: Web Download

Publisher: Linguistic Data Consortium

Publisher (URI): https://www.ldc.upenn.edu

Relation (URI): https://catalog.ldc.upenn.edu/docs/LDC2019S14

Rights Holder: Portions © 2019 Duke Kunshan University, © 2019 Jinan University, © 2019 Trustees of the University of Pennsylvania

Type (DCMI): Sound

Text

Type (OLAC): primary_text

OLAC Info

Archive: The LDC Corpus Catalog

Description: http://www.language-archives.org/archive/www.ldc.upenn.edu

GetRecord: OAI-PMH request for OLAC format

GetRecord: Pre-generated XML file

OAI Info

OaiIdentifier: oai:www.ldc.upenn.edu:LDC2019S14

DateStamp: 2024-04-22

GetRecord: OAI-PMH request for simple DC format

Search Info
Citation: Qin, Xiaoyi; Liu, Xinzhong; Cai, Zexin; Li, Ming. 2019. Linguistic Data Consortium.
Terms: area_Asia country_CN dcmi_Sound dcmi_Text iso639_cmn iso639_hak iso639_nan iso639_yue olac_primary_text

http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2019S14
Up-to-date as of: Wed Oct 29 7:01:55 EDT 2025

Metadata
Title:		The DKU-JNU-EMA Electromagnetic Articulography Database
Access Rights:		Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:		Qin, Xiaoyi, et al. The DKU-JNU-EMA Electromagnetic Articulography Database LDC2019S14. Web Download. Philadelphia: Linguistic Data Consortium, 2019
Contributor:		Qin, Xiaoyi
		Liu, Xinzhong
		Cai, Zexin
		Li, Ming
Date (W3CDTF):		2019
Date Issued (W3CDTF):		2019-07-15
Description:		Introduction The DKU-JNU-EMA Electromagnetic Articulography Database was developed by Duke Kunshan University and Jinan University and contains approximately 10 hours of articulography and speech data in Mandarin, Cantonese, Hakka, and Teochew Chinese from two to seven native speakers for each dialect. Electromagnetic articulography (EMA) is a method of measuring the position of parts of the mouth and their movement over time during speech and swallowing. Measurements are made from sensors placed in the mouth to capture real-time vocal tract variable trajectories. EMA is used in linguistics and language-related research to study phonetics, in particular, articulation (how sounds are made). Data Articulatory measurements were made using the NDI electromagnetic articulography wave research system. Subjects had six sensors placed in various locations in their mouth and one reference sensor was placed on the bridge of their nose. For simultaneous recording of speech signals, subjects also wore a head-mounted close-talk microphone. Speakers engaged in four different types of recording sessions: one in which they read complete sentences or short texts, and three sessions in which they read related words of a specific common consonant, vowel or tone. Audio data is presented as single channel, 16kHz, 16-bit flac compressed wav files. Articulography data is stored as UTF-8 plain text files. Samples Please view the following samples: * Cantonese Speech * Cantonese Sensor Data * Cantonese Parameter Data * Mandarin Speech * Mandarin Sensor Data * Mandarin Parameter Data Updates Chinese and English README files were updated on April 22, 2024.
Extent:		Corpus size: 3122232 KB
Format:		Sampling Rate: 16000
Format:		Sampling Format: pcm
Identifier:		LDC2019S14
		https://catalog.ldc.upenn.edu/LDC2019S14
		ISBN: 1-58563-894-3
		ISLRN: 147-070-436-975-2
		DOI: 10.35111/61f9-zg33
Language:		Yue Chinese
		Hakka Chinese
		Min Nan Chinese
		Mandarin Chinese
Language (ISO639):		yue
		hak
		nan
		cmn
License:		LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:		Distribution: Web Download
Publisher:		Linguistic Data Consortium
Publisher (URI):		https://www.ldc.upenn.edu
Relation (URI):		https://catalog.ldc.upenn.edu/docs/LDC2019S14
Rights Holder:		Portions © 2019 Duke Kunshan University, © 2019 Jinan University, © 2019 Trustees of the University of Pennsylvania
Type (DCMI):		Sound
Type (DCMI):		Text
Type (OLAC):		primary_text
OLAC Info
Archive:		The LDC Corpus Catalog
Description:		http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:		OAI-PMH request for OLAC format
GetRecord:		Pre-generated XML file
OAI Info
OaiIdentifier:		oai:www.ldc.upenn.edu:LDC2019S14
DateStamp:		2024-04-22
GetRecord:		OAI-PMH request for simple DC format
Search Info
Citation:		Qin, Xiaoyi; Liu, Xinzhong; Cai, Zexin; Li, Ming. 2019. Linguistic Data Consortium.
Terms:		area_Asia country_CN dcmi_Sound dcmi_Text iso639_cmn iso639_hak iso639_nan iso639_yue olac_primary_text