OLAC Record

Title:Evaluating cross-linguistic forced alignment of conversational data in north Australian Kriol, an under-resourced language
Bibliographic Citation:Jones, Caroline, Li, Weicong, Almeida, Andre, German, Amit; 2019-06; Kaipuleohone University of Hawai'i Digital Language Archive;http://hdl.handle.net/10125/24869.
Creator:Jones, Caroline
Li, Weicong
Almeida, Andre
German, Amit
Date (W3CDTF):2019-06
Description:Speech technology is transforming language documentation; acoustic models trained on “small” languages are now technically feasible. At the same time, forced alignment built for major world languages has matured and now offers ease of use through web interfaces requiring low technical expertise. This paper provides an updated and detailed evaluation of cross-linguistic forced alignment, the approach of using forced aligners untrained on the target language. We compare two options within MAUS (Munich Automatic Segmentation System): language-independent mode vs major world language system (here, Italian) on the one dataset, a comparison that has not previously been reported. The dataset comes from a corpus of adult conversational speech in Kriol, an English-based creole of northern Australia. The results of using MAUS Italian were better than those of using the language-independent mode and those in previous studies: the agreement rate at 20 ms was 72.1% at vowel onset and 57.2% at vowel offset. With completely misaligned tokens excluded, the overall agreement rate rose to 69.2% at 20 ms and over 90% at 50 ms. Most errors in the output SAMPA (Speech Assessment Methods Phonetic Alphabet) labels were resolvable with simple text replacements. These results offer updated benchmark data for an untrained, late-model forced alignment system.
National Foreign Language Resource Center
Format:19 pages
Identifier:Jones, Caroline, Weicong Li, Andre Almeida, & Amit German. 2019. Evaluating cross-linguistic forced alignment of conversational data in north Australian Kriol, an under-resourced language. Language Documentation & Conservation 13: 281-299.
Identifier (URI):http://hdl.handle.net/10125/24869
Publisher:University of Hawaii Press
Rights:Creative Commons Attribution-NonCommercial 4.0 International
Attribution-NonCommercial 3.0 United States
Subject:Australian Kriol
forced alignment
language documentation
speech technology
Table Of Contents:jones_et_al.pdf
Type (DCMI):Text


Archive:  Language Documentation and Conservation
Description:  http://www.language-archives.org/archive/ldc.scholarspace.manoa.hawaii.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:scholarspace.manoa.hawaii.edu:10125/24869
DateStamp:  2019-06-20
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Jones, Caroline; Li, Weicong; Almeida, Andre; German, Amit. 2019. University of Hawaii Press.
Terms: dcmi_Text

Up-to-date as of: Sun Mar 1 15:49:37 EST 2020