OLAC Record
oai:www.ldc.upenn.edu:LDC2022S05

Metadata
Title:Samrómur Icelandic Speech 1.0
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Mollberg, David, et al. Samrómur Icelandic Speech 1.0 LDC2022S05. Web Download. Philadelphia: Linguistic Data Consortium, 2022
Contributor:Mollberg, David
Jónsson, Ólafur Helgi
Þorsteinsdóttir, Sunneva
Guðmundsdóttir, Jóhanna Vigdís
Steingrimsson, Steinthor
Magnusdottir, Eydis Huld
Fong, Judy
Borsky, Michal
Gudnason, Jon
Date (W3CDTF):2022
Date Issued (W3CDTF):2022-05-16
Description:*Introduction* Samrómur Icelandic Speech 1.0 was developed by the Language and Voice Lab, Reykjavik University in cooperation with Almannarómur, Center for Language Technology. The corpus contains 145 hours of Icelandic prompted speech from 8,392 speakers representing 100,000 utterances. This version 1.0 is equivalent to "Samrómur Icelandic Speech 21.05" as used by the Language Technology Programme for Icelandic 2019-2023. *Data* Speech data was collected between October 2019 and May 2021 using the Samrómur website which displayed prompts to participants. The prompts were mainly from The Icelandic Gigaword Corpus, which includes text from novels, news, plays, and from a list of location names in Iceland. Additional prompts were taken from the Icelandic Web of Science and others were created by combining a name followed by a question or a demand. Prompts and speaker metadata are included in the corpus. The audio data is divided into train, dev, and test sets and is presented as flac compressed, single channel, 16 kHz, 16-bit linear PCM. *Samples* Please view this audio sample (FLAC). *Updates* None at this time.
Extent:Corpus size: 8230968 KB
Format:Sampling Rate: 16000
Sampling Format: flac
Identifier:LDC2022S05
https://catalog.ldc.upenn.edu/LDC2022S05
ISBN: 1-58563-991-5
ISLRN: 643-778-441-472-4
DOI: 10.35111/thx3-f170
Language:Icelandic
Language (ISO639):isl
License:Samrómur Icelandic Speech 1.0 Agreement (For-Profit): https://catalog.ldc.upenn.edu/license/samromur-icelandic-speech-1-dot-0-agreement-for-profit.pdf
Samrómur Icelandic Speech 1.0 Agreement (Non-Member): https://catalog.ldc.upenn.edu/license/samromur-icelandic-speech-1-dot-0-agreement-non-member.pdf
Samrómur Icelandic Speech 1.0 Agreement (Not-For-Profit): https://catalog.ldc.upenn.edu/license/samromur-icelandic-speech-1-dot-0-agreement-not-for-profit.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2022S05
Rights Holder:Portions © 2022 Reykjavik University, © 2022 Trustees of the University of Pennsylvania
Type (DCMI):Sound
Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2022S05
DateStamp:  2023-01-01
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Mollberg, David; Jónsson, Ólafur Helgi; Þorsteinsdóttir, Sunneva; Guðmundsdóttir, Jóhanna Vigdís; Steingrimsson, Steinthor; Magnusdottir, Eydis Huld; Fong, Judy; Borsky, Michal; Gudnason, Jon. 2022. Linguistic Data Consortium.
Terms: area_Europe country_IS dcmi_Sound dcmi_Text iso639_isl olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2022S05
Up-to-date as of: Fri Dec 6 7:49:11 EST 2024