OLAC Record
oai:www.ldc.upenn.edu:LDC2024S12

Metadata
Title:Samrómur Synthetic
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Hernández Mena, Carlos Daniel, Gunnar Örnólfsson, and Jon Gudnason. Samrómur Synthetic LDC2024S12. Web Download. Philadelphia: Linguistic Data Consortium, 2024
Contributor:Hernández Mena, Carlos Daniel
Örnólfsson, Gunnar Thor
Gudnason, Jon
Date (W3CDTF):2024
Date Issued (W3CDTF):2024-11-15
Description:*Introduction* Samrómur Synthetic was developed by the Language and Voice Lab, Reykjavik University and contains 72 hours of Icelandic synthetic speech, transcripts and metadata. *Data* Source sentences were extracted from the Samrómur platform, comprised of texts and transcripts covering various genres. Text was processed through a text-to-speech system developed by Reykjavik University's Language and Voice Lab to generate speech files. Synthesized speech was created with 44 voices (22 male, 22 female) at four different speed rates for a total of 220 speakers and 62,700 utterances (with 285 sentences/speaker). Audio data is divided by speaker and is presented as flac compressed, single channel, 16 kHz, 16-bit linear PCM. Transcripts and metadata are presented in .tsv format. *Samples* Please view this audio sample (FLAC) and metadata (TSV). *Updates* None at this time.
Extent:Corpus size: 6326534 KB
Format:Sampling Rate: 16000
Sampling Format: flac
Identifier:LDC2024S12
https://catalog.ldc.upenn.edu/LDC2024S12
ISLRN: 446-426-909-343-3
DOI: 10.35111/4fam-5358
Language:Icelandic
Language (ISO639):isl
License:Samrómur Synthetic Agreement (For-Profit Member): https://catalog.ldc.upenn.edu/license/samromur-synthetic-for-profit-member.pdf
Samrómur Synthetic Agreement (Non-Member): https://catalog.ldc.upenn.edu/license/samromur-synthetic-agreement-non-member.pdf
Samrómur Synthetic Agreement (Not-for-Profit): https://catalog.ldc.upenn.edu/license/samromur-synthetic-agreement-not-for-profit.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2024S12
Rights Holder:Portions © 2024 Reykjavik University, © 2024 Trustees of the University of Pennsylvania
Type (DCMI):Sound
Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2024S12
DateStamp:  2024-11-19
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Hernández Mena, Carlos Daniel; Örnólfsson, Gunnar Thor; Gudnason, Jon. 2024. Linguistic Data Consortium.
Terms: area_Europe country_IS dcmi_Sound dcmi_Text iso639_isl olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2024S12
Up-to-date as of: Fri Dec 6 7:49:18 EST 2024