OLAC Record
oai:clarin.eurac.edu:20.500.12124/60

Metadata
Title:MT@BZ translation corpus v1.0
Bibliographic Citation:http://hdl.handle.net/20.500.12124/60
Creator:De Camillis, Flavia
Chiocchetti, Elena
Stemle, Egon W.
Date (W3CDTF):2023-06-18T18:33:02Z
Date Available:2023-06-18T18:33:02Z
Description:The MT@BZ is a translation corpus that consists of 52 decrees published by the Autonomous Province of Bolzano (South Tyrol) aligned with their machine translated versions. More precisely, it consists of 26 decrees in German and the same 26 in Italian in their official versions, respectively machine translated by the project team into Italian and into German. 10 of them are COVID-19 related decress, while 16 are miscellaneous. Overall, they consist of around 130,000 words. Their machine translation was carried out with a customized version of ModernMT. Later, the corpus was uploaded first into the annotation platform Webanno, then transferred to Inception. Four annotators annotated the translation errors made by the machine according to an ad hoc error taxonomy for quality assessment. Finally, the annotations were curated to create a gold standard corpus.
Identifier (URI):http://hdl.handle.net/20.500.12124/60
Language:Italian
German
Language (ISO639):ita
deu
Publisher:Institute for Applied Linguistics, Eurac Research
Rights:Creative Commons - Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)
https://creativecommons.org/licenses/by-nc/4.0/
Subject:machine translation
annotation
translation errors
accuracy
fluency
Italian
German
South Tyrolean German
legal language
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  Eurac Research CLARIN Centre
Description:  http://www.language-archives.org/archive/clarin.eurac.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:clarin.eurac.edu:20.500.12124/60
DateStamp:  2023-06-18
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: De Camillis, Flavia; Chiocchetti, Elena; Stemle, Egon W. 2023. Institute for Applied Linguistics, Eurac Research.
Terms: area_Europe country_DE country_IT dcmi_Text iso639_deu iso639_ita olac_primary_text


http://www.language-archives.org/item.php/oai:clarin.eurac.edu:20.500.12124/60
Up-to-date as of: Mon Sep 18 0:46:46 EDT 2023