OLAC Record
oai:lindat.mff.cuni.cz:11234/1-3691

Metadata
Title:RobeCzech Base
Bibliographic Citation:http://hdl.handle.net/11234/1-3691
Creator:Straka, Milan
Náplava, Jakub
Straková, Jana
Samuel, David
Date (W3CDTF):2021-05-25T08:15:54Z
Date Available:2021-05-25T08:15:54Z
Description:RobeCzech is a monolingual RoBERTa language representation model trained on Czech data. RoBERTa is a robustly optimized Transformer-based pretraining approach. We show that RobeCzech considerably outperforms equally-sized multilingual and Czech-trained contextualized language representation models, surpasses current state of the art in all five evaluated NLP tasks and reaches state-of-theart results in four of them. The RobeCzech model is released publicly at https://hdl.handle.net/11234/1-3691 and https://huggingface.co/ufal/robeczech-base, both for PyTorch and TensorFlow.
Identifier (URI):http://hdl.handle.net/11234/1-3691
Language:Czech
Language (ISO639):ces
Publisher:Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Rights:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
http://creativecommons.org/licenses/by-nc-sa/4.0/
Subject:Czech
BERT
RoBERTa
Czech language
Subject (ISO639):ces
Type:languageDescription
Type (DCMI):Text
Type (OLAC):language_description

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11234/1-3691
DateStamp:  2021-06-29
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Straka, Milan; Náplava, Jakub; Straková, Jana; Samuel, David. 2021. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL).
Terms: area_Europe country_CZ dcmi_Text iso639_ces olac_language_description

Inferred Metadata

Country: Czech Republic
Area: Europe


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11234/1-3691
Up-to-date as of: Thu Oct 5 0:41:21 EDT 2023