OLAC Record
oai:www.clarin.si:11356/1044

Metadata
Title:MULTEXT-East "1984" document corpus 4.0
Bibliographic Citation:http://hdl.handle.net/11356/1044
Creator:Erjavec, Tomaž
Bruda, Ştefan
Dimitrova, Ludmila
Ide, Nancy
Kaalep, Heiki-Jaan
Krstev, Cvetana
Orav, Heili
Oravecz, Csaba
Paldre, Leho
Petkevič, Vladimír
Priest-Dorman, Greg
Simov, Kiril
Sinapova, Lydia
Sokolovsky, Paul
Sryvkin, Sergey
Tufiş, Dan
Utka, Andrius
Villandi, Viire
Vitas, Duško
Vuković, Olga
Date (W3CDTF):2015-06-15T08:56:08Z
Date Available:2015-06-15T08:56:08Z
Description:The novel "1984" by George Orwell is the central component of the MULTEXT-East corpus. This parallel, sentence-aligned corpus contains the novel in the English original (about 100,000 words in length) and its translations into a number of languages. This version of the corpus contains only structurally annotated texts, with elements such as paragraphs, footnotes, and highlighted text. In terms of linguistic annotation, the texts mark up names and sentences. The linguistically annotated texts are a separate submission (http://hdl.handle.net/11356/1043), which also covers a somewhat different set of languages.
Identifier (URI):http://hdl.handle.net/11356/1044
Language:Bulgarian
Czech
English
Estonian
Hungarian
Lithuanian
Romanian
Russian
Slovenian
Serbian
Language (ISO639):bul
ces
eng
est
hun
lit
ron
rus
slv
srp
Publisher:Jožef Stefan Institute
Replaces (URI):http://hdl.handle.net/11372/LRT-675
Rights:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
https://creativecommons.org/licenses/by-nc-sa/4.0/
Subject:parallel corpus
multilingual
TEI
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  Slovenian language resource repository CLARIN.SI
Description:  http://www.language-archives.org/archive/clarin.si
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.clarin.si:11356/1044
DateStamp:  2017-09-29
GetRecord:  OAI-PMH request for simple DC format
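
The GetRecord entries above refer to OAI-PMH requests for this record in two metadata formats. As a minimal sketch of how such a request URL is put together, the Python below builds one from the OaiIdentifier. The endpoint address is a hypothetical placeholder, since this record does not state it, and the helper name get_record_url is invented for illustration. The prefix oai_dc is the simple Dublin Core format defined by OAI-PMH; olac is assumed here to be the prefix the repository uses for OLAC metadata.

```python
from urllib.parse import urlencode

# Hypothetical placeholder: the record does not give the OAI-PMH endpoint.
BASE_URL = "https://example.org/oai"

# Identifier taken from the OaiIdentifier field of this record.
OAI_IDENTIFIER = "oai:www.clarin.si:11356/1044"


def get_record_url(identifier, metadata_prefix, base_url=BASE_URL):
    """Build an OAI-PMH GetRecord request URL (illustrative helper)."""
    query = urlencode({
        "verb": "GetRecord",                # OAI-PMH verb for fetching one record
        "identifier": identifier,           # the record's OAI identifier
        "metadataPrefix": metadata_prefix,  # requested metadata format
    })
    return f"{base_url}?{query}"


# Simple Dublin Core request (oai_dc is defined by OAI-PMH itself).
dc_url = get_record_url(OAI_IDENTIFIER, "oai_dc")

# OLAC request (assumes the repository exposes the "olac" prefix).
olac_url = get_record_url(OAI_IDENTIFIER, "olac")
```

The identifier contains ":" and "/", so urlencode percent-encodes it; passing the parameters as a dictionary avoids assembling the query string by hand.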

Search Info

Citation: Erjavec, Tomaž; Bruda, Ştefan; Dimitrova, Ludmila; Ide, Nancy; Kaalep, Heiki-Jaan; Krstev, Cvetana; Orav, Heili; Oravecz, Csaba; Paldre, Leho; Petkevič, Vladimír; Priest-Dorman, Greg; Simov, Kiril; Sinapova, Lydia; Sokolovsky, Paul; Sryvkin, Sergey; Tufiş, Dan; Utka, Andrius; Villandi, Viire; Vitas, Duško; Vuković, Olga. 2015. Jožef Stefan Institute.
Terms: area_Europe country_BG country_CZ country_GB country_HU country_LT country_RO country_RS country_RU country_SI dcmi_Text iso639_bul iso639_ces iso639_eng iso639_est iso639_hun iso639_lit iso639_ron iso639_rus iso639_slv iso639_srp olac_primary_text


http://www.language-archives.org/item.php/oai:www.clarin.si:11356/1044
Up-to-date as of: Sun Mar 31 8:57:26 EDT 2019