OLAC Record
oai:lindat.mff.cuni.cz:11234/1-2621

Metadata
Title:Prague Dependency Treebank 3.5
Bibliographic Citation:http://hdl.handle.net/11234/1-2621
Creator:Hajič, Jan
Bejček, Eduard
Bémová, Alevtina
Buráňová, Eva
Hajičová, Eva
Havelka, Jiří
Homola, Petr
Kárník, Jiří
Kettnerová, Václava
Klyueva, Natalia
Kolářová, Veronika
Kučová, Lucie
Lopatková, Markéta
Mikulová, Marie
Mírovský, Jiří
Nedoluzhko, Anna
Pajas, Petr
Panevová, Jarmila
Poláková, Lucie
Rysová, Magdaléna
Sgall, Petr
Spoustová, Johanka
Straňák, Pavel
Synková, Pavlína
Ševčíková, Magda
Štěpánek, Jan
Urešová, Zdeňka
Vidová Hladká, Barbora
Zeman, Daniel
Zikánová, Šárka
Žabokrtský, Zdeněk
Date (W3CDTF):2018-02-20T01:15:58Z
Date Available:2018-02-20T01:15:58Z
Description:The Prague Dependency Treebank 3.5 (PDT 3.5) is the 2018 edition of the core Prague Dependency Treebank (PDT). It contains all PDT annotation made on the original texts at the Institute of Formal and Applied Linguistics under various projects between 1996 and 2018, i.e., all annotation from PDT 1.0, PDT 2.0, PDT 2.5, PDT 3.0, PDiT 1.0 and PDiT 2.0, plus corrections, a new structure of the basic documentation and a new list of authors covering all previous editions. PDT 3.5 contains the same texts as the previous versions since 2.0; 49,431 sentences (832,823 words) are annotated on all layers, from the tectogrammatical layer through syntax to morphology. Additional sentences are annotated for syntax and morphology; the totals for the lower layers are 87,913 sentences with 1,502,976 words at the analytical layer (surface dependency syntax) and 115,844 sentences with 1,956,693 words at the morphological layer (these totals include the sentences that are also annotated at the higher layers). Closely linked to the tectogrammatical layer is the annotation of sentence information structure, multiword expressions, coreference, bridging relations and discourse relations.
Identifier (URI):http://hdl.handle.net/11234/1-2621
Is Replaced By (URI):http://hdl.handle.net/11234/1-3185
Language:Czech
Language (ISO639):ces
Publisher:Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Replaces (URI):http://hdl.handle.net/11858/00-097C-0000-0023-1AAF-3
http://hdl.handle.net/11234/1-1905
Rights:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
http://creativecommons.org/licenses/by-nc-sa/4.0/
Subject:treebank
dependency
tectogrammatics
topic-focus articulation
multiword expressions
coreference
bridging relations
discourse
morphology
syntax
tokenization
lemmatization
clauses
semantics
semantic relations
lexical semantics
lexicon
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11234/1-2621
DateStamp:  2021-06-29
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Hajič, Jan; Bejček, Eduard; Bémová, Alevtina; Buráňová, Eva; Hajičová, Eva; Havelka, Jiří; Homola, Petr; Kárník, Jiří; Kettnerová, Václava; Klyueva, Natalia; Kolářová, Veronika; Kučová, Lucie; Lopatková, Markéta; Mikulová, Marie; Mírovský, Jiří; Nedoluzhko, Anna; Pajas, Petr; Panevová, Jarmila; Poláková, Lucie; Rysová, Magdaléna; Sgall, Petr; Spoustová, Johanka; Straňák, Pavel; Synková, Pavlína; Ševčíková, Magda; Štěpánek, Jan; Urešová, Zdeňka; Vidová Hladká, Barbora; Zeman, Daniel; Zikánová, Šárka; Žabokrtský, Zdeněk. 2018. Prague Dependency Treebank 3.5. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL).
Terms: area_Europe country_CZ dcmi_Text iso639_ces olac_primary_text


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11234/1-2621
Up-to-date as of: Thu Oct 5 0:40:52 EDT 2023