OLAC Record

Title:Bulgarian Treebank Corpus
Access Rights: Rights available for: attribution
Date Available (W3CDTF):2022-10-03
Date Issued (W3CDTF):2022-10-03
Description:The Bulgarian Treebank Corpus is composed of 156,149 tokens (11,138 sentences) coming from three main sources in the domain of Grammar Notebooks (1,391 sentences), News (6,698 sentences), Other (3,049 sentences). It is available with syntactical and morphological annotation on a sentence basis in Universal Dependencies format. This subset of BulTreeBank excludes ellipses and some rare phenomena. The conversion of BulTreeBank into Universal Dependency format was supported by the EU Project QTLeap (http://qtleap.eu/).
ISLRN: 761-430-854-533-2
Identifier (URI):https://catalog.elra.info/en-us/repository/browse/ELRA-W0328/
Language (ISO639):bul
Medium:Not specified
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text


Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-W0328
DateStamp:  2022-10-03
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2022. ELRA (European Language Resources Association).
Terms: area_Europe country_BG dcmi_Text iso639_bul olac_primary_text

Up-to-date as of: Wed Sep 20 0:40:44 EDT 2023