OLAC Record
oai:catalogue.elra.info:ELRA-W0328

Metadata
Title: Bulgarian Treebank Corpus
Access Rights: Rights available for: attribution
Date Available (W3CDTF): 2022-10-03
Date Issued (W3CDTF): 2022-10-03
Description: The Bulgarian Treebank Corpus comprises 156,149 tokens (11,138 sentences) drawn from three main sources: Grammar Notebooks (1,391 sentences), News (6,698 sentences), and Other (3,049 sentences). It is available with syntactic and morphological annotation on a per-sentence basis in Universal Dependencies format. This subset of BulTreeBank excludes ellipses and some rare phenomena. The conversion of BulTreeBank into Universal Dependencies format was supported by the EU project QTLeap (http://qtleap.eu/).
Identifier: ELRA-W0328
ISLRN: 761-430-854-533-2
Identifier (URI): https://catalog.elra.info/en-us/repository/browse/ELRA-W0328/
Language: Bulgarian
Language (ISO639): bul
Medium: Not specified
Publisher: ELRA (European Language Resources Association)
Type (DCMI): Text
Type (OLAC): primary_text
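
The Description field gives sentence and token totals for a corpus in Universal Dependencies format, which is normally distributed as CoNLL-U text. The following is a minimal sketch of how such totals could be recomputed from a CoNLL-U file. The function name `count_conllu` and the two sample sentences are illustrative assumptions and are not taken from the corpus; the record does not say which counting convention produced the published figures, so results may differ from them.

```python
def count_conllu(text):
    """Count sentences and syntactic words in CoNLL-U formatted text.

    Assumes the standard CoNLL-U layout: tab-separated columns, comment
    lines starting with '#', and a blank line after each sentence.
    """
    sentences = 0
    tokens = 0
    in_sentence = False
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            # A blank line ends the current sentence.
            in_sentence = False
            continue
        if line.startswith("#"):
            # Sentence-level comment such as "# text = ..."
            continue
        token_id = line.split("\t", 1)[0]
        # Skip multiword token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if "-" in token_id or "." in token_id:
            continue
        if not in_sentence:
            sentences += 1
            in_sentence = True
        tokens += 1
    return sentences, tokens


# Made-up sample for illustration only (not drawn from the corpus).
sample = "\n".join([
    "# text = Аз чета.",
    "1\tАз\tаз\tPRON\t_\t_\t2\tnsubj\t_\t_",
    "2\tчета\tчета\tVERB\t_\t_\t0\troot\t_\t_",
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
    "",
    "# text = Да.",
    "1\tДа\tда\tINTJ\t_\t_\t0\troot\t_\t_",
    "2\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_",
    "",
])

print(count_conllu(sample))
```

Multiword token ranges and empty nodes are skipped here so that only syntactic words are counted; a different convention would need a different filter.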

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-W0328
DateStamp:  2022-10-03
GetRecord:  OAI-PMH request for simple DC format
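
The GetRecord entries above refer to OAI-PMH requests for this record. As a rough sketch, such a request is an HTTP GET with the `verb`, `identifier`, and `metadataPrefix` arguments defined by the OAI-PMH protocol. The base URL below is a placeholder, since the record does not state the repository endpoint, and the function name `get_record_url` is hypothetical. The prefix `oai_dc` is the protocol's standard name for simple Dublin Core; `olac` is assumed here to be the prefix for the OLAC format.

```python
from urllib.parse import urlencode

# Placeholder endpoint: the real base URL is not given in this record.
BASE_URL = "https://example.org/oai"


def get_record_url(identifier, metadata_prefix, base_url=BASE_URL):
    """Build an OAI-PMH GetRecord request URL for one record."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


record_id = "oai:catalogue.elra.info:ELRA-W0328"
print(get_record_url(record_id, "olac"))    # OLAC format (assumed prefix)
print(get_record_url(record_id, "oai_dc"))  # simple DC format
```

`urlencode` percent-encodes the colons in the identifier, which keeps the query string valid regardless of the characters an identifier contains.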

Search Info

Citation: n.a. 2022. ELRA (European Language Resources Association).
Terms: area_Europe country_BG dcmi_Text iso639_bul olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-W0328
Up-to-date as of: Fri Apr 19 6:30:17 EDT 2024