OLAC Record

Title:Bulgarian Event Corpus
Access Rights: Rights available for: attribution
Date Available (W3CDTF):2022-10-03
Date Issued (W3CDTF):2022-10-03
Description:The Bulgarian Event Corpus is composed 324,905 tokens appropriate for training Named Entity Recognition (NER), Named Entity Linking (NEL) and Event Recognition models for Bulgarian in a multidomain context within Humanities. The texts are domain related. They include documents from the area of Social Sciences and Humanities – scientific papers, archive documents, popular documents, and Wikipedia articles in the relevant areas. The annotation scheme reflects the rationale behind the CIDOC-CRM ontology since this ontology has been widely used in the areas of GLAM and Humanities. The annotation scheme envisages two main layers: the first one is the Named Entity (NE) layer - 16 types, and the second one is the event layer where each event is connected to its participants – 39 event labels.
ISLRN: 832-960-876-604-2
Identifier (URI):https://catalog.elra.info/en-us/repository/browse/ELRA-W0329/
Language (ISO639):bul
Medium:Not specified
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text


Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-W0329
DateStamp:  2022-10-03
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2022. ELRA (European Language Resources Association).
Terms: area_Europe country_BG dcmi_Text iso639_bul olac_primary_text

Up-to-date as of: Fri Apr 19 6:30:18 EDT 2024