OLAC Record
oai:www.ldc.upenn.edu:LDC2023T10

Metadata
Title:AIDA Scenario 1 and 2 Reference Knowledge Base
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Tracey, Jennifer, et al. AIDA Scenario 1 and 2 Reference Knowledge Base LDC2023T10. Web Download. Philadelphia: Linguistic Data Consortium, 2023
Contributor:Tracey, Jennifer
Strassel, Stephanie
Getman, Jeremy
Bies, Ann
Griffitt, Kira
Graff, David
Caruso, Christopher
Date (W3CDTF):2023
Date Issued (W3CDTF):2023-10-16
Description:*Introduction* AIDA Scenario 1 and 2 Reference Knowledge Base was developed by the Linguistic Data Consortium (LDC) and contains the English knowledge base (KB) used for all AIDA entity linking annotation in Scenario 1 (Russia-Ukraine Relations) and Scenario 2 (Crisis in Venezuela). The KB content was drawn from GeoNames, the CIA World Leaders List and the CIA World Factbook and was supplemented with manually-created KB entries developed specifically for AIDA data. The DARPA AIDA (Active Interpretation of Disparate Alternatives) program aimed to develop a multi-hypothesis semantic engine to generate explicit alternative interpretations of events, situations and trends from a variety of unstructured sources. LDC supported AIDA by collecting, creating and annotating multimodal linguistic resources in multiple languages. Each phase of the AIDA program focused on a specific scenario, or broad topic area, with related subtopics designated as either practice subtopics or evaluation subtopics. The Phase 1 scenario focused on political relations between Russia and Ukraine in the 2010s. The socioeconomic and political crisis in Venezuela since 2010 was the scenario in Phase 2. *Data* This knowledge base supported the AIDIA entity detection and linking task for 13 entity types: GPE (Geo-Political Entity), LOC (Location), PER (Person), ORG (Organization), FAC (Facility), MHI (Medical/Health Issue), WEA (Weapon), SID (Side), COM (Commodity), CRM (Crime), LAW (Law), VEH (Vehicle), and BAL (Ballot). There are four inputs to the KB: GPE and LOC entities from GeoNames (GEO), PER entities from the CIA World Leaders List (WLL), ORG entities from Appendix B of the CIA World Factbook (APB), and additional entities manually created by LDC. The GEO, WLL and APB entries are also found in LORELEI Entity Detection and Linking Knowledge Base (LDC2010T10). *Acknowledgement* This material is based upon work supported by Air Force Research Laboratory (AFRL) and the Defense Advanced Research Projects Agency (DARPA) under Contract No. FA8750-18-C-0013. *Samples* Please view the following samples: * Alternate Names Sample * Entities Sample * Member States Sample *Updates* None at this time.
Extent:Corpus size: 805034 KB
Identifier:LDC2023T10
https://catalog.ldc.upenn.edu/LDC2023T10
ISLRN: 644-411-403-964-6
DOI: 10.35111/3wzr-h616
Language:English
Language (ISO639):eng
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2023T10
Rights Holder:Portions © 2023 Trustees of the University of Pennsylvania
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2023T10
DateStamp:  2024-01-01
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Tracey, Jennifer; Strassel, Stephanie; Getman, Jeremy; Bies, Ann; Griffitt, Kira; Graff, David; Caruso, Christopher. 2023. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Text iso639_eng olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2023T10
Up-to-date as of: Mon Mar 25 7:21:21 EDT 2024