OLAC Record
oai:www.ldc.upenn.edu:LDC2023T11

Metadata
Title:AIDA Scenario 1 Practice Topic Source Data
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Tracey, Jennifer, et al. AIDA Scenario 1 Practice Topic Source Data LDC2023T11. Web Download. Philadelphia: Linguistic Data Consortium, 2023
Contributor:Tracey, Jennifer
Strassel, Stephanie
Getman, Jeremy
Bies, Ann
Griffitt, Kira
Graff, David
Caruso, Christopher
Date (W3CDTF):2023
Date Issued (W3CDTF):2023-10-16
Description:*Introduction* AIDA Scenario 1 Practice Topic Source Data was developed by the Linguistic Data Consortium (LDC) and is comprised of 1511 root documents (text, image, and video) from English, Russian, and Ukrainian web sources. The DARPA AIDA (Active Interpretation of Disparate Alternatives) program aimed to develop a multi-hypothesis semantic engine to generate explicit alternative interpretations of events, situations and trends from a variety of unstructured sources. LDC supported AIDA by collecting, creating and annotating multimodal linguistic resources in multiple languages. Each phase of the AIDA program centered on a specific scenario, or broad topic area, with related subtopics designated as either practice subtopics or evaluation subtopics. The Phase 1 scenario focused on political relations between Russia and Ukraine in the 2010s. This corpus constitutes the full set of topic-focused documents for Phase 1 practice subtopics. *Data* Data was collected from web sources by a combination of automatic and manual processes. HTML content was converted from its original form into XML. To the extent possible, all resources referenced by a given "root" HTML page (style sheets, javascript, images, media files, etc.) were stored as separate files of the given data type and assigned separate 9-character file-IDs (the same form of ID used for the "root" HTML page). The knowledge base for entity detection and linking annotation for all AIDA Scenario 1 and 2 corpora is available separately as AIDA Scenario 1 and 2 Reference Knowledge Base (LDC2023T10). *Sponsorship* This material is based upon work supported by Air Force Research Laboratory (AFRL) and the Defense Advanced Research Projects Agency (DARPA) under Contract No. FA8750-18-C-0013. *Samples* Please view the following samples: * LTF XML * PSM XML *Updates* None at this time.
Extent:Corpus size: 12558147 KB
Format:Sampling Rate: 44100 Hz
Sampling Format: mpeg
Identifier:LDC2023T11
https://catalog.ldc.upenn.edu/LDC2023T11
ISLRN: 039-139-038-220-6
DOI: 10.35111/pbed-b924
Language:Russian
Ukrainian
English
Language (ISO639):rus
ukr
eng
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2023T11
Rights Holder:Portions © 2014 0342.ua, © 2014 ABC News Internet Ventures, © 2018 About the West, © 2014 Agency for Information and Analytics, © 2015 Al Jazeera Media Network, © 2017-2018 ANO Creative Team Expert, © 2014, 2017 ANO RID Novaya Gazeta, © 2014 ANTIKOR,© 2017 Apostrophe, © 2014 “ARGUMENT,” © 2014, 2017 Arguments and Facts, © 2014 Associated Newspapers Ltd, © 2015 Athens News, © 2014-2017 Autonomous Nonprofit Organization “TV-Novosti,” © 2014-2017 BBC, © 2015, 2017-2018 Bellingcat, © 2014 Bessarabia INFORM, © 2016 Bird In Flight, © 2014 BIZNESGRUPP TOV, © 2014 Boston Globe Media Partners, LLC, © 2018 Business capital, © 2014 BuzzFeed, Inc., © 2014, 2016 Cable News Network. A Warner Bros. Discovery Company, © 2015 Carnegie Endowment for International Peace, © 2015 CBS Interactive Inc., © 2014, 2017 Censor.NET, © 2015 Channel 5, © 2017 Charter ’97, © 2014 CJSC Moskovsky Komsomolets, MK.ru, © 2014 CNBC LLC, © 2014 Colta.com, © 2018 Conflicts and laws, © 2014, 2016 Consortium News, © 2016 Crime NO, © 2015 Dawn of Novorossiya, © 2016 Depo.ua, © 2014 Dicasterium pro Communicationene, © 2016 DutchNews, © 2017 "Echo of the Planet," © 2017 EN.News Front, © 2014, 2017 ESPRESO.TV, © 2015 Eubulletin.com, © 2015-2016 Euromaidan Press, © 2017 Express, © 2014 "Facts and Comments," © 2015 FAN, © 2022 Federal State Budgetary Institution "Editorial Office of Rossiyskaya Gazeta," © 2014-2015 First Channel, © 2014 Focus, © 2015 Forbes Media LLC, © 2017 Future Publishing Limited, Quay House, The Ambury, Bath BA1 1UA, © 2018 Geopoliticalmonitor Intelligence Corp., © 2014 Haaretz Daily Newspaper Ltd., © 2014 Infowars, © 2014 Interlocutor, © 2014-2015, 2017 Golden Mean LLC, © 2017 GolosIslam.RU, © 2014 GORDON, © 2014 Gorlovka.ua, © 2015 Graphic News Ltd, © 2014 Guardian News & Media Limited or its affiliated companies,© 2014 High Castle Online, ©2014 HotAir.com/Salem Media, © 2014 Hürriyet Daily News, © 2017 HVILYA, © 2018 IA "InfoResist," © 2015-2016 IA "Russia Today," © 2014 IBTimes Co., Ltd, © 2017-2018 InA "Ukrainian News," © 2014 InfoKava.com, © 2015 Information agency LIGABusinessInform, © 2014 Information and analytical publication "One Motherland," © 2014 noSMI.ru, © 2014 INSIDER, © 2014 Insider Inc., © 2016 Interfax-Ukraine, © 2014 Internet Television "Piter.TV," © 2014 IP Filin M.S., © 2014, 2017-2018 JSC “Gazeta.Ru,” © 2014-2015, 2017 JSC "Kommersant," © 2014, 2017 JSC ROSBUSINESSCONSULTING,© 2014-2015, 2017 JSC TRK AF RF ZVEZDA, © 2017 Korrespondent.net, © 2017 Lenta.Ru LLC, © 2014 LLC “Kurs,” © 2014 LLC "National Information Systems," © 2015 LLC "Rusevik," © 2017 LLC “UKRAINIAN PRESS GROUP,” © 2014 Los Angeles Times, © 2014 M24, © 2014 Mashable, Inc., © 2015 Max Park, © 2015, MEDIA-DK PUBLISHING HOUSE LLC, © 2014 “MEDIASAPIENS,” © 2015-2016 Meduza, © 2016 mirnews.su, © 2014 “Mirror of the Week. Ukraine," © 2014 Moscow Digital Media LLC, © 2014 Naharnet, © 2018 National Post, a division of Postmedia Network Inc., © 2017 National Bank of News, © 2015 Nationwide News Pty Ltd, © 2014 NBC Universal, © 2014 NDTV Convergence Limited, © 2017 News24 Today, © 2015 News of Ukraine on Rivnist.In.Ua, © 2016 NEWSWEEK DIGITAL LLC, © 2014 NGO "Transcarpathian Free Media," © 2014 Nine Digital Network, © 2014 npr, © 2014-2015, 2017 Online edition "Vesti.Ru," © 2014 ONLINE.UA, © 2014, 2017 Organization for Security and Cooperation in Europe, © 2014 OstroV, © 2014 Paris Match, © 2014 PE "Ukraine Young," © 2017 Politeka, © 2017 "Politic.Kiev.Ua," © 2017 PolitRussia,© 2017 POWER NET, © 2015-2016 Present Time, © 2015-2016 Public Television, © 2014, 2016-2017 Publishing House Komsomolskaya Pravda> JSC, © 2014-2015, 2017 Radio Liberty,© 2014 Rakurs, © 2016 Rambler, © 2017Replyua.net, © 2014 Reuters, © 2014-2015 RFE/RL, Inc., © 2015 Russia Insider, © 2014-2015 segodnya.ua, © 2015 sevascom, © 2017 Spiegel Group, © 2014, 2016 Sputnik, © 2014 SVIT24.NET, © 2014, 2017 TASS, Russian news agency, © 2014-2015 Telegraph Media Group Limited, © 2014-2017 Television and Radio Company Lux, TV Channel 24, © 2014, 2016 Television news service, © 2014 The Atlantic Monthly Group, © 2014 The Christian Science Monitor, © 2014, 2017 The Daily Beast Company LLC, © 2014 The Economist Newspaper Limited, © 2017 The EurAsian Times, © 2017-2018 THE FINANCIAL TIMES LTD, © 2014 The Globe and Mail Inc., © 2015 THE IRISH TIMES, © 2017 The Moscow Times, © 2014-2015 The New York Times Company, © 2014 The Slate Group, © 2018 The Times of Israel, © 2014 The Washington Post, © 2014 The World from PRX, © 2014, 2016 TIME USA, LLC, © 2014 TOPNEWS.RU, © 2014 Toronto Star Newspapers Ltd., © 2014-2016 TOV "KEPRATE PARTNERS," © 2014 Transcarpathia online Beta, © 2014-2015, 2017 TV Center JSC, © 2014–2015, 2017 Tyzhden.ua, © 2014 uapress, © 2014-2016 “Ukrainian media systems,”© 2017 Ukrainian National News Agency, © 2017 Ukrainian Truth, © 2014 Ukrinform, © 2014-2017 UNIAN.NET, © 2014-2017 USA TODAY, a division of Gannett Satellite Information Network, LLC, © 2017 Vchasno LLC, © 2017 Vchasno news agency - Donbass news, © 2014 VGOS INFORMATION AGENCY, © 2014 vinnitsa.info, © 2014 Volyn News Agency, © 2014 VolynPost, © 2014 Voskanapat.info, © 2014 Vox Media, LLC, © 2014 Vysokyi Zamok Publishing House LLC, © 2014 WINDOWS, © 2017 "Word and Deed,"© 2014 worldnewsage.com, © 2015 XINHUANET.com, © 2017-2018 YouTube, LLC, © 2014 ZAKHID.NET LLC, © 2015 zbruc.eu, © 2014 Zhitomir-Online, © 2017 Znaj.ua, © 2022, 2023 Trustees of the University of Pennsylvania
Type (DCMI):MovingImage
Software
Sound
StillImage
Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2023T11
DateStamp:  2024-04-11
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Tracey, Jennifer; Strassel, Stephanie; Getman, Jeremy; Bies, Ann; Griffitt, Kira; Graff, David; Caruso, Christopher. 2023. Linguistic Data Consortium.
Terms: area_Europe country_GB country_RU country_UA dcmi_MovingImage dcmi_Software dcmi_Sound dcmi_StillImage dcmi_Text iso639_eng iso639_rus iso639_ukr olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2023T11
Up-to-date as of: Tue Sep 10 8:13:06 EDT 2024