OLAC Record
oai:www.ldc.upenn.edu:LDC2006T08

Metadata
Title:TimeBank 1.2
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Pustejovsky, James, et al. TimeBank 1.2 LDC2006T08. Web Download. Philadelphia: Linguistic Data Consortium, 2006
Contributor:Pustejovsky, James
Verhagen, Marc
Sauri, Roser
Littman, Jessica
Gaizauskas, Robert
Katz, Graham
Mani, Inderjeet
Knippen, Robert
Setzer, Andrea
Date (W3CDTF):2006
Date Issued (W3CDTF):2006-04-17
Description:*Introduction* TimeBank 1.2 was developed by Brandeis University and contains 183 English news articles with over 27,000 event and temporal annotations, adding events, times and temporal links between events and times. The annotation follows the TimeML 1.2.1 specification. *Data* TimeML aims to capture and represent temporal information. This is accomplished using four primary tag types: TIMEX3 for temporal expressions, EVENT for temporal events, SIGNAL for temporal signals, and LINK for representing relationships. For a detailed description of TimeML, see the TimeML 1.2.1 Specification and Guidelines included in the corpus package documentation. Here are descriptions for each tag: TIMEX3 - Captures dates, times, durations, and sets of dates and times. EVENT - Annotates those elements in a text that mark the semantic events described by it. MAKEINSTANCE - Creates tags for events that include information about a particular instance of the event. When an event participates in a relationship, it is actually the event instance that is referenced. SIGNAL - Annotates temporal function words such as "after," "during," and "when." The following three tags are link tags. They capture temporal, subordination, and aspectual relationships found in the text. These tags do not consume any actual text, but they do relate the four tag types above to each other. TLINK - Temporally relates two temporal expressions, two event instances, or a temporal expression and an event instance. SLINK - Captures subordination relationships that involve event modality, evidentiality, and factuality. ALINK - Captures an aspectual connection between two event instances. TimeBank 1.2 contains 183 articles with just over 61,000 non-punctuation tokens. The count for each TimeML tag is listed below: EVENT 7,935 MAKEINSTANCE 7,940 TIMEX3 1,414 SIGNAL 688 ALINK 265 SLINK 2,932 TLINK 6,418 Total 27,592 *Samples* For an example of the data in this corpus, please view the following sample (XML). *Updates* None at this time.
Extent:Corpus size: 11264 KB
Identifier:LDC2006T08
https://catalog.ldc.upenn.edu/LDC2006T08
ISBN: 1-58563-386-0
ISLRN: 717-712-373-266-4
DOI: 10.35111/09b1-5n19
Language:English
Language (ISO639):eng
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2006T08
Rights Holder:Portions © 1998 American Broadcasting Corporation, © 1998 The Associated Press, © 1998 Cable News Network, LP, LLLP, © 1987-1989 Dow Jones & Company, Inc., © 1998 New York Times, © 1998 Public Radio International, © 2002-2006 Brandeis University, © 2006 Trustees of the University of Pennsylvania

The World is a co-production of Public Radio International and the British Broadcasting Corporation and is produced at WGBH Boston.
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2006T08
DateStamp:  2021-03-12
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Pustejovsky, James; Verhagen, Marc; Sauri, Roser; Littman, Jessica; Gaizauskas, Robert; Katz, Graham; Mani, Inderjeet; Knippen, Robert; Setzer, Andrea. 2006. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Text iso639_eng olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2006T08
Up-to-date as of: Thu Oct 24 7:30:13 EDT 2024