OLAC Record
oai:www.ldc.upenn.edu:LDC2015T08

Metadata
Title:Coordination Annotation for the Penn Treebank
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Kübler, Sandra, Wolfgang Maier, and Erhard Hinrichs. Coordination Annotation for the Penn Treebank LDC2015T08. . Philadelphia: Linguistic Data Consortium, 2015
Contributor:Kübler, Sandra
Maier, Wolfgang
Hinrichs, Erhard
Date (W3CDTF):2015
Date Issued (W3CDTF):2015-05-15
Description:*Introduction* Coordination Annotation for the Penn Treebank is a stand-off annotation for the Wall Street Journal portion of Treebank-3 (PTB3) (LDC99T42) developed by researchers at the University of Düsseldorf and Indiana University. It marks all tokens that have a coordinating function (potentially among other functions). Coordination is a syntactic structure that links together two or more elements known as conjuncts or conjoins. The presence of coordination is often signaled by the appearance of a coordinator (coordinating conjunction), such as and, or, but in English. Penn Coordination Annotation is available at no cost to all licensees of PTB3 and appears in their download queue associated with LDC99T42 as penn_coordination_anno_LDC2015T08.tgz. *Data* This annotation is presented in a single UTF-8 plain text tsv file with columns as follows: * section: Penn Treebank WSJ section number * file: Number of file within section * sentence: Number of sentence (starting with 0) * token: Number of token (starting with 0) * annotation: "P" if the token is a coordinating punctuation, "O" otherwise *Samples* Please view this sample. *Updates* None at this time.
Extent:Corpus size: 19528 KB
Identifier:LDC2015T08
https://catalog.ldc.upenn.edu/LDC2015T08
ISBN: 1-58563-714-9
ISLRN: 060-785-139-403-2
DOI: 10.35111/ekgv-et49
Language:English
Language (ISO639):eng
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2015T08
Rights Holder:Portions © 2015 Sandra Kübler, Wolfgang Maier, Trustees of the University of Pennsylvania
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2015T08
DateStamp:  2020-11-30
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Kübler, Sandra; Maier, Wolfgang; Hinrichs, Erhard. 2015. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Text iso639_eng olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2015T08
Up-to-date as of: Mon Mar 25 7:20:44 EDT 2024