OLAC Record oai:www.ldc.upenn.edu:LDC2015T08 |
Metadata | ||
Title: | Coordination Annotation for the Penn Treebank | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Kübler, Sandra, Wolfgang Maier, and Erhard Hinrichs. Coordination Annotation for the Penn Treebank LDC2015T08. . Philadelphia: Linguistic Data Consortium, 2015 | |
Contributor: | Kübler, Sandra | |
Maier, Wolfgang | ||
Hinrichs, Erhard | ||
Date (W3CDTF): | 2015 | |
Date Issued (W3CDTF): | 2015-05-15 | |
Description: | *Introduction* Coordination Annotation for the Penn Treebank is a stand-off annotation for the Wall Street Journal portion of Treebank-3 (PTB3) (LDC99T42) developed by researchers at the University of Düsseldorf and Indiana University. It marks all tokens that have a coordinating function (potentially among other functions). Coordination is a syntactic structure that links together two or more elements known as conjuncts or conjoins. The presence of coordination is often signaled by the appearance of a coordinator (coordinating conjunction), such as and, or, but in English. Penn Coordination Annotation is available at no cost to all licensees of PTB3 and appears in their download queue associated with LDC99T42 as penn_coordination_anno_LDC2015T08.tgz. *Data* This annotation is presented in a single UTF-8 plain text tsv file with columns as follows: * section: Penn Treebank WSJ section number * file: Number of file within section * sentence: Number of sentence (starting with 0) * token: Number of token (starting with 0) * annotation: "P" if the token is a coordinating punctuation, "O" otherwise *Samples* Please view this sample. *Updates* None at this time. | |
Extent: | Corpus size: 19528 KB | |
Identifier: | LDC2015T08 | |
https://catalog.ldc.upenn.edu/LDC2015T08 | ||
ISBN: 1-58563-714-9 | ||
ISLRN: 060-785-139-403-2 | ||
DOI: 10.35111/ekgv-et49 | ||
Language: | English | |
Language (ISO639): | eng | |
License: | LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC2015T08 | |
Rights Holder: | Portions © 2015 Sandra Kübler, Wolfgang Maier, Trustees of the University of Pennsylvania | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC2015T08 | |
DateStamp: | 2020-11-30 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Kübler, Sandra; Maier, Wolfgang; Hinrichs, Erhard. 2015. Linguistic Data Consortium. | |
Terms: | area_Europe country_GB dcmi_Text iso639_eng olac_primary_text |