OLAC Record oai:www.ldc.upenn.edu:LDC2005T08 |
Metadata | ||
Title: | Discourse Graphbank | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Wolf, Florian, et al. Discourse Graphbank LDC2005T08. Web Download. Philadelphia: Linguistic Data Consortium, 2005 | |
Contributor: | Wolf, Florian | |
Gibson, Edward | ||
Fisher, Amy | ||
Knight, Meredith | ||
Date (W3CDTF): | 2005 | |
Date Issued (W3CDTF): | 2005-03-15 | |
Description: | *Introduction* Discourse Graphbank contains 135 newswire texts totalling 70,000 words annotated with coherence relations. The project was Florian Wolf's PhD thesis and aimed to define a descriptively adequate data structure for representing discourse coherence structures, investigated the impact of discourse coherence structures on other linguistic processes and natural language applications (e.g. anaphora resolution, summarization and information retrieval), and developed and tested discourse parsing algorithms. *Data* The source data consists of Assoicated Press and Wall Street Journal newswire data from TIPSTER Complete (LDC93T3A) annotated with coherence relations. The data was annotated by two independent annotators with 88% agreement. The annotators notated 11 types of coherence relations: Resemblance relations Parallel Contrast Example Generalization Elaboration Cause-Effect relations Explanation Violated Expectation Condition Temporal Sequence relation Attribution relation Same relation *Samples* For an example of the data in this corpus, please view this sample (JPG). *Updates* None at this time. | |
Identifier: | LDC2005T08 | |
https://catalog.ldc.upenn.edu/LDC2005T08 | ||
ISBN: 1-58563-320-8 | ||
ISLRN: 983-656-398-539-6 | ||
DOI: 10.35111/7snd-y397 | ||
Language: | English | |
Language (ISO639): | eng | |
License: | LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf | |
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC2005T08 | |
Rights Holder: | Portions © 1988-1990 Associated Press, © 1986-1989 Dow Jones & Company, Inc., © 2005 Trustees of the University of Pennsylvania | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC2005T08 | |
DateStamp: | 2021-11-15 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Wolf, Florian; Gibson, Edward; Fisher, Amy; Knight, Meredith. 2005. Linguistic Data Consortium. | |
Terms: | area_Europe country_GB dcmi_Text iso639_eng olac_primary_text |