OLAC Record oai:www.ldc.upenn.edu:LDC2019T07 |
Metadata | ||
Title: | Chinese Abstract Meaning Representation 1.0 | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Li, Bin, et al. Chinese Abstract Meaning Representation 1.0 LDC2019T07. Web Download. Philadelphia: Linguistic Data Consortium, 2019 | |
Contributor: | Li, Bin | |
Wen, Yuan | ||
Song, Li | ||
Dai, Rubing | ||
Qu, Weiguang | ||
Xue, Nianwen | ||
Date (W3CDTF): | 2019 | |
Date Issued (W3CDTF): | 2019-04-15 | |
Description: | *Introduction* Chinese Abstract Meaning Representation was developed by Brandeis University and Nanjing Normal University and is comprised of semantic representations of a set of Chinese sentences from Chinese Treebank 8.0 (LDC2013T21). Abstract Meaning Representation (AMR) captures "who is doing what to whom" in a sentence. Each sentence is paired with a graph that represents its whole-sentence meaning in a tree structure. LDC has released the following AMR English data sets: Abstract Meaning Representation (AMR) Annotation Release 1.0 (LDC2014T12) and Abstract Meaning Representation (AMR) Annotation Release 2.0 (LDC2017T10). Chinese AMR is based on the annotation methodology developed for English with adaptations for handling specific Chinese phenomena. The goal of the Chinese AMR project is to create a large aligned AMR corpus, of which this data set is the first release. For more information about the project, see the Chinese AMR homepage. *Data* The text is extracted from the 10,325 sentences of the weblog and discussion forum portions of Chinese Treebank 8.0. Annotations were applied to 10,149 sentences, with 176 sentences unannotated. The data is divided into training, development and test sets. These three files are presented as plain text in UTF-8 encoding. *Samples* Please view this sample. *Updates* None at this time. | |
Extent: | Corpus size: 12776 KB | |
Identifier: | LDC2019T07 | |
https://catalog.ldc.upenn.edu/LDC2019T07 | ||
ISBN: 1-58563-880-3 | ||
ISLRN: 376-537-072-369-4 | ||
DOI: 10.35111/8ddt-ze77 | ||
Language: | Mandarin Chinese | |
Language (ISO639): | cmn | |
License: | LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf | |
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC2019T07 | |
Rights Holder: | Portions © 2019 Bin Li, © 2001, 2004, 2005, 2007, 2009, 2010, 2013, 2019 Trustees of the University of Pennsylvania | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC2019T07 | |
DateStamp: | 2020-11-30 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Li, Bin; Wen, Yuan; Song, Li; Dai, Rubing; Qu, Weiguang; Xue, Nianwen. 2019. Linguistic Data Consortium. | |
Terms: | area_Asia country_CN dcmi_Text iso639_cmn olac_primary_text |