OLAC Record

Title:Chinese Proposition Bank 3.0
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Xue, Nianwen, et al. Chinese Proposition Bank 3.0 LDC2013T13. Web Download. Philadelphia: Linguistic Data Consortium, 2013
Contributor:Xue, Nianwen
Bai, Xiaopeng
Lu, Jill
Zhang, Jennifer
Palmer, Martha
Chang, Meiyu
Zhong, Hua
Date (W3CDTF):2013
Date Issued (W3CDTF):2013-07-15
Description:Chinese Proposition Bank 3.0 is a continuation of the Chinese Proposition Bank project which aims to create a corpus of text annotated with information about basic semantic propositions. Chinese Proposition Bank 3.0 adds predicate-argument annotation on 187,731 words from Chinese Treebank 7.0 (LDC2010T07). The data sources are comprised of newswire, magazine articles, various broadcast news and broadcast conversation programming, web newsgroups and weblogs. LDC has also released Chinese Proposition Bank 1.0 (LDC2005T23) and Chinese Proposition Bank 2.0 (LDC2008T07). *Data* This release contains the predicate-argument annotation of 173,206 verb instances and 14,525 noun instances. The annotation of nouns is limited to nominalizations that have a corresponding verb. The general annotation guidelines and the lexical guidelines (called frame files) for each verbal and nominal predicate are also included in this release. Below are some statistics about the corpus. * Total propositions for verbs - 173,206 * Total propositions for nouns - 14,525 * Total verbs framed - 24,642 * Total framesets - 26,467 * Verbs with multiple framesets - 1337 * Average framesets per verb - 1.07 * Total nouns framed - 1,421 * Total noun framesets - 1,528 * Nouns with multiple framesets - 48 * Average framesets per nouns - 1.08 *Samples* Please view the following samples. * Noun Sample * Verb Sample * XML Sample *Updates* None at this time.
Extent:Corpus size: 217088 KB
ISBN: 1-58563-648-7
ISLRN: 460-638-744-650-2
Language:Mandarin Chinese
Language (ISO639):cmn
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2013T13
Rights Holder:Portions © 2006 Agence France Presse, © 2006 Anhui TV, © 2005 Cable News Network, LP, LLLP, © 2000-2001 China Broadcasting System, © 2000-2001, 2005-2006 China Central TV, © 2000-2001 China National Radio, © 2006 Chinanews.com, © 2000-2001 China Television System, © 2006 Guangming Daily, © 2006 National Broadcasting Company, Inc., © 2006 New Tang Dynasty TV, © 2006 Peoples Daily Online, © 2005-2006 Phoenix TV, © 1999-2001 Sinorama Magazine, © 1996-1998, 2006 Xinhua News Agency, © 2001, 2004, 2005, 2007, 2008, 2009, 2010, 2013 Trustees of the University of Pennsylvania
Type (DCMI):Text
Type (OLAC):primary_text


Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2013T13
DateStamp:  2019-12-12
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Xue, Nianwen; Bai, Xiaopeng; Lu, Jill; Zhang, Jennifer; Palmer, Martha; Chang, Meiyu; Zhong, Hua. 2013. Linguistic Data Consortium.
Terms: area_Asia country_CN dcmi_Text iso639_cmn olac_primary_text

Up-to-date as of: Sun Aug 2 15:58:41 EDT 2020