OLAC Record

Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Santus, Enrico, Hongchao Liu, and Chu-Ren Huang. EVALution LDC2020T06. Web Download. Philadelphia: Linguistic Data Consortium, 2020
Contributor:Santus, Enrico
Liu, Hongchao
Huang, Chu-Ren
Date (W3CDTF):2020
Date Issued (W3CDTF):2020-03-13
Description:*Introduction* EVALution was developed by The Hong Kong Polytechnic University. It is comprised of English and Mandarin Chinese data sets -- EVALution 1.0 and EVALution-Man, respectively -- that contain semantic relations and metadata for training and evaluating distributional semantic models. *Data* EVALution 1.0 consists of approximately 7500 English tuples extracted from ConceptNet 5.0 and WordNet and filtered through automatic methods and crowd-sourcing. Several semantic relations between word pairs were instantiated, including hypernymy, synonymy, antonymy and meronymy. The corpus also includes additional information that can be used to filter the pairs or to analyze the results, such as relation domain, word frequency, word part-of-speech and word semantic field. EVALution-MAN consists of Chinese word pairs from two sources: Chinese Wordnet and humans who completed an elicitation task by supplying missing words to sentences. The human-supplied sentence word pairs were then judged by human raters for reliability. All text data is presented as UTF-8 encoded tab separated plain text. *Samples* Please view this EVALutaion 1.0 sample and EVALution-MAN sample. *Updates* None at this time.
Extent:Corpus size: 1976 KB
ISBN: 1-58563-921-4
ISLRN: 490-239-801-102-1
DOI: 10.35111/4h9q-yt20
Language:Mandarin Chinese
Language (ISO639):cmn
License:EVALution Agreement: https://catalog.ldc.upenn.edu/license/evalution-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2020T06
Rights Holder:Portions © 2020 The Hong Kong Polytechnic University, © 2020 Trustees of the University of Pennsylvania
Type (DCMI):Text
Type (OLAC):primary_text


Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2020T06
DateStamp:  2021-01-01
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Santus, Enrico; Liu, Hongchao; Huang, Chu-Ren. 2020. Linguistic Data Consortium.
Terms: area_Asia area_Europe country_CN country_GB dcmi_Text iso639_cmn iso639_eng olac_primary_text

Up-to-date as of: Tue May 7 7:25:46 EDT 2024