OLAC Record

Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Wang, Shichang, et al. SemTransCNC LDC2020T12. Web Download. Philadelphia: Linguistic Data Consortium, 2020
Contributor:Wang, Shichang
Huang, Chu-Ren
Yao, Yao
Chan, Angel
Date (W3CDTF):2020
Date Issued (W3CDTF):2020-06-22
Description:*Introduction* SemTransCNC was developed by The Hong Kong Polytechnic University. It is comprised of a semantic transparency dataset of Chinese nominal compounds built using a series of crowd-based experiments. Nominal compounds were selected from the Sinica Corpus and a modern Chinese lexicon. Crowd workers answered questionnaires that included demographic information and questions about the Chinese language. For assessing overall semantic transparency (OST) of selected compounds, they answered the question: "How is the sum of the meanings of A and B similar to the meaning of AB?" For assessing constituent semantic transparency (CST), they were asked to describe the similarity of A alone to its meaning in AB and the meaning of B alone to its meaning in AB. *Data* SemTransCNC consists of OST and CST data for 1,176 dimorphemic Chinese nominal compounds, which consist of free morphemes and have mid-range frequencies. The text data is presented as a UTF-8 encoded comma separated text file. *Samples* Please view this text sample (CSV). *Updates* None at this time.
Extent:Corpus size: 140 KB
ISBN: 1-58563-931-1
ISLRN: 835-247-023-332-5
DOI: 10.35111/vreb-7n07
Language:Mandarin Chinese
Language (ISO639):cmn
License:SemTransCNC Agreement: https://catalog.ldc.upenn.edu/license/semtranscnc-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2020T12
Rights Holder:Portions © 2020 The Hong Kong Polytechnic University, © 2020 Trustees of the University of Pennsylvania
Type (DCMI):Text
Type (OLAC):primary_text


Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2020T12
DateStamp:  2021-01-01
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Wang, Shichang; Huang, Chu-Ren; Yao, Yao; Chan, Angel. 2020. Linguistic Data Consortium.
Terms: area_Asia country_CN dcmi_Text iso639_cmn olac_primary_text

Up-to-date as of: Tue May 7 7:25:48 EDT 2024