OLAC Record oai:www.ldc.upenn.edu:LDC2014T24 |
Metadata | ||
Title: | Boulder Lies and Truth | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Salvetti, Franco. Boulder Lies and Truth LDC2014T24. Web Download. Philadelphia: Linguistic Data Consortium, 2014 | |
Contributor: | Salvetti, Franco | |
Date (W3CDTF): | 2014 | |
Date Issued (W3CDTF): | 2014-11-15 | |
Description: | *Introduction* Boulder Lies and Truth was developed at the University of Colorado Boulder and contains approximately 1,500 elicited English reviews of hotels and electronics for the purpose of studying deception in written language. Reviews were collected by crowd-sourcing with Amazon Medical Turk. Each review was required to be original and was checked for plagiarism against the web. Reviews were annotated with respect to the following three dimensions: * Domain: Electronics (e.g., iPhone) or Hotels * Sentiment: Positive or Negative * Truth Value: * a) Truthful: a review about an object known by the writer reflecting the real sentiment of the writer toward the object of the review * b) Opposition: A review about an object known by the writer reflecting the opposite sentiment of the writer toward the object of the review (i.e., if the writer liked the object they were asked to write a negative review; if the writer did not like the object, they were asked to write a positive review) * c) Deceptive (i.e., fabricated): a review written about an object not known by the writer either positive or negative in sentiment; the objects reviewed were provided via a URL from the tasks in (a) and (b) *Data* Each review was judged a total of 30 times: (1) 10 times to evaluate its perceived quality (on a range from 1-5); (2) 10 times with judgments about its perceived truthfulness (e.g., truthful or somehow deceptive, a lie or a fabrication); and (3) 10 times for its perceived sentiment (i.e., star rating). The following metadata is available for each review: * time consumed by the writer to write the review * a pair review ID coupling the two reviews (positive/negative) written about the same object by the same person, either false or truthful * the ID of the writer who wrote the review * the writer's disclosure as to whether the object to be reviewed was already used and/or known to the writer * the URL identifying an instance of the object (i.e., hotel or electronic product) on the web * a flag for plagiarized reviews * a marker for reviews that may be removed from the corpus * the reasons for rejecting a review *Samples* Please view this sample. *Updates* None at this time. | |
Extent: | Corpus size: 4280 KB | |
Identifier: | LDC2014T24 | |
https://catalog.ldc.upenn.edu/LDC2014T24 | ||
ISBN: 1-58563-695-9 | ||
ISLRN: 974-370-635-113-0 | ||
DOI: 10.35111/tj47-sd65 | ||
Language: | English | |
Language (ISO639): | eng | |
License: | Boulder Lies and Truth: https://catalog.ldc.upenn.edu/license/boulder-lies-and-truth.pdf | |
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Rights Holder: | Portions © 2014 Franco Salvetti, © 2014 Trustees of the University of Pennsylvania | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC2014T24 | |
DateStamp: | 2020-11-30 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Salvetti, Franco. 2014. Linguistic Data Consortium. | |
Terms: | area_Europe country_GB dcmi_Text iso639_eng olac_primary_text |