OLAC Record
oai:www.ldc.upenn.edu:LDC2014T24

Metadata
Title:Boulder Lies and Truth
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Salvetti, Franco. Boulder Lies and Truth LDC2014T24. Web Download. Philadelphia: Linguistic Data Consortium, 2014
Contributor:Salvetti, Franco
Date (W3CDTF):2014
Date Issued (W3CDTF):2014-11-15
Description:*Introduction*
Boulder Lies and Truth was developed at the University of Colorado Boulder and contains approximately 1,500 elicited English reviews of hotels and electronics for the purpose of studying deception in written language. Reviews were collected by crowd-sourcing with Amazon Mechanical Turk. Each review was required to be original and was checked for plagiarism against the web.
Reviews were annotated with respect to the following three dimensions:
* Domain: Electronics (e.g., iPhone) or Hotels
* Sentiment: Positive or Negative
* Truth Value:
  * a) Truthful: a review about an object known by the writer, reflecting the real sentiment of the writer toward the object of the review
  * b) Opposition: a review about an object known by the writer, reflecting the opposite sentiment of the writer toward the object of the review (i.e., if the writer liked the object, they were asked to write a negative review; if the writer did not like the object, they were asked to write a positive review)
  * c) Deceptive (i.e., fabricated): a review, either positive or negative in sentiment, written about an object not known by the writer; the objects reviewed were provided via a URL from the tasks in (a) and (b)
*Data*
Each review was judged a total of 30 times: (1) 10 times to evaluate its perceived quality (on a scale of 1-5); (2) 10 times for its perceived truthfulness (e.g., truthful or somehow deceptive, a lie or a fabrication); and (3) 10 times for its perceived sentiment (i.e., star rating).
The following metadata is available for each review:
* the time the writer took to write the review
* a pair review ID coupling the two reviews (positive/negative) written about the same object by the same person, either false or truthful
* the ID of the writer who wrote the review
* the writer's disclosure as to whether the object to be reviewed was already used and/or known to the writer
* the URL identifying an instance of the object (i.e., hotel or electronic product) on the web
* a flag for plagiarized reviews
* a marker for reviews that may be removed from the corpus
* the reasons for rejecting a review
*Samples*
Please view this sample.
*Updates*
None at this time.
Extent:Corpus size: 4280 KB
Identifier:LDC2014T24
https://catalog.ldc.upenn.edu/LDC2014T24
ISBN: 1-58563-695-9
ISLRN: 974-370-635-113-0
DOI: 10.35111/tj47-sd65
Language:English
Language (ISO639):eng
License:Boulder Lies and Truth: https://catalog.ldc.upenn.edu/license/boulder-lies-and-truth.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Rights Holder:Portions © 2014 Franco Salvetti, © 2014 Trustees of the University of Pennsylvania
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2014T24
DateStamp:  2020-11-30
GetRecord:  OAI-PMH request for simple DC format
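
The GetRecord entries in this record correspond to OAI-PMH requests that take a verb, a record identifier, and a metadata prefix. A minimal sketch of how such request URLs might be built follows. BASE_URL is a hypothetical placeholder, not the archive's real endpoint, and the prefixes "olac" and "oai_dc" are assumed to be the ones the repository uses for the OLAC and simple DC formats.

```python
from urllib.parse import urlencode

# Hypothetical placeholder: substitute the repository's actual OAI-PMH base URL.
BASE_URL = "https://example.org/oai"

# Identifier taken from the OaiIdentifier field of this record.
OAI_IDENTIFIER = "oai:www.ldc.upenn.edu:LDC2014T24"


def get_record_url(identifier: str, metadata_prefix: str,
                   base_url: str = BASE_URL) -> str:
    """Build an OAI-PMH GetRecord request URL for a single record."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


# Assumed prefixes: "olac" for the OLAC format, "oai_dc" for simple DC.
olac_url = get_record_url(OAI_IDENTIFIER, "olac")
dc_url = get_record_url(OAI_IDENTIFIER, "oai_dc")
```

The two URLs differ only in the metadataPrefix parameter. The colons in the identifier are percent-encoded by urlencode, so they pass through the query string unchanged in meaning.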

Search Info

Citation: Salvetti, Franco. 2014. Boulder Lies and Truth. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Text iso639_eng olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2014T24
Up-to-date as of: Thu Oct 24 7:30:46 EDT 2024