OLAC Record
oai:clarin.eurac.edu:20.500.12124/7

Metadata
Title:DIDI - The DiDi Corpus of South Tyrolean CMC 1.0.0
Bibliographic Citation:http://hdl.handle.net/20.500.12124/7
Creator:Frey, Jennifer-Carmen
Glaznieks, Aivars
Stemle, Egon W.
Date (W3CDTF):2019-03-07T17:58:26Z
Date Available:2019-03-07T17:58:26Z
Description:The DiDi corpus has an overall size of around 600,000 tokens gathered from 136 South Tyrolean Facebook users who participated in the DiDi project. It consists of 11,102 Facebook wall posts, 6,507 wall comments and 22,218 private messages. All messages were written by the participants during the year 2013. Please read the full description of the corpus for further details. Please also consider the description of the method of data collection and the full description of the DiDi project and its research questions. As every participant could offer either his/her private messages, his/her texts on the wall, or both, the corpus comprises wall posts and wall comments from 130 profiles and private messages from 56 profiles; 50 participants granted access to both types of data. Free access to the corpus is given for the wall posts and comments. Due to privacy concerns, access to the private messages is restricted. Access to the private messages can be given for scientific research only, after signing a non-disclosure agreement. If you are interested in the data for scientific reasons, please contact the research team. All texts were anonymised in order to guarantee that the participants' identity cannot be inferred from the texts. The anonymisation included person names, group names, geographical names and adjectival references, institution names, hyperlinks, mail addresses, phone numbers, numbers of bank accounts, servers, postal codes and other private information. Please read the anonymisation document for the anonymisation keys. The corpus offers a vast range of research opportunities for linguists who are interested in CMC in general, and more specifically in multilingual language use, the use of regional varieties, code switching, code shifting and code mixing phenomena, etc. Access to the DiDi corpus: https://commul.eurac.edu/annis/didi
Identifier (URI):http://hdl.handle.net/20.500.12124/7
Language:German
Italian
English
Ladino
Language (ISO639):deu
ita
eng
lad
Publisher:Institute for Applied Linguistics, Eurac Research
Rights:CLARIN ACADEMIC END-USER LICENCE (ACA-BY-NC-NORED 1.0) - DIDI
https://gitlab.inf.unibz.it/commul/didi/data-bundle/blob/v1.0.0/EULA-CLARIN-ACA-BY-NC-NORED.pdf
Subject:Facebook
Social Media
Computer-mediated Communication
Chat
Status Updates
Comment
Social Networking Sites
Multilingualism
Dialect
South Tyrol
Instant Messaging
CMC
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  Eurac Research CLARIN Centre
Description:  http://www.language-archives.org/archive/clarin.eurac.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:clarin.eurac.edu:20.500.12124/7
DateStamp:  2019-09-19
GetRecord:  OAI-PMH request for simple DC format
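
The GetRecord entries above refer to OAI-PMH requests for this record. As a minimal sketch of how such a request URL might be assembled: the query parameters (verb, identifier, metadataPrefix) follow the OAI-PMH protocol, but the base URL below is a hypothetical placeholder, since this record does not state the repository's endpoint, and the prefixes "olac" and "oai_dc" are assumed to be the ones the repository exposes for the OLAC and simple DC formats.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: the record does not give the repository's OAI-PMH base URL.
BASE_URL = "https://example.org/oai/request"

# Identifier taken from the OaiIdentifier field of this record.
OAI_IDENTIFIER = "oai:clarin.eurac.edu:20.500.12124/7"


def get_record_url(identifier: str, metadata_prefix: str, base_url: str = BASE_URL) -> str:
    """Build an OAI-PMH GetRecord request URL for one record."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


# Assumed prefixes: "olac" for the OLAC format, "oai_dc" for simple Dublin Core.
olac_url = get_record_url(OAI_IDENTIFIER, "olac")
dc_url = get_record_url(OAI_IDENTIFIER, "oai_dc")
print(olac_url)
print(dc_url)
```

Replacing BASE_URL with the repository's actual endpoint would be needed before either URL returns the record.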

Search Info

Citation: Frey, Jennifer-Carmen; Glaznieks, Aivars; Stemle, Egon W. 2019. Institute for Applied Linguistics, Eurac Research.
Terms: area_Asia area_Europe country_DE country_GB country_IL country_IT dcmi_Text iso639_deu iso639_eng iso639_ita iso639_lad olac_primary_text


http://www.language-archives.org/item.php/oai:clarin.eurac.edu:20.500.12124/7
Up-to-date as of: Sun Oct 25 7:30:06 EDT 2020