OLAC Record

Title:JANA: A Human-Human Dialogues Corpus for Egyptian Dialect
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Elmadany, AbdelRahim A., Sherif Abdou, and Mervat Gheith. JANA: A Human-Human Dialogues Corpus for Egyptian Dialect LDC2016T24. Web Download. Philadelphia: Linguistic Data Consortium, 2016
Contributor:Elmadany, AbdelRahim A.
Abdou, Sherif M.
Gheith, Mervat
Date (W3CDTF):2016
Date Issued (W3CDTF):2016-11-15
Description:*Introduction* JANA: A Human-Human Dialogues Corpus for Egyptian Dialect was developed by researchers at Cairo University. It consists of 82 transcribed dialogues from call center inquiries, annotated for dialogue acts. Data was collected from call centers for banks, airlines and mobile network providers in two forms: (1) spontaneous spoken dialogues from inquiries to banks and airlines; and (2) instant messaging (chat) dialogues from a mobile network provider's online support system.
*Data* The transcribed dialogues consist of 52 telephone calls and 30 instant messaging conversations, amounting to approximately 20,311 words. The data contains roughly 3,001 conversation turns, with an average of 6.7 words per turn, and 4,725 utterances, with an average of 4.3 words per utterance. The data was transcribed using Transcriber. All data is presented as UTF-8 XML.
*Updates* None at this time.
*Pricing* Not-for-profit organizations may license this data set for US$25.00 under the LDC Not-for-Profit Membership Agreement or under the LDC User Agreement for Non-Members, for use in linguistic research, education and non-commercial technology development. For-profit organizations may license this data for US$1,650 under the Commercial License Agreement for JANA: A Human-Human Dialogues Corpus for Egyptian Dialect (LDC2016T24). Current fees in this catalog entry reflect those pertaining to a for-profit organization license. Not-for-profit organizations should contact LDC's Membership Office to license this data set.
Extent:Corpus size: 1128 KB
ISBN: 1-58563-777-7
ISLRN: 498-037-802-860-2
DOI: 10.35111/r55z-6547
Egyptian Arabic
Language (ISO639):ara
License:JANA: A Human-Human Dialogues Corpus for Egyptian Dialect Agreement (For-profit): https://catalog.ldc.upenn.edu/license/jana-a-human-human-dialogues-corpus-for-egyptian-dialect-agreement-for-profit.pdf
LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2016T24
Rights Holder:Portions © 2016 AbdelRahim AbdelSabour AbdelHalim Mohamed Elmadany, © 2016 Trustees of the University of Pennsylvania
Type (DCMI):Text
Type (OLAC):primary_text
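
The per-turn and per-utterance averages quoted in the Description can be checked against the stated totals. The following is a minimal sketch that uses only the figures given in the record; the variable names are illustrative, and the source itself describes the counts as approximate.

```python
# Figures copied from the Description field of this record.
TOTAL_WORDS = 20_311
TURNS = 3_001
UTTERANCES = 4_725

# Average length of a conversation turn and of an utterance, in words.
words_per_turn = TOTAL_WORDS / TURNS
words_per_utterance = TOTAL_WORDS / UTTERANCES

print(f"words per turn:      {words_per_turn:.2f}")
print(f"words per utterance: {words_per_utterance:.2f}")
```

The computed values come out close to the reported 6.7 and 4.3. The per-turn figure is slightly above 6.7, which would be consistent with the record truncating rather than rounding, though the record does not say how its averages were derived.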


Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2016T24
DateStamp:  2020-11-30
GetRecord:  OAI-PMH request for simple DC format
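
The GetRecord entries above refer to OAI-PMH requests for this record in the OLAC and simple DC formats. As a rough sketch of how such a request URL is assembled, the snippet below combines the OaiIdentifier from this record with the standard OAI-PMH query parameters (`verb`, `identifier`, `metadataPrefix`). The base URL is a placeholder, since the record does not state the repository endpoint, and `build_get_record_url` is a hypothetical helper name.

```python
from urllib.parse import urlencode

# Placeholder endpoint: the actual repository base URL is NOT given in this record.
BASE_URL = "https://example.org/oai"

# Identifier taken from the OaiIdentifier field of this record.
OAI_ID = "oai:www.ldc.upenn.edu:LDC2016T24"


def build_get_record_url(base_url: str, identifier: str, metadata_prefix: str) -> str:
    """Return an OAI-PMH GetRecord URL for one record in one metadata format."""
    query = urlencode(
        {
            "verb": "GetRecord",
            "identifier": identifier,
            "metadataPrefix": metadata_prefix,
        }
    )
    return f"{base_url}?{query}"


# "olac" requests the OLAC format; "oai_dc" requests simple Dublin Core.
olac_url = build_get_record_url(BASE_URL, OAI_ID, "olac")
dc_url = build_get_record_url(BASE_URL, OAI_ID, "oai_dc")

print(olac_url)
print(dc_url)
```

The colons in the identifier are percent-encoded by `urlencode`, which keeps the query string valid regardless of the characters an identifier contains.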

Search Info

Citation: Elmadany, AbdelRahim A.; Abdou, Sherif M.; Gheith, Mervat. 2016. Linguistic Data Consortium.
Terms: area_Africa country_EG dcmi_Text iso639_ara iso639_arz olac_primary_text

Up-to-date as of: Tue Feb 13 6:33:16 EST 2024