OLAC Record
oai:www.ldc.upenn.edu:LDC97S45

Metadata
Title:CALLHOME Egyptian Arabic Speech
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Canavan, Alexandra, George Zipperlen, and David Graff. CALLHOME Egyptian Arabic Speech LDC97S45. Web Download. Philadelphia: Linguistic Data Consortium, 1997
Contributor:Canavan, Alexandra
Zipperlen, George
Graff, David
Date (W3CDTF):1997
Description:*Introduction* The CALLHOME Egyptian Arabic corpus of telephone speech consists of 120 unscripted telephone conversations between native speakers of Egyptian Colloquial Arabic (ECA), the spoken variety of Arabic found in Egypt. The dialect of ECA that this dictionary represents is Cairene Arabic. *Data* All calls, which lasted up to 30 minutes, originated in North America and were placed to locations overseas (typically Egypt). Most participants called family members or close friends. This corpus contains speech data files ONLY, along with the minimal amount of documentation needed to describe the contents and format of the speech files and the software packages needed to uncompress the speech data. The transcripts and documentation (LDC97T19) are available separately, as is an associated lexicon (LDC99L22). *Samples* Please listen to this speech sample. *Updates* The "shorten" and "sphere" directories have been removed. The sphere directory contained NIST "SPeech HEader REsources" (SPHERE): C-language source code libraries and utilities for manipulating NIST SPHERE-format waveform files. The shorten directory contained files for Tony Robinson's "shorten" software for speech compression. A more recent version of the SPHERE utilities is now available on the NIST web site; additional utilities for converting from SPHERE to other waveform file formats is also available at the LDC web site.
Extent:Corpus size: 1807744 KB
Format:Sampling Rate: 8000
Sampling Format: 2-channel ulaw
Identifier:LDC97S45
https://catalog.ldc.upenn.edu/LDC97S45
ISBN: 1-58563-114-0
ISLRN: 102-150-894-143-2
Language:Egyptian Arabic
Language (ISO639):arz
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC97S45
Rights Holder:Portions © 1996-1997 Trustees of the University of Pennsylvania
Type (DCMI):Sound
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC97S45
DateStamp:  2019-01-03
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Canavan, Alexandra; Zipperlen, George; Graff, David. 1997. Linguistic Data Consortium.
Terms: area_Africa country_EG dcmi_Sound iso639_arz olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC97S45
Up-to-date as of: Sun Sep 1 18:17:18 EDT 2019