OLAC Record

Title:Korean Telephone Conversations Complete Set
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Han, Na-Rae, et al. Korean Telephone Conversations Complete Set LDC2003S07. Web Download. Philadelphia: Linguistic Data Consortium, 2003
Contributor:Han, Na-Rae
Ko, Eon-Suk
Graff, David
Strassel, Stephanie
Martey, Nii
Kim, Myeonchul
Canavan, Alexandra
Zipperlen, George
Date (W3CDTF):2003
Description:*Introduction* Korean Telephone Conversations Complete Set was produced by Linguistic Data Consortium (LDC) catalog number LDC2003P01 and ISBN 1-58563-267-8. The complete set of Korean Telephone Conversations consists of the following: * Korean Telephone Conversations Speech * Korean Telephone Conversations Transcripts * Korean Telephone Conversations Lexicon The Korean telephone conversations were originally recorded as part of the Callfriend project. The Callfriend Korean telephone speech was collected by Linguistic Data Consortium primarily in support of the Language Identification (LID) project, sponsored by the U.S. Department of Defense. The calls were later transcribed for use in other projects. Korean Telephone Conversations Speech consists of 100 telephone conversations, 49 of which were published in 1996 as CALLFRIEND Korean, while the remaining 51 are previously unexposed calls. The recorded conversations are between native speakers of Korean and last up to 30 minutes, of which the transcribed speech covers between 15 and 18 minutes. All speakers were aware that they were being recorded. They were given no guidelines concerning what they should talk about. Once a caller was recruited to participate, he/she was given a free choice of whom to call. Most participants called family members or close friends. All calls originated in either the United States or Canada. Korean Telephone Conversations Transcripts consists of 100 text files, totalling approximately 190K words and 25K unique words. All files are in Korean orthography: orthographic Korean characters are in Hangul, encoded in KSC5601 (Wansung) system. Please follow this link for a sample transcript: txt | gif. Korean Telephone Conversations Lexicon covers the tokens occurring in the Korean Telephone Conversations Transcripts. The lexicon contains five tab-separated information fields: * orthographic form in Hangul (head-word), encoded in the KSC-5601 (Wansung) system * orthographic form in Yale romanization * pronunciation * frequency of the word in Korean Telephone Conversations Transcripts * morphological analysis of the word Please follow this link for a sample page from the lexicon: txt | gif.
Extent:Corpus size: 1435503 KB
ISBN: 1-58563-267-8
ISLRN: 124-056-444-354-0
DOI: 10.35111/d2fq-1e93
Language (ISO639):kor
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2003S07
Rights Holder:Portions © 2003 Trustees of the University of Pennsylvania.
Type (DCMI):Text
Type (OLAC):primary_text


Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2003S07
DateStamp:  2020-11-30
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Han, Na-Rae; Ko, Eon-Suk; Graff, David; Strassel, Stephanie; Martey, Nii; Kim, Myeonchul; Canavan, Alexandra; Zipperlen, George. 2003. Linguistic Data Consortium.
Terms: area_Asia country_KR dcmi_Text iso639_kor olac_primary_text

Up-to-date as of: Tue May 7 7:24:52 EDT 2024