OLAC Record

Title: Persian Lexicon
Access Rights: Rights available for: nonCommercialUse, commercialUse
Date Available (W3CDTF): 2010-09-27
Date Issued (W3CDTF): 2010-09-27
Date Modified (W3CDTF): 2010-09-27
Description: This is a Persian (Farsi) lexicon of more than 40,000 entries of non-inflected word forms. Each word is transliterated following the framework proposed by MBROLA (Text-To-Speech synthesizer). The database includes a large variety of descriptors for each entry (plural, homograph, ...). The lexicon was built from a corpus of newspaper publications collected over a period of six months from the Shargh Newspaper, a publication containing articles on diverse topics: art, culture, policy, social, sport, etc. Due to its coverage, this lexicon may be of particular interest for Persian TTS systems, as the pronunciation of Persian words cannot be derived directly from their written form because short vowels are omitted in Persian writing. The records are distributed as follows: Adjectives: 11,955; Adverbs: 2,047; Classifiers: 164; Conjunctions: 129; Indexes: 85; Names: 36,651; Numbers: 88; Verb-Past Stem: 455; Verb-Present Stem: 435; Prepositions: 223; Pronouns: 141; Semi-Sentence: 352. The lexicon is provided as an MS Access database.
ISLRN: 547-614-436-004-7
Identifier (URI): https://catalog.elra.info/en-us/repository/browse/ELRA-L0087/
Language (ISO639): fas
Medium: Not specified
Publisher: ELRA (European Language Resources Association)
Type (DCMI): Text
Type (OLAC): lexicon


Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-L0087
DateStamp:  2010-09-27
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2010. ELRA (European Language Resources Association).
Terms: dcmi_Text iso639_fas olac_lexicon

Up-to-date as of: Fri Apr 19 6:29:01 EDT 2024