OLAC Record
oai:catalogue.elra.info:ELRA-S0100

Metadata
Title:MHATLex
Access Rights: Rights available for: nonCommercialUse, commercialUse
Date Available (W3CDTF):2001-04-06
Date Issued (W3CDTF):2001-04-06
Date Modified (W3CDTF):2017-06-26
Description:MHATLex is a new enhanced lexical resource for written and speech automatic processing for French. It is derived from BDLex (see ELRA-S0004).It contains three levels of representation: - Syntactic level: S - Phonological word level: W - Phonetic level: P At the W level, a word has two representations: - input representation (W representation) where words are simply imported from the lexicon, - output representation (W' or phonotypical) where words have the phonotypical representation imposed by their context in the sentence. The lexicons contain inflected words (among which canonical words).MHATLexSt (& BDLex) MHATLexW: about 50,000 entries (canonical) & 440,000 entries (inflected)MHATLexW': about 81,000 entries (canonical) & 854,000 entries (inflected)Words are represented with their orthography, pronunciation, morpho-syntactic features, and frequency indicator.Only the pronunciation related part changes according to the lexicon (except if the user want to generate his own lexicon by skipping some features). Four lexicons can be generated from MHATLex: - MHATLexW : this is the central lexical resource which enables to generate the other lexicons - MHATLexW' (or MHATLexPht) : gives the word representations for each pertinent context. - MHATLexSt : with standard and simplified format of the pronunciation. - BDLex (or BDLex50) : already distributed by ELDA (ELRA-S0003 and S0004). The current BDLex, derived from MHATLexW, contains some updates. When purchasing MHATLex, the package includes BDLex (ELRA-S0004).Integrity checks were made and the lexicon was parsed using nsgmls.
Identifier:ELRA-S0100
ISLRN: 740-149-502-864-8
Identifier (URI):https://catalog.elra.info/en-us/repository/browse/ELRA-S0100/
Language:French
Language (ISO639):fra
Medium:Not specified
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Sound
Type (OLAC):lexicon

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0100
DateStamp:  2001-04-06
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2001. ELRA (European Language Resources Association).
Terms: area_Europe country_FR dcmi_Sound dcmi_Text iso639_fra olac_lexicon


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-S0100
Up-to-date as of: Fri Apr 19 6:28:14 EDT 2024