[Apologies for multiple postings]
We are happy to announce that 3 new monolingual lexicons are now available in our catalogue.

DiaLEX – Egyptian (DiaLEX-EA)
https://catalog.elra.info/en-us/repository/browse/ELRA-L0206/
ISLRN: 697-328-151-668-9 http://www.islrn.org/resources/697-328-151-668-9
A comprehensive full-form lexicon of Egyptian Arabic general vocabulary (DiaLEX-EA) including 78 million entries for 31,000 lemmas with all inflected forms, enclitics, proclitics, case endings, declensions, and conjugated forms. Each entry is accompanied by full and accurate diacriticization (vocalization) as well as extensive coverage of variants. The lexicon is ideally suited to support natural language processing applications for Egyptian Arabic, especially morphological analysis and speech technology.
Quantity and size: 75,204,644 lines / 11,217 MB (11.0 GB)

DiaLEX – Emirati (DiaLEX-UA)
https://catalog.elra.info/en-us/repository/browse/ELRA-L0207/
ISLRN: 836-793-503-213-8 http://www.islrn.org/resources/836-793-503-213-8
A comprehensive full-form lexicon of Emirati Arabic general vocabulary (DiaLEX-UA) including 28 million entries for 29,000 lemmas with all inflected forms, enclitics, proclitics, case endings, declensions, and conjugated forms. Each entry is accompanied by full and accurate diacriticization (vocalization) as well as extensive coverage of variants. The lexicon is ideally suited to support natural language processing applications for Emirati Arabic, especially morphological analysis and speech technology.
Quantity and size: 24,976,871 lines / 3,841 MB (3.8 GB)

DiaLEX – Saudi Arabian Hijazi (DiaLEX-HA)
https://catalog.elra.info/en-us/repository/browse/ELRA-L0208/
ISLRN: 849-157-479-216-3 http://www.islrn.org/resources/849-157-479-216-3
A comprehensive full-form lexicon of Hijazi Arabic general vocabulary (DiaLEX-HA) including 21 million entries for 30,000 lemmas with all inflected forms, enclitics, proclitics, case endings, declensions, and conjugated forms. Each entry is accompanied by full and accurate diacriticization (vocalization) as well as extensive coverage of variants. The lexicon is ideally suited to support natural language processing applications for Hijazi Arabic, especially morphological analysis and speech technology.
Quantity and size: 20,247,655 lines / 2,835 MB (2.8 GB)

For more information on the catalogue, or if you would like to enquire about having your resources distributed by ELRA, please contact us at contact@elda.org.
_________________________________________
Visit the ELRA Catalogue of Language Resources: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates: http://www.elra.info/en/catalogues/language-resources-announcements