[Apologies for multiple postings]
We are happy to announce that 3 new monolingual lexicons are now
available in our catalogue.
DiaLEX – Egyptian (DiaLEX-EA)
<https://catalog.elra.info/en-us/repository/browse/ELRA-L0206/>
ISLRN: 697-328-151-668-9 <http://www.islrn.org/resources/697-328-151-668-9>
A comprehensive full-form lexicon of Egyptian Arabic general vocabulary
(DiaLEX-EA) including 78 million entries for 31,000 lemmas with all
inflected forms, enclitics, proclitics, case endings, declensions, and
conjugated forms.
Each entry is accompanied by a full and accurate diacriticization
(vocalization) as well as an extensive coverage of variants. The lexicon
is ideally suited to support natural language processing applications
for Egyptian Arabic, especially
morphological analysis and speech technology.
Quantity and size: 75,204,644 lines / 11,217 MB (11.0 GB)
DiaLEX – Emirati (DiaLEX-UA)
<https://catalog.elra.info/en-us/repository/browse/ELRA-L0207/>
ISLRN: 836-793-503-213-8 <http://www.islrn.org/resources/836-793-503-213-8>
A comprehensive full-form lexicon of Emirati Arabic general vocabulary
(DiaLEX-UA) including 28 million entries for 29,000 lemmas with all
inflected forms, enclitics, proclitics, case endings, declensions, and
conjugated forms.
Each entry is accompanied by a full and accurate diacriticization
(vocalization) as well as an extensive coverage of variants. The lexicon
is ideally suited to support natural language processing applications
for Emirati Arabic, especially
morphological analysis and speech technology.
Quantity and size: 24,976,871 lines / 3,841 MB (3.8 GB)
DiaLEX – Saudi Arabian Hijazi (DiaLEX-HA)
<https://catalog.elra.info/en-us/repository/browse/ELRA-L0208/>
ISLRN: 849-157-479-216-3 <http://www.islrn.org/resources/849-157-479-216-3>
A comprehensive full-form lexicon of Hijazi Arabic general vocabulary
(DiaLEX-HA) including 21 million entries for 30,000 lemmas with all
inflected forms, enclitics, proclitics, case endings, declensions, and
conjugated forms.
Each entry is accompanied by a full and accurate diacriticization
(vocalization) as well as an extensive coverage of variants. The lexicon
is ideally suited to support natural language processing applications
for Hijazi Arabic, especially
morphological analysis and speech technology.
Quantity and size: 20,247,655 lines / 2,835 MB (2.8 GB)
For more information on the catalogue or if you would like to enquire
about having your resources distributed by ELRA, please contact us
<mailto:[email protected]>.
_________________________________________
Visit the ELRA Catalogue of Language Resources <http://catalog.elra.info>
Visit the Universal Catalogue <http://universal.elra.info>
Archives
<http://www.elra.info/en/catalogues/language-resources-announcements> of
ELRA Language Resources Catalogue Updates
_______________________________________________
Corpora mailing list -- [email protected]
https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
To unsubscribe send an email to [email protected]