[Apologies for multiple postings]
We are happy to announce that 1 new lexicon and 1 new evaluation package
are available in our catalogue.
*Arab Full Names Database
<https://catalog.elra.info/en-us/repository/browse/ELRA-L0209/>***
ISLRN: 548-506-480-213-6 <http://www.islrn.org/resources/548-506-480-213-6>
This database consists of over six million Arab Full Names comprising
real people Arabic names (not foreign names), including phonological
data such as romanization and optional vowel diacritics, as well as
English equivalents. If heteronyms (same spelling, different
pronunciations, like Muhammad and Muhammid) are included, the number of
entries is approximately 43.9 million.
*MiLQ: Mixed-Language Query Test Set for Bilingual Web Search –
Evaluation Package
<https://catalog.elra.info/en-us/repository/browse/ELRA-E0047/>***
ISLRN:200-586-423-805-2 <http://www.islrn.org/resources/200-586-423-805-2>
MiLQ is a benchmark of mixed-language (code-switched) search queries
created by bilingual speakers for evaluating Information Retrieval with
mixed-language queries. It provides query versions where English
expressions are embedded within native-language structures for the
following languages: Swahili, Somali, Finnish, German, French, Chinese,
Persian and Russian.
For more information on the catalogue or if you would like to enquire
about having your resources distributed by ELRA, please *contact us*
<mailto:[email protected]>.
_________________________________________
Visit the *ELRA Catalogue of Language Resources* <http://catalog.elra.info>
*Archives *
<https://www.elra.info/catalogues/language-resources-announcements/>of
ELRA Language Resources Catalogue Updates
_______________________________________________
Corpora mailing list -- [email protected]
https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
To unsubscribe send an email to [email protected]