We are happy to release
SinaTools - Open Source Toolkit for Arabic NLP and NLU
We are excited to release SinaTools - Open Source Toolkit for Arabic NLP and
NLU, which consists of Python APIs, command lines, online demos, and many
datasets - free for both commercial and non-commercial purposes. It outperforms
all related tools in all tasks in speed and accuracy. It includes the following
modules:
▸ Morphology Tagger: Lemmatizer, POS tagger, root tagger.
▸ WSD Tagger: Pipeline of semantic taggers: single-word WSD, multi-word WSD,
and NER
▸ Synonyms Generator: Extends a set of synonyms with more synonyms.
▸ Semantic Relatedness: Association between two sentences across various
dimensions, meaning, underlying concepts, domain-specificity, etc.
▸ Named Entity Recognition: Nested and flat NER, 21 entity types.
▸ Relation Extraction: Extract events and their arguments (agents, locations,
and dates).
▸ Diacritic-Based Matching: Decides whether two Arabic words are the same
taking into account diacratization compatibility.
▸ Utilities: A set of useful NLP methods for sentence splitting, duplicate
word removal, Arabic Jaccard similarity metrics, transliteration, and others.
Try and Download: https://sina.birzeit.edu/sinatools.
Article:
Tymaa Hammouda, Mustafa Jarrar, Mohammed Khalilia: SinaTools: Open Source
Toolkit for Arabic Natural Language Understanding
<https://www.jarrar.info/publications/HJK24.pdf>. In Proceedings of the 2024 AI
in Computational Linguistics (ACLING 2024), Procedia Computer Science, Dubai.
ELSEVIER. https://www.jarrar.info/publications/HJK24.pdf
--Mustafa
__________________________
Mustafa Jarrar, PhD
Professor of Artificial Intelligence
Chair, PhD Program in Computer Science
Birzeit University, Palestine
Page: http://www.jarrar.info <http://www.jarrar.info/>
SinaLab: https://sina.birzeit.edu <https://sina.birzeit.edu/>
_______________________________________________
Corpora mailing list -- [email protected]
https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
To unsubscribe send an email to [email protected]