We are happy to release
SinaTools - Open Source Toolkit for Arabic NLP and NLU
We are excited to release SinaTools - Open Source Toolkit for Arabic NLP and NLU, which consists of Python APIs, command lines, online demos, and many datasets - free for both commercial and non-commercial purposes. It outperforms all related tools in all tasks in speed and accuracy. It includes the following modules: ⸠Morphology Tagger: Lemmatizer, POS tagger, root tagger. ⸠WSD Tagger: Pipeline of semantic taggers: single-word WSD, multi-word WSD, and NER ⸠Synonyms Generator: Extends a set of synonyms with more synonyms. ⸠Semantic Relatedness: Association between two sentences across various dimensions, meaning, underlying concepts, domain-specificity, etc. ⸠Named Entity Recognition: Nested and flat NER, 21 entity types. ⸠Relation Extraction: Extract events and their arguments (agents, locations, and dates). ⸠Diacritic-Based Matching: Decides whether two Arabic words are the same taking into account diacratization compatibility. ⸠Utilities: A set of useful NLP methods for sentence splitting, duplicate word removal, Arabic Jaccard similarity metrics, transliteration, and others.
Try and Download: https://sina.birzeit.edu/sinatools.
Article: Tymaa Hammouda, Mustafa Jarrar, Mohammed Khalilia: SinaTools: Open Source Toolkit for Arabic Natural Language Understanding https://www.jarrar.info/publications/HJK24.pdf. In Proceedings of the 2024 AI in Computational Linguistics (ACLING 2024), Procedia Computer Science, Dubai. ELSEVIER. https://www.jarrar.info/publications/HJK24.pdf
--Mustafa __________________________ Mustafa Jarrar, PhD Professor of Artificial Intelligence Chair, PhD Program in Computer Science Birzeit University, Palestine Page: http://www.jarrar.info http://www.jarrar.info/ SinaLab: https://sina.birzeit.edu https://sina.birzeit.edu/