********************************************************************************* Second Call for Papers: The 6th workshop on: "Open-Source Arabic Corpora and Processing Tools (OSACT6) with Shared Tasks on Arabic LLMs Hallucination and Dialect to MSA Machine Translation"
Workshop: co-located with LREC-COLING 2024 | Torino (Italia) | 20-25 May, 2024
The OSACT6 Workshop invites the submission of long and short papers on current language resources, tools and technologies and Issues in the design, construction and use of Arabic language resources.
In addition to the general topics of CL, NLP and IR, the workshop will give a special emphasis on two shared tasks, namely: Arabic LLMs Hallucination and Dialect to MSA Machine Translation.
Website: https://osact-lrec.github.io/ Shared Tasks: Task 1: Arabic LLMs Hallucination Task 2: Dialect to MSA Machine Translation Important dates: Submission deadline: Feb 25, 2024 Paper acceptance notification: March 25, 2024 Camera-ready versions: March 30, 2024 OSACT 2024 day: May 25, 2024 LREC-COLING 2024 conference: 20–25 May 2024 Don’t miss this opportunity to contribute to a pioneering field!
***********************************************************************************
OSACT6 workshop encourages researchers and practitioners of Arabic language technologies, including CL, NLP and IR to share and discuss their latest research efforts, corpora, and tools. The workshop will also give special attention to Large Language Models (LLMs) and Generative AI, which is a hot topic nowadays. In addition to the general topics of CL, NLP and IR, the workshop will give a special emphasis on two shared tasks, namely: Arabic LLMs Hallucination and Dialect to MSA Machine Translation.
We are inviting papers on topics including, but not limited to, the following topics: Pre-trained Arabic language models and their applications. Surveying and evaluating the design of available Arabic corpora, their associated and processing tools. Availing new annotated corpora for NLP and IR applications such as named entity recognition, machine translation, sentiment analysis, text classification, and language learning. Evaluating the use of crowdsourcing platforms for Arabic data annotation. Open source Arabic processing toolkits. Language modeling and pre-trained models. Tokenization, normalization, word segmentation, morphological analysis, part-of-speech tagging, etc. Sentiment analysis, dialect identification, and text classification. Dialect translation. Fake news detection. Web and social media search and analytics. Issues in the design, construction, and use of Arabic LRs: text, speech, sign, gesture, image, in single or multimodal/multimedia data. Guidelines, standards, best practices, and models for LRs interoperability. Methodologies and tools for LRs construction and annotation. Methodologies and tools for extraction and acquisition of knowledge Guidelines, standards, best practices and models for LRs interoperability. Methodologies and tools for LRs construction and annotation. Methodologies and tools for extraction and acquisition of knowledge. Ontologies, terminology and knowledge representation. LRs and Semantic Web (including Linked Data, Knowledge Graphs, etc.).
Submissions for both short and long papers will be made directly via START, following submission guidelines issued by LREC-COLING 2024. Paper submission instructions: https://lrec-coling-2024.org/authors-kit/ Paper submission: https://softconf.com/lrec-coling2024/osact2024/ For full submission details please refer to our workshop website here. Contact email: OSACT.W...@gmail.com The OSACT 2024 Organizing Committee
Hend Al-Khalifa, King Saud University, KSA; Hamdy Mubarak, Qatar Computing Research Institute, Qatar; Kareem Darwish, aiXplain Inc., US; Tamer Elsayed, Qatar University, Qatar; Mona Ali, Northeastern University, Canada Looking forward to your participation and to seeing you in LERC-COLING in May 2024!
************************************************************************************