SIGUL

sigul@list.elra.info

320 discussions

Third Call for Papers: NLPAICS'2026 – NLP & AI for Cyber Security (11-12 June, Alicante, Spain)
by ernesto.estevanell＠ua.es 30 Jan '26

30 Jan '26

Second International Conference on Natural Language Processing and Artificial Intelligence for Cyber Security (NLPAICS'2026) University of Alicante, Alicante, Spain 11 and 12 June 2026 https://nlpaics2026.gplsi.es/ Third Call for Papers Recent advances in Natural Language Processing (NLP), Deep Learning and Large Language Models (LLMs) have resulted in improved performance of applications. In particular, there has been a growing interest in employing AI methods in different Cyber Security applications. In today's digital world, Cyber Security has emerged as a heightened priority for both individual users and organisations. As the volume of online information grows exponentially, traditional security approaches often struggle to identify and prevent evolving security threats. The inadequacy of conventional security frameworks highlights the need for innovative solutions that can effectively navigate the complex digital landscape to ensure robust security. NLP and AI in Cyber Security have vast potential to significantly enhance threat detection and mitigation by fostering the development of advanced security systems for autonomous identification, assessment, and response to security threats in real time. Recognising this challenge and the capabilities of NLP and AI approaches to fortify Cyber Security systems, the Second International Conference on Natural Language Processing (NLP) and Artificial Intelligence (AI) for Cyber Security (NLPAICS'2026) continues the tradition from NLPAICS'2024 to be a gathering place for researchers in NLP and AI methods for Cyber Security. We invite contributions that present the latest NLP and AI solutions for mitigating risks in processing digital information. Conference topics The conference invites submissions on a broad range of topics related to the employment of NLP and AI (and in general, language studies and models) for Cyber Security, including but not limited to: _Societal and Human Security and Safety_ * Content Legitimacy and Quality * Detection and mitigation of hate speech and offensive language * Fake news, deepfakes, misinformation and disinformation * Detection of machine-generated language in multimodal context (text, speech and gesture) * Trust and credibility of online information * User Security and Safety * Cyberbullying and identification of internet offenders * Monitoring extremist fora * Suicide prevention * Clickbait and scam detection * Fake profile detection in online social networks * Technical Measures and Solutions * Social engineering identification, phishing detection * NLP for risk assessment * Controlled languages for safe messages * Prevention of malicious use of ai models * Forensic linguistics * Human Factors in Cyber Security _Speech Technology and Multimodal Investigations for Cyber Security_ * Voice-based security: Analysis of voice recordings or transcripts for security threats * Detection of machine-generated language in multimodal context (text, speech and gesture) * NLP and biometrics in multimodal context _Data and Software Security_ * Cryptography * Digital forensics * Malware detection, obfuscation * Models for documentation * NLP for data privacy and leakage prevention (DLP) * Addressing dataset "poisoning" attacks _Human-Centric Security and Support_ * Natural language understanding for chatbots: NLP-powered chatbots for user support and security incident reporting * User behaviour analysis: analysing user-generated text data (e.g., chat logs and emails) to detect insider threats or unusual behaviour * Human supervision of technology for Cyber Security _Anomaly Detection and Threat Intelligence_ * Text-Based Anomaly Detection * Identification of unusual or suspicious patterns in logs, incident reports or other textual data * Detecting deviations from normal behaviour in system logs or network traffic * Threat Intelligence Analysis * Processing and analysing threat intelligence reports, news, articles and blogs on latest Cyber Security threats * Extracting key information and indicators of compromise (IoCs) from unstructured text _Systems and Infrastructure Security_ * Systems Security * Anti-reverse engineering for protecting privacy and anonymity * Identification and mitigation of side-channel attacks * Authentication and access control * Enterprise-level mitigation * NLP for software vulnerability detection * Malware Detection through Code Analysis * Analysing code and scripts for malware * Detection using NLP to identify patterns indicative of malicious code _Financial Cyber Security_ * Financial fraud detection * Financial risk detection * Algorithmic trading security * Secure online banking * Risk management in finance * Financial text analytics _Ethics, Bias, and Legislation in Cyber Security_ * Ethical and Legal Issues * Digital privacy and identity management * The ethics of NLP and speech technology * Explainability of NLP and speech technology tools * Legislation against malicious use of AI * Regulatory issues * Bias and Security * Bias in Large Language Models (LLMs) * Bias in security related datasets and annotations _Datasets and resources for Cyber Security Applications_ _Specialised Security Applications and Open Topics_ * Intelligence applications * Emerging and innovative applications in Cyber Security _Special Theme Track - Future of Cyber Security in the Era of LLMs and Generative AI_ NLPAICS 2026 will feature a special theme track with the goal of stimulating discussion around Large Language Models (LLMs), Generative AI and ensuring their safety. The latest generation of LLMs, such as ChatGPT, Gemini, DeepSeek, LLAMA and open-source alternatives, has showcased remarkable advancements in text and image understanding and generation. However, as we navigate through uncharted territory, it becomes imperative to address the challenges associated with employing these models in everyday tasks, focusing on aspects such as fairness, ethics, and responsibility. The theme track invites studies on how to ensure the safety of LLMs in various tasks and applications and what this means for the future of the field. The possible topics of discussion include (but are not limited to) the following: * Detection of LLM-generated language in multimodal context (text, speech and gesture) * LLMs for forensic linguistics * Bias in LLMs * Safety benchmarks for LLMs * Legislation against malicious use of LLMs * Tools to evaluate safety in LLMs * Methods to enhance the robustness of language models Submissions and Publication NLPAICS welcomes high-quality submissions in English, which can take two forms: * Regular long papers: These can be up to eight (8) pages long, presenting substantial, original, completed, and unpublished work. * Short (poster) papers: These can be up to four (4) pages long and are suitable for describing small, focused contributions, ongoing research, negative results, system demonstrations, etc. Short papers will be presented as part of a poster session. The conference will not consider and evaluate abstracts only. Accepted papers, including both long and short papers, will be published as e-proceedings with ISBN will be available online on the conference website at the time of the conference and are expected to be uploaded into the ACL Anthology. To prepare your submission, please make sure to use the NLPAICS 2026 style files available here: LaTeX: NLPAICS_2026_LaTeX.zip [1] Overleaf: https://www.overleaf.com/read/sgwmrzbmjfhc#aeea77 Word: https://nlpaics2026.gplsi.es/wp-content/uploads/2025/11/NLPAICS2026_Proceed… Papers should be submitted through Softconf/START using the following link: https://softconf.com/p/nlpaics2026/user/ The conference will feature a student workshop, and awards will be offered to the authors of best papers. Important dates * Submissions due: 16 March 2026 * Reviewing process: 1 April - 30 April 2026 * Notification of acceptance: 5 May 2026 * Camera-ready due: 19 May 2026 * Conference camera-ready proceedings ready 1 June 2026 * Conference: 11-12 June 2026 Organisation Conference Chairs Ruslan Mitkov (University of Alicante) Rafael Muñoz (University of Alicante) Programme Committee Chairs Elena Lloret (University of Alicante) Tharindu Ranasinghe (Lancaster University) Publication Chair Ernesto Estevanell (University of Alicante) Sponsorship Chair Andres Montoyo (University of Alicante) Student Workshop Chair Salima Lamsiyah (University of Luxembourg) Best Paper Award Chair Saad Ezzini (King Fahd University of Petroleum & Minerals) Publicity Chair Beatriz Botella (University of Alicante) Social Programme Chair Alba Bonet (University of Alicante) Venue The Second International Conference on Natural Language Processing and Artificial Intelligence for Cyber Security (NLPAICS'2026) will take place at the University of Alicante and is organised by the University of Alicante GPLSI research group. Further information and contact details The follow-up calls will list keynote speakers and members of the programme committee once confirmed. The conference website is https://nlpaics2026.gplsi.es/ and will be updated on a regular basis. For further information, please email nlpaics2026(a)dlsi.ua.es Registration will open in March 2026. Links: ------ [1] http://summer-school.gplsi.es/NLPAICS_2026_LaTeX.zip

1 0

CfP: 9th Workshop on the Use of Computational Methods in the Study of Endangered Languages (ComputEL-9)
by Antti Arppe 30 Jan '26

30 Jan '26

ComputEL-9: Ninth Workshop on the Use of Computational Methods in the Study of Endangered Languages First CALL FOR PAPERS for REGULAR SESSION Submission deadline: March 20, 2026 Submission link: https://softconf.com/acl2026/ComputEL2026 ComputEL-9 will be co-located with ACL 2026 in San Diego, California. We encourage submissions that explore the interface and intersection of computational linguistics, documentary linguistics, and community-based efforts in language revitalization and reclamation. This includes submissions that: (i) demonstrate new methods or technologies for tasks or applications focused on low-resource settings, and in particular, endangered languages, (ii) examine the use of specific methods in the analysis of data from low-resource languages, or demonstrate new methods for analysis of such data, oriented toward the goals of language reclamation and revitalization, (iii) propose new models for the collection, management, and mobilization of language data in community settings, with attention to e.g. issues of data sovereignty and community protocols, (iv) explore concrete steps for a more fruitful interaction among computer scientists, documentary linguists, and language communities. IMPORTANT DATES 20-Mar-2026 Deadline for submission of papers or extended abstracts 1-May-2026 Notification of Acceptance Early May Camera-ready papers due (Exact date TBD by ACL) First week of July Workshop (Exact date TBD by ACL) PRESENTATIONS Presentation of accepted papers will be in both oral sessions and a poster session. The decision on whether a presentation for a paper will be oral and/or poster will be made by the Organizing Committee on the advice of the Program Committee, taking into account the subject matter and how the content might be best conveyed. Oral and poster presentations will not be distinguished in the Proceedings. SUBMISSIONS In line with our goal of reaching multiple overlapping communities, authors can submit to one of the workshop’s tracks: (a) language community perspective and (b) academic perspective. All submissions must be anonymous following ACL guidelines and will be peer-reviewed by the scientific Program Committee. PROCEEDINGS The authors of selected accepted full papers (long or short) will be invited by the Organizing Committee to submit their papers for online publication via the open-access ACL Anthology. Final versions of long and short papers will be allotted one additional page (altogether 5 and 9 pages) excluding references. Proceedings papers should be revised and improved versions of the work that was submitted for, and which underwent, review. Any revisions should concern responses to reviewer comments or the addition of relevant details and clarifications, but not entirely new, unreviewed content. ADDITIONAL AND CONTACT INFORMATION Please see the ComputEL-9 website for further information: https://computel-workshop.org/computel-9/ Organizing Committee Email: computel.workshop(a)gmail.com -- ====================================================================== Antti Arppe - Ph.D (General Linguistics), M.Sc. (Engineering) Professor of Quantitative Linguistics Director, Alberta Language Technology Lab (ALTLab) Project Director, 21st Century Tools for Indigenous Languages (21C) Department of Linguistics, University of Alberta Algonquian Studies Association - Secretary-Treasurer E-mail: arppe(a)ualberta.ca, antti.arppe(a)iki.fi WWW: www.ualberta.ca/~arppe, altlab.ualberta.ca Mānahtu ina rēdûti ihza ummânūti ihannaq - dulum ugulak úmun ingul ----------------------------------------------------------------------

1 0

Second CfP Seventh Workshop on Resources for African Indigenous Languages (RAIL)@LREC
by Menno Van Zaanen 28 Jan '26

28 Jan '26

Second Call for Papers Seventh Workshop on Resources for African Indigenous Languages (RAIL) Co-located with LREC 2026 RAIL Workshop date: 12 May 2026 RAIL website: https://sadilar.org/en/seventh-workshop-on-resources-for-african-indigenous… Submission link for the RAIL workshop: https://softconf.com/lrec2026/RAIL2026/ LREC Conference dates: 11-16 May 2026 LREC website: https://www.elra.info/lrec2026/ Venue: Palau de Congressos de Palma, Palma de Mallorca (Spain) The Resources for African Indigenous Languages (RAIL) workshop provides an interdisciplinary platform for researchers working on resources such as data collections and annotations, Human Language Technologies (HLT) and Natural Language Processing (NLP) tools, and their applications, specifically targeted towards African indigenous languages. In particular, it aims to create the conditions for the emergence of a scientific community of practice that focuses on data, as well as computational linguistic tools specifically designed for or applied to indigenous languages found in Africa. The seventh Resources for African Indigenous Languages (RAIL) workshop will be co-located with the Language Resources and Evaluation Conference (LREC) 2026 in Palau de Congressos de Palma, Palma, Mallorca (Spain). Many African languages are under-resourced while only a few are considered to be somewhat better resourced. These languages often share interesting properties such as writing systems, making them different from most high-resourced languages. From a computational perspective, these languages lack enough corpora to undertake high level development of NLP and HLT tools, which in turn impedes the development of African languages in these areas. During previous workshops, it was noted that the problems and solutions presented were not only applicable to African languages but were also relevant to many other low-resource languages across the world. Because these languages share similar challenges, this workshop provides researchers with opportunities to work collaboratively on issues of language resource development and learn from each other. The RAIL workshop has several aims. First, the workshop brings together researchers who work on African indigenous languages, forming a community of practice for people working on indigenous languages. Second, the workshop aims to reveal currently unknown or unpublished existing resources (corpora, NLP tools, and applications), resulting in a better overview of the current state-of-the-art, and also allows for discussions on novel, desired resources for future research in this area. Third, it enhances sharing of knowledge on the development of low-resource languages. Finally, it enables discussions on how to improve the quality as well as availability of the resources. The workshop theme is “Creating resources for less-resourced African languages”, but submissions on any topic related to properties of African indigenous languages (including related non-African languages) may be accepted. Suggested topics include (but are not limited to) the following: * Digital representations of linguistic structures * Descriptions of corpora or other data sets of African indigenous languages * Building resources for (under-resourced) African indigenous languages * Developing and using African indigenous languages in the digital age * Effectiveness of digital technologies for the development of African indigenous languages * Revealing unknown or unpublished existing resources for African indigenous languages * Developing desired resources for African indigenous languages * Improving quality, availability and accessibility of African indigenous language resources * Applications that make use of data collections of African indigenous languages Submission requirements: We invite papers on original, unpublished work related to the topics of the workshop. Submissions, presenting completed work, should adhere to the LREC conference requirements. These requirements are described in LREC’s authors kit: https://lrec2026.info/authors-kit/. The submission should be double blind and each submission should be between four and eight pages. Only oral papers should be submitted. The maximum number of pages excludes a compulsory ethics statement, discussion on limitations, and references and optional acknowledgements, as well as data and code availability statements if applicable. Appendices or supplementary material are allowed, but this information will not necessarily be taken into account during the review process. The submission link for the RAIL workshop: https://softconf.com/lrec2026/RAIL2026/ Authors are encouraged to upload their datasets to the SADiLaR repository: https://repo.sadilar.org/. In case of difficulties uploading the datasets, please reach out to Benito Trollip (benito.trollip(a)nwu.ac.za). Important dates: Submission deadline: 23 February 2026 AoE Date of notification: 11 March 2026 AoE Camera ready copy deadline: 30 March 2026 AoE Workshop: 12 May 2026 Organising Committee: Muzi Matfunjwa, South African Centre for Digital Language Resources (SADiLaR), South Africa Mmasibidi Setaka, South African Centre for Digital Language Resources (SADiLaR), South Africa Rooweither Mabuya, South African Centre for Digital Language Resources (SADiLaR), South Africa Menno van Zaanen, South African Centre for Digital Language Resources (SADiLaR), South Africa -- Prof Menno van Zaanen menno.vanzaanen(a)nwu.ac.za Professor in Digital Humanities South African Centre for Digital Language Resources https://www.sadilar.org ________________________________ NWU PRIVACY STATEMENT: http://www.nwu.ac.za/it/gov-man/disclaimer.html DISCLAIMER: This e-mail message and attachments thereto are intended solely for the recipient(s) and may contain confidential and privileged information. Any unauthorised review, use, disclosure, or distribution is prohibited. If you have received the e-mail by mistake, please contact the sender or reply e-mail and delete the e-mail and its attachments (where appropriate) from your system. ________________________________

1 0

2nd Call for Papers and Mentorship Provision — Workshop on Dialects in NLP: A Resource Perspective (at LREC 2026)
by s.bompolas＠athenarc.gr 19 Jan '26

19 Jan '26

2nd Call for Papers and Mentorship Provision Dialects in NLP: A Resource Perspective (DialRes-LREC26) Workshop at LREC 2026 — Palma de Mallorca, Spain Dialectal and non-standard varieties pose persistent challenges for linguistic resource development. While in-depth study and large-scale resource creation for dominant or standard varieties have driven major advances in language technology, linguistic resources that adequately represent dialectal variation remain scarce. It therefore remains an open question whether standard-centric practices address dialectal variation or instead create new problems for dialects. DialRes-LREC26 invites submissions on the creation, analysis, and evaluation of dialectal resources, including—but not limited to—work that critically examines how standard-centric methodologies impact dialects in the development of linguistic resources and models. We especially encourage contributions addressing the consequences of such practices for speech and morphosyntactic modelling, OCR of dialectal and historical texts, orthographic normalisation and homogenisation, annotation practices and lemmatisation strategies that abstract away or suppress dialectal forms, as well as analyses of how these choices affect dialects and their communities methodologically, economically, and socially. The workshop focuses on problems, limitations, and trade-offs in developing dialectal resources from a linguistic perspective, while encouraging the creation and evaluation of resources in formats that enable reuse by the NLP community. Workshop Topics • Development and evaluation of dialectal oral and textual resources • Orthographic normalisation and homogenisation, including their impact on dialectal variation • Dialects vs. standard language varieties in annotation frameworks • Cross-lingual and cross-dialectal transfer and model adaptation • Resource scalability issues and techniques • Use and limitations of large language models (LLMs) in dialectal resource development • OCR for dialectal, non-standard, and historical texts: challenges, errors, and downstream effects • Resources for, and applications supporting, dialect revitalisation and preservation • Dialectal studies and teaching from a resource-oriented perspective • Working on dialectal resources: academic, financial, legal, and societal issues • Enabling and empowering dialect communities to develop their own resources Author Support The workshop will offer individual tutoring and mentoring upon request. Interested authors should contact the organizers at least 10 days before the paper submission deadline at: dialres-lrec26(a)googlegroups.com This support is addressed especially to early-career researchers and contributors working with dialectal data who have limited or no prior experience in developing NLP-oriented resources. Submission Information Instructions for Authors Submissions are electronic, using the Softconf START conference management system via the link: https://softconf.com/lrec2026/DialRes. They must be 4 to 8 pages long (excluding references and potential Ethics Statements) and follow the LREC stylesheet, available on the conference website on the Author’s kit page Author’s Kit. All templates are also available from this page. Important Dates • 20 February 2026 — Submission Deadline • 11 March 2026 — Notification of Acceptance • 28 March 2026 — Camera-ready Papers Due Resubmissions from the LREC Main Conference It will also be possible to submit papers that were rejected from the LREC 2026 main conference to DialRes 2026. Such submissions must be revised to fit the scope and format of the workshop and must comply with the same anonymization requirements. Endorsements The workshop is endorsed by UniDive COST Action CA21167 and Archimedes Athena R.C. Organizing Committee • Antonios Anastasopoulos — George Mason University / Archimedes–Athena RC • Stella Markantonatou — ILSP / Archimedes–Athena RC • Angela Ralli — University of Patras / Archimedes–Athena RC • Marcos Zampieri — George Mason University • Stavros Bompolas — Archimedes–Athena RC • Vivian Stamou — Archimedes–Athena RC

1 0

Third and Final Call for Papers: The Second Workshop on Language Models for Low-Resource Languages (LoResLM 2026 @ EACL)
by Ranasinghe, Tharindu 02 Jan '26

02 Jan '26

𝗧𝗵𝗶𝗿𝗱 𝗮𝗻𝗱 𝗙𝗶𝗻𝗮𝗹 𝗖𝗮𝗹𝗹 𝗳𝗼𝗿 𝗣𝗮𝗽𝗲𝗿𝘀 - 𝗧𝗵𝗲 𝗦𝗲𝗰𝗼𝗻𝗱 𝗪𝗼𝗿𝗸𝘀𝗵𝗼𝗽 𝗼𝗻 𝗟𝗮𝗻𝗴𝘂𝗮𝗴𝗲 𝗠𝗼𝗱𝗲𝗹𝘀 𝗳𝗼𝗿 𝗟𝗼𝘄-𝗥𝗲𝘀𝗼𝘂𝗿𝗰𝗲 𝗟𝗮𝗻𝗴𝘂𝗮𝗴𝗲𝘀 [Workshop website - https://loreslm.github.io/home] [CFP - https://loreslm.github.io/cfp] [Submissions - https://openreview.net/group?id=eacl.org/EACL/2026/Workshop/LoResLM] Neural language models have revolutionised natural language processing (NLP) and have provided state-of-the-art results for many tasks. However, their effectiveness is largely dependent on the pre-training resources. Therefore, language models (LMs) often struggle with low-resource languages in both training and evaluation. Recently, there has been a growing trend in developing and adopting LMs for low-resource languages. Supporting this important shift, LoResLM aims to provide a forum for researchers to share and discuss their ongoing work on LMs for low-resource languages. 𝗧𝗼𝗽𝗶𝗰𝘀 LoResLM 2026 invites submissions on a broad range of topics related to the development and evaluation of neural language models for low-resource languages. We welcome research that explores modalities beyond text and encourage work on low-resource dialects in addition to major language varieties. Topics of interest include, but are not limited to: • Building language models for low-resource languages. • Adapting/extending existing language models/large language models for low-resource languages. • Corpora creation and curation technologies for training language models/large language models for low-resource languages. • Benchmarks to evaluate language models/large language models in low-resource languages. • Prompting/in-context learning strategies for low-resource languages with large language models. • Review of available corpora to train/fine-tune language models/large language models for low-resource languages. • Multilingual/cross-lingual language models/large language models for low-resource languages. • Multimodal language models/large language models for low-resource languages • Applications of language models/large language models for low-resource languages (i.e. machine translation, chatbots, content moderation, etc.) 𝗦𝘂𝗯𝗺𝗶𝘀𝘀𝗶𝗼𝗻 𝗚𝘂𝗶𝗱𝗲𝗹𝗶𝗻𝗲𝘀 We follow the EACL 2026 standards for submission format and guidelines. LoResLM 2026 invites submissions of long papers up to 8 pages and short papers up to 4 pages. These page limits only apply to the main body of the paper. At the end of the paper (after the conclusions but before the references), papers need to include a mandatory section discussing the limitations of the work and, optionally, a section discussing ethical considerations. Papers can include unlimited pages of references and an appendix. To prepare your submission, please make sure to use the EACL 2026 style files available here: • LaTeX - https://github.com/acl-org/acl-style-files • Overleaf - https://www.overleaf.com/latex/templates/association-for-computational-ling… Papers should be submitted through OpenReview using the following link: https://openreview.net/group?id=eacl.org/EACL/2026/Workshop/LoResLM 𝗜𝗺𝗽𝗼𝗿𝘁𝗮𝗻𝘁 𝗗𝗮𝘁𝗲𝘀 • Paper submission: 6th January 2026 • Notification of acceptance: 28th January 2026 • Camera-ready submission: 3rd February 2026 • Workshop: 29th March 2026 @ EACL 𝗩𝗲𝗻𝘂𝗲 LoResLM 2026 will be held in conjunction with EACL 2026 in Rabat, Morocco. 𝗣𝗿𝗼𝗰𝗲𝗲𝗱𝗶𝗻𝗴𝘀 Proceedings of the workshop will appear in the ACL Anthology. For the past proceedings, please refer https://scholar.google.co.uk/citations?user=rvm3HOgAAAAJ&hl=en 𝗞𝗲𝘆𝗻𝗼𝘁𝗲 𝗦𝗽𝗲𝗮𝗸𝗲𝗿 Prof Barbara Plank - Full professor and chair for AI and Computational Linguistics at Ludwig-Maximilians-Universität München, Head of the Munich AI and NLP (MaiNLP) lab, and co-director of the Centre for Information and Language Processing (CIS) 𝗣𝗿𝗼𝗴𝗿𝗮𝗺𝗺𝗶𝗻𝗴 𝗖𝗼𝗺𝗺𝗶𝘁𝘁𝗲𝗲 David Ifeoluwa Adelani - McGill School of Computer Science, Canada Idris Abdulmumin - University of Pretoria, South Africa Godfred Agyapong - University of Florida, USA Isuri Anuradha - Lancaster University, UK Laura Bernardy - University of Luxembourg, Luxembourg Ana-Maria Bucur - University of Lugano, Switzerland Eleftheria Briakou - Google Tommaso Caselli - University of Groningen, Netherlands Çağrı Çöltekin - University of Tübingen, Germany Charibeth Ko Cheng - De La Salle University, Philippines Claudiu Creanga - University of Bucharest Sourabh Deoghare - Indian Institute of Technology, Bombay, India Bosheng Ding - Nanyang Technological University, Singapore Alphaeus Dmonte - George Mason University, USA Daan van Esch - Google Ignatius Ezeani - Lancaster University, UK Anna Furtado - University of Galway, Ireland Ona de Gibert - University of Helsinki, Finland Amal Htait - Aston University, UK Diptesh Kanojia - University of Surrey, UK Jaroslav Kopčan - Kempelen Institute of Intelligent Technologies, Slovakia Constantine Lignos - Brandeis University, USA Cedric Lothritz - Luxembourg Institute of Science and Technology, Luxembourg Anne-Marie Lutgen - University of Luxembourg, Luxembourg Sheng Li - Institute of Science Tokyo, Japan Veronika Lipp - Hungarian Research Centre for Linguistics, Hungary Vukosi Marivate - University of Pretoria, South Africa Muhidin Mohamed - Aston University, UK Simon Münker - Trier University, Germany Abiodun Modupe - University of Pretoria, South Africa Fred Philippy - University of Luxembourg, Luxembourg Md Nishat Raihan - George Mason University, USA Mariana Romanyshyn - Grammarly Guokan Shang - Mohamed bin Zayed University of Artificial Intelligence, France Ravi Shekhar - University of Essex, UK Archchana Sindhujan - University of Surrey, UK Hristo Tanev - Joint Research Centre, European Commission Uthayasanker Thayasivam - University of Moratuwa, Sri Lanka Raúl Vázquez - University of Helsinki, Finland Taro Watanabe - Nara Institute of Science and Technology, Japan- Zheng Xin Yong - Brown University, USA Alexandra Zbaganu - University of Bucharest, Romania 𝗢𝗿𝗴𝗮𝗻𝗶𝘀𝗶𝗻𝗴 𝗖𝗼𝗺𝗺𝗶𝘁𝘁𝗲𝗲 Hansi Hettiarachchi – Lancaster University, UK Tharindu Ranasinghe – Lancaster University, UK Alistair Plum – University of Luxembourg, Luxembourg Damith Premasiri – Lancaster University, UK Fiona Anting Tan – National University of Singapore, Singapore Lasitha Uyangodage – University of Münster, Germany 𝗔𝗱𝘃𝗶𝘀𝗼𝗿𝘀 Paul Rayson – Lancaster University, UK Ruslan Mitkov – Lancaster University, UK Mohamed Gaber – Queensland University of Technology, Australia 𝗦𝘂𝗽𝗽𝗼𝗿𝘁𝗲𝗱 𝗯𝘆 The workshop is supported in part by the Artificial Intelligence Journal, which promotes and disseminates AI research. 𝗖𝗼𝗻𝘁𝗮𝗰𝘁 𝘂𝘀 Contact us through loreslm.contact(a)gmail.com. Follow us on social media • LinkedIn - https://www.linkedin.com/company/loreslm/ • X - https://x.com/LoResLM2026 • BlueSky - https://bsky.app/profile/loreslm.bsky.social Best Regards Tharindu Ranasinghe, on behalf of the organising committee, LoResLM 2026 Dr Tharindu Ranasinghe | Lecturer in Security and Protection Science School of Computing and Communications | Lancaster University www.lancaster.ac.uk<https://www.lancaster.ac.uk/>

1 0

2cfp: Speech Language Models in Low-Resource Settings (SPEAKABLE-2026) at LREC2026
by Nina H-Kivanani 22 Dec '25

22 Dec '25

****APOLOGIES FOR CROSS-POSTING**** *2st CfP: SPEAKABLE 2026* 📍*Location:* Palau de Congressos de Palma, Palma de Mallorca (Spain) 🌐 Website: https://speakable-2026.github.io/ We are pleased to announce the upcoming full-day *SPEAKABLE 2026* Workshop on Speech Language Models in Low-Resource Settings: Performance, Evaluation, and Bias Analysis, co-located with LREC 2026 in Palma de Mallorca. This workshop brings together researchers, practitioners, and industry experts working to advance speech technology for under-resourced languages. We invite contributions that address the unique challenges and opportunities in this space. *Workshop Topics of Interest* We encourage submissions on (but not limited to): - Performance of speech language models in low-resource and underrepresented languages - Evaluation methodologies and creation of benchmarks for low-resource speech - Bias analysis, detection, and mitigation strategies in speech technologies - Real-world applications, deployment challenges, and case studies - Speech recognition, speech-to-text, language modeling, multilingual and cross-lingual approaches - Fairness, ethical considerations, and inclusive NLP for low-resource speech communities - Parameter-efficient adaptation methods and knowledge distillation for speech models - Edge-constrained inference and computational efficiency in low-resource settings *--> SPEAKABLE* will only accept direct submissions through the given Submission link: https://softconf.com/lrec2026/SPEAKABLE2026/ <https://softconf.com/lrec2026/SPEAKABLE2026/login/scmd.cgi?scmd=logout> *Invited Speaker* *Dr. Jordi Luque (Lead Research Scientist, Telefónica Research): https://eloquenceai.eu/imprint/ <https://eloquenceai.eu/imprint/>* Further details will be posted on the workshop website. *Info for Papers* We welcome original research papers and ongoing work relevant to speech and language modeling for low-resource settings. Submissions should be 4 to 8 pages in length and follow the LREC 2026 stylesheet. Submissions should follow LREC formatting guidelines (https://lrec2026.info/authors-kit/). The maximum number of pages excludes potential Ethics Statements and discussion on Limitations, acknowledgements, and references, as well as data and code availability statements. Appendices or supplementary material are not permitted during the initial submission phase, as papers should be self-contained and reviewable on their own. Submissions will be judged on correctness, originality, technical strength, significance, relevance to the conference, and interest to the attendees. Papers must be of original, previously *unpublished* work. All submissions should follow the two-column LREC style guidelines. We strongly recommend the use of the LaTeX/Overleaf style files. All papers will undergo a *double-blind peer review* process, with final acceptance decisions made by the workshop chairs. Submissions that violate the requirements above will be rejected without review. Accepted papers will be presented as *oral or poster* presentations. The mode of presentation will be determined by the workshop chairs and does not reflect the quality of the submission. *SPEAKABLE 2026* will primarily be an in-person event, but online participation will also be possible for participants who cannot travel to the conference. *Important Dates* Paper Submission Deadline: February 16, 2026 Notification of Acceptance: March 12, 2026 Camera-Ready Papers: March 30, 2026 Workshop Date: May 2026 (11/05/2026) All deadlines are anywhere-on-earth (AoE). *Workshop Organizers* Nina Hosseini-Kivanani (RTL & University of Luxembourg, Luxembourg) Alessio Brutti (Fondazione Bruno Kessler, Italy) Marco Matassoni (Fondazione Bruno Kessler, Italy) Sandipana Dowerah (Tallinn University of Technology, Estonia) Davide Liga (University of Luxembourg, Luxembourg) Christoph Schommer (University of Luxembourg, Luxembourg) 📄 Learn more and submit: https://speakable-2026.github.io/ For questions, contact: speakable2026(a)gmail.com

1 0

Second CFP: LoResMT 2026 at EACL 2026
by Atul K. Ojha 19 Dec '25

19 Dec '25

Apologies for cross-posting --------------------------------------------------------------------------- *The Ninth Workshop on Technologies for Machine Translation of Low-Resource Languages (LoResMT 2026)* *https://www.loresmt.org/ <https://www.loresmt.org/>* *@ EACL 2026 (March 24-29, 2026)* *Rabat, Morocco* *SUBMISSION* ARR submission link: https://openreview.net/group?id=eacl.org/EACL/2026/Workshop/LoResMT *TIMELINE* - Submission deadline: December 19, 2025 (Anywhere on Earth) - Pre-reviewed (ARR) submission deadline: January 2, 2026 - Notification of acceptance: January 23, 2026 - Camera-ready paper due: February 3, 2026 (Anywhere on Earth) - Pre-recorded video due (hard deadline): February 24, 2026 - Workshop dates at EACL 2026: TBD - EACL 202 Main Conference: March 24-29, 2026 *SCOPE* Based on the success of past low-resource machine translation (MT) workshops at AMTA 2018, MT Summit 2019, AACL-IJCNLP 2020, AMTA 2021, COLING 2022, EACL 2023, ACL 2024, NAACL 2025, we introduce LoResMT 2026 workshop at EACL 2025. The workshop provides a discussion panel for researchers working on MT systems/methods for low-resource and under-represented languages in general. We would like to help review/overview the state of MT for low-resource languages and define the most important directions. Fundamental work on low-resource languages in MT and NLP is still crucial and unavoidable. We also solicit papers dedicated to supplementary natural language processing (NLP) tools that are used in any language and especially in low-resource languages. Overview papers of these NLP tools are very welcome. It will be beneficial if the evaluations of these tools in research papers include their impact on the quality of MT output. *TOPICS* We are highly interested in (1) original research papers, (2) review/opinion papers, and (3) online systems on the topics below; however, we welcome all novel ideas that cover research on low-resource languages. - Neural machine translation for low-resource languages - Work that presents online systems for practical use by native speakers - Word tokenizers/de-tokenizers for specific languages - Word/morpheme segmenters for specific languages - Alignment/Re-ordering tools for specific language pairs - Use of morphology analyzers and/or morpheme segmenters in MT - Multilingual/cross-lingual NLP tools for MT - Corpora creation and curation technologies for low-resource languages - COVID-related corpora, their translations and corresponding NLP/MT systems - Review of available parallel corpora for low-resource languages - Research and review papers of MT methods for low-resource languages - MT systems/methods (e.g. rule-based, SMT, NMT) for low-resource languages - Pivot MT for low-resource languages - Zero-shot MT for low-resource languages - Fast building of MT systems for low-resource languages - Re-usability of existing MT systems for low-resource languages - Machine translation for language preservation *SUBMISSION INFORMATION* We are soliciting two types of submissions: (1) research, review, and position papers and (2) system demonstration papers. For research, review and position papers, the length of each paper should be at least four (4) and not exceed eight (8) pages, plus unlimited pages for references. For system demonstration papers, the limit is four (4) pages. Submissions should be formatted according to the official ACL style templates (Overleaf). Please refer to the EACL submission guidelines for further information <https://2026.eacl.org/calls/papers/>. Accepted papers will be published online in the EACL 2026 proceedings and will be presented at the conference. Submissions must be anonymized and should be done using the provided submission system. Scientific papers that have been or will be submitted to other venues must be declared as such and must be withdrawn from the other venues if accepted and published at LoResMT. The review will be double-blind. Authors of an accepted paper should present their paper in person at EACL 2026. Papers should be submitted in PDF to the LoResMT Open Review. We would like to encourage authors to cite papers written in ANY language that are related to the topics, as long as both original bibliographic items and their corresponding English translations are provided. Registration is handled by the main conference ( https://2026.eacl.org/registration). *Shared Tasks: * We are going to have a machine translation competition for several low resource Turkic languages, namely, Chuvash, Bashkir, Tatar and Kazakh. The pairs you can evaluate your models are: Chuvash-English, Chuvash-Russian, Bashkir-English, Bashkir-Russian, Tatar-English, Kazakh-Russian. Unfortunately, we cannot provide you with training data for these language pairs, but nevertheless you can use any data you find useful or collect by yourself. We expect you to share your findings as a report. Best reports will be included in the LoResMT proceedings. for futher information please visit to LoResMT website. *ORGANIZING COMMITTEE (LISTED ALPHABETICALLY)* Atul Kr. Ojha Chao-Hong Liu Ekaterina Vylomova Flammie Pirinen Jonathan Washington Nathaniel Oco Xiaobing Zhao *CONTACT* Please email loresmt(a)googlegroups.com if you have any questions/comments/suggestions.

1 1

Deadline extended: VarDial 2026 @ EACL – The Thirteenth Workshop on NLP for Similar Languages, Varieties and Dialects
by verena.blaschke＠cis.lmu.de 15 Dec '25

15 Dec '25

Dear all, This is the last CfP for VarDial 2026 - The Thirteenth Workshop on NLP for Similar Languages, Varieties and Dialects. We have extended the submission deadlines (January 2 for direct submissions, January 10 for committing pre-reviewed submissions), see details below. Apologies for cross-posting! -- VarDial 2026: https://sites.google.com/view/vardial-2026/ VarDial 2026 will be colocated with EACL 2026 in Rabat, Morocco. We anticipate a discussion on computational methods and language resources for closely related languages, language varieties, and dialects. We welcome papers dealing with one or more of the following topics: - Language resources and tools for similar languages, varieties and dialects; - Evaluation of language resources and tools applied to non-dominant language varieties; - Cross-lingual transfer and adaptation of models to similar languages, varieties and dialects; - Automatic identification of lexical variation; - Automatic classification of language varieties; - Machine translation between closely-related languages, language varieties and dialects; - Corpus-driven studies in dialectology and language variation; - Computational approaches to mutual intelligibility between dialects and similar languages; - Text similarity and adaptation between language varieties; - Linguistic issues in the adaptation of language resources and tools (e.g., cognate detection, semantic discrepancies, lexical gaps, false friends); - Studies focusing on related creole languages and their lexifier languages; - Studies focusing on diachronic language variation (e.g. phylogenetic methods, historical dialects). In addition to the topics listed above, we also welcome papers dealing with diachronic language variation (e.g. phylogenetic methods, historical dialects). Instructions for Authors Submissions should be formatted according to the ACL Rolling Review template and submitted as a PDF. The review process will be double-blind. More information is on the website (https://sites.google.com/view/vardial-2026/). Important Dates - Direct Submission deadline: January 2, 2025 (updated!) - Pre-reviewed (ARR) submission deadline: January 10, 2026 (updated!) - Notification of acceptance: January 23, 2026 - Camera-ready paper due: February 3, 2026 - Workshop at EACL (hybrid): March 24-29, 2026 (exact date TBD) Shared Task: Arabic Modeling In Your Accent (AMIYA) VarDial 2026 will have a shared task on language modelling for dialectal Arabic (DA), where participants can contribute LLMs trained or adapted for DA. These will be evaluated using the AL-QASIDA benchmark (Robinson et al., 2025), an evaluation suite that comprehensively measures an LLM’s dialectal fidelity, understanding, generation quality, and MSA-DA diglossia in DA. More information: https://sites.google.com/view/vardial-2026/shared-tasks - Training data release: November 30, 2025 - Registration deadline, eval data finalized: December 15, 2025 - System submission deadline: January 10, 2025 - System description paper deadline: January 20, 2025 Workshop Organizers Yves Scherrer – University of Oslo (Norway) Noëmi Aepli – University of Pennsylvania (USA) Verena Blaschke – LMU Munich and Munich Center for Machine Learning (Germany) Tommi Jauhiainen – University of Helsinki (Finland) Nikola Ljubešić – Jožef Stefan Institute and University of Ljubljana (Slovenia) Preslav Nakov – Mohamed bin Zayed University of Artificial Intelligence (UAE) Jörg Tiedemann – University of Helsinki (Finland) Marcos Zampieri – George Mason University (USA) Contact: yves.scherrer(a)ifi.uio.no or verena.blaschke(a)cis.lmu.de

1 0

2st CfP: SPEAKABLE Co-located with LREC2026
by Marco Matassoni 15 Dec '25

15 Dec '25

*[APOLOGIES FOR CROSS-POSTING]* *2st CfP: SPEAKABLE 2026* https://speakable-2026.github.io/ *Submission link:* https://softconf.com/lrec2026/SPEAKABLE2026/ <https://softconf.com/lrec2026/SPEAKABLE2026/login/scmd.cgi?scmd=logout> *Location:* Palau de Congressos de Palma, Palma de Mallorca (Spain) We are pleased to announce the upcoming full-day *SPEAKABLE 2026* Workshop on Speech Language Models in Low-Resource Settings: Performance, Evaluation, and Bias Analysis, co-located with LREC 2026 in Palma de Mallorca. This workshop brings together researchers, practitioners, and industry experts working to advance speech technology for under-resourced languages. We invite contributions that address the unique challenges and opportunities in this space. *Workshop Topics of Interest* We encourage submissions on (but not limited to): - Performance of speech language models in low-resource and underrepresented languages - Evaluation methodologies and creation of benchmarks for low-resource speech - Bias analysis, detection, and mitigation strategies in speech technologies - Real-world applications, deployment challenges, and case studies - Speech recognition, speech-to-text, language modeling, multilingual and cross-lingual approaches - Fairness, ethical considerations, and inclusive NLP for low-resource speech communities - Parameter-efficient adaptation methods and knowledge distillation for speech models - Edge-constrained inference and computational efficiency in low-resource settings *Invited Speaker* *Dr. Jordi Luque (Lead Research Scientist, Telefónica Research): **https://eloquenceai.eu/imprint/ <https://eloquenceai.eu/imprint/>* Further details will be posted on the workshop website. *Important Dates* Paper Submission Deadline: February 16, 2026 Notification of Acceptance: March 12, 2026 Camera-Ready Papers: March 30, 2026 Workshop Date: May 2026 (11/05/2026) All deadlines are anywhere-on-earth (AoE). *Info for Papers* We welcome original research papers and ongoing work relevant to speech and language modeling for low-resource settings. Submissions should be 4 to 8 pages in length and follow the LREC 2026 stylesheet. Submissions should follow LREC formatting guidelines (https://lrec2026.info/authors-kit/). The maximum number of pages excludes potential Ethics Statements and discussion on Limitations, acknowledgements, and references, as well as data and code availability statements. Appendices or supplementary material are not permitted during the initial submission phase, as papers should be self-contained and reviewable on their own. Submissions will be judged on correctness, originality, technical strength, significance, relevance to the conference, and interest to the attendees. Papers must be of original, previously *unpublished* work. All submissions should follow the two-column LREC style guidelines. We strongly recommend the use of the LaTeX/Overleaf style files. All papers will undergo a *double-blind peer review* process, with final acceptance decisions made by the workshop chairs. Submissions that violate the requirements above will be rejected without review. Accepted papers will be presented as *oral or poster* presentations. The mode of presentation will be determined by the workshop chairs and does not reflect the quality of the submission. *SPEAKABLE 2026* will primarily be an in-person event, but online participation will also be possible for participants who cannot travel to the conference. *Workshop Organizers* Nina Hosseini-Kivanani (RTL & University of Luxembourg, Luxembourg) Alessio Brutti (Fondazione Bruno Kessler, Italy) Marco Matassoni (Fondazione Bruno Kessler, Italy) Sandipana Dowerah (Tallinn University of Technology, Estonia) Davide Liga (University of Luxembourg, Luxembourg) Christoph Schommer (University of Luxembourg, Luxembourg) Learn more and submit: https://speakable-2026.github.io/ For questions, contact: speakable2026(a)gmail.com Many thanks, Marco Matassoni, FBK -- -- Le informazioni contenute nella presente comunicazione sono di natura privata e come tali sono da considerarsi riservate ed indirizzate esclusivamente ai destinatari indicati e per le finalità strettamente legate al relativo contenuto. Se avete ricevuto questo messaggio per errore, vi preghiamo di eliminarlo e di inviare una comunicazione all’indirizzo e-mail del mittente. -- The information transmitted is intended only for the person or entity to which it is addressed and may contain confidential and/or privileged material. If you received this in error, please contact the sender and delete the material.

1 0

Third Call for Papers, Participation and Demo - IWSLT 2025
by Atul K. Ojha 12 Dec '25

12 Dec '25

Apologies for cross-posting. ---------------------------------------- *The International Conference on Spoken Language Translation* *ACL – 22nd IWSLT 2025 – **Third** Call for Participation* *31 July-1 August 2025 - Vienna, Austria* http://iwslt.org The International Conference on Spoken Language Translation (IWSLT) <https://iwslt.org/> is the premier annual conference for all aspects of Spoken Language Translation. Every year, the conference organises and sponsors open evaluation campaigns around key challenges in simultaneous and consecutive translation, under real-time/low latency or offline conditions and under low-resource or multilingual constraints. System descriptions and results from participants’ systems and scientific papers related to key algorithmic advances and best practices are presented. IWSLT is the venue of the SIGSLTs <https://iwslt.org/sigslt/>, the Special Interest Group on Spoken Language Translation <https://iwslt.org/sigslt/> of ACL <https://www.aclweb.org/portal/>, ISCA <https://www.isca-speech.org/> and ELRA <https://www.elra.info/>. With a track record of 21 years, IWSLT benchmarks and proceedings serve as reference for all researchers and practitioners working on speech translation and related fields. The 22nd edition of IWSLT will be run as a hybrid ELRA <https://www.elra.info/>/ACL <https://www.aclweb.org/portal/> event, co-located with ACL 2025 <https://2025.aclweb.org/> from 31 July to 1 August 2025. *Important Dates* *January 1, 2025*: Release of shared task training and dev data *March 15, 2025*: Scientific paper submission deadline *Apr 1-15, 2025*: Evaluation period *April 21, 2025*: System description paper and demo submission deadline *May 15, 2025*: Notification of acceptance *June 1, 2025*: Camera-ready deadline (all paper) *July 31-Aug 1*, *2025*: IWSLT conference *Evaluation* The IWSLT 2025 features shared tasks <https://iwslt.org/2025/#shared-tasks> that address the following focus areas: - High-resource ST: Offline track, Simultaneous track, Subtitling track, Model compression track - Low-resource ST: Low-resource and Indic (multilingual) tracks - Instruction-following Speech Processing track: Technical domain ST, ASR, Summarization, and QA Training and development data for each shared task will be prepared and released by the respective organisers (for further information on this initiative, please refer to the IWSLT website <https://iwslt.org/2025/>). Participants will receive instructions about how to submit their runs. In addition, participants have the opportunity to present their work through a system paper that will be published in the ACL Proceedings. *Conference* IWSLT also invites submissions of scientific papers to be published in the ACL Proceedings and presented either in oral or poster format. The conference selects high-quality, original contributions on theoretical and practical issues of spoken language translation research, technologies and applications. Submissions will be accepted directly through the IWSLT submission site (to be announced on the website <https://iwslt.org/2025/>). We will also accept commitments of submissions with reviews from the ACL Rolling Review. Additionally, to foster cross-pollination of ideas, the conference also invites the presentation of papers on speech translation recently published elsewhere. Please note that this is for non-archival presentation of papers relevant to speech translation already published in other venues (e.g., Findings for the *ACL, speech, NLP or MT conferences). Submissions for this category will be accepted through a dedicated form (to be announced on the website <https://iwslt.org/2025/>). Papers will be checked for relevance to IWSLT, and assigned either oral or poster presentation slots if selected. *Demo Session* We invite researchers, practitioners, and industry professionals to participate in an engaging demo session highlighting innovative systems, tools, and component technologies that advance the field of speech translation. The session will include live and interactive system demonstrations to foster discussion and knowledge exchange among participants across the field. For more information, please see our Call for Demos <https://iwslt.org/2025/call-for-demos>. *Contact* Please email iwslt-evaluation-campaign(a)googlegroups.com if you have any questions related to the shared tasks. Thanks, Marine, Marcello, Alex, Jan, Sebastian, Elizabeth, Atul (IWSLT organisers)

1 1

Jump to page: