Dear all,
Here is our CfP for VarDial 2026 - The Thirteenth Workshop on NLP for Similar Languages, Varieties and Dialects:
--
VarDial 2026: https://sites.google.com/view/vardial-2026/
VarDial 2026 will be colocated with EACL 2026 in Rabat, Morocco. We anticipate a discussion on computational methods and language resources for closely related languages, language varieties, and dialects.
We welcome papers dealing with one or more of the following topics:
- Language resources and tools for similar languages, varieties and dialects;
- Evaluation of language resources and tools applied to non-dominant language varieties;
- Cross-lingual transfer and adaptation of models to similar languages, varieties and dialects;
- Automatic identification of lexical variation;
- Automatic classification of language varieties;
- Machine translation between closely-related languages, language varieties and dialects;
- Corpus-driven studies in dialectology and language variation;
- Computational approaches to mutual intelligibility between dialects and similar languages;
- Text similarity and adaptation between language varieties;
- Linguistic issues in the adaptation of language resources and tools (e.g., cognate detection, semantic discrepancies, lexical gaps, false friends);
- Studies focusing on related creole languages and their lexifier languages;
- Studies focusing on diachronic language variation (e.g. phylogenetic methods, historical dialects).
In addition to the topics listed above, we also welcome papers dealing with diachronic language variation (e.g. phylogenetic methods, historical dialects).
Instructions for Authors
Submissions should be formatted according to the ACL Rolling Review template and submitted as a PDF. The review process will be double-blind. More information is on the website (https://sites.google.com/view/vardial-2026/).
Important Dates
- Direct Submission deadline: December 19, 2025
- Pre-reviewed (ARR) submission deadline: January 2, 2026
- Notification of acceptance: January 23, 2026
- Camera-ready paper due: February 3, 2026
- Workshop at EACL (hybrid): March 24-29, 2026 (exact date TBD)
Organizers
Yves Scherrer - University of Helsinki (Finland)
Noëmi Aepli - University of Pennsylvania (USA)
Verena Blaschke - LMU Munich and Munich Center for Machine Learning (Germany)
Tommi Jauhiainen - University of Helsinki (Finland)
Nikola Ljubešić - Jožef Stefan Institute (Slovenia) and University of Zagreb (Croatia)
Preslav Nakov - Mohamed bin Zayed University of Artificial Intelligence (UAE)
Jörg Tiedemann - University of Helsinki (Finland)
Marcos Zampieri - George Mason University (USA)
Contact: yves.scherrer(a)helsinki.fi or tommi.jauhiainen(a)helsinki.fi
--
Best regards,
Verena Blaschke
Final Call for Participation
DHASA Conference and RAIL workshop 2025
https://dh2025.digitalhumanities.org.zahttps://sadilar.org/en/rail-2025/
DHASA conference dates: 11 November 2025-14 November 2025
RAIL workshop date: 10 November 2025
Conference venue: CSIR ICC, Pretoria, South Africa
Registration: https://dh2025.digitalhumanities.org.za/registration/
DHASA CONFERENCE
Theme: The role of humanities in digital humanities and artificial
intelligence
The Digital Humanities Association of Southern Africa (DHASA) is
pleased to announce its fifth conference, focusing on the theme The
role of humanities in digital humanities and artificial intelligence.
In a region where the field of Digital Humanities is still relatively
underdeveloped, this conference aims to address this gap and foster
growth and collaboration in the field. The conference offers an
opportunity for researchers interested in showcasing their work in the
broad field of Digital Humanities to come together. By doing so, the
conference provides a comprehensive overview of the current state-of-
the-art in Digital Humanities, particularly within the Southern Africa
region. As such, we welcome submissions related to Digital Humanities
research conducted by individuals from Southern Africa or research
focused on the geographical area of Southern Africa in the broad sense.
Furthermore, the conference serves as a platform for information
sharing and networking among researchers passionate about Digital
Humanities. By bringing together experts working on Digital Humanities
in Southern Africa or with a focus on Southern Africa, we aim to
promote collaboration and facilitate further research in this dynamic
field. In addition to the main conference, affiliated workshops and
tutorials will be organised, providing researchers with valuable
insights into novel technologies and tools. These supplementary events
are designed for researchers interested in specific aspects of Digital
Humanities or seeking practical information to enter or advance their
knowledge in the field.
The DHASA conference welcomes interdisciplinary contributions from
researchers in various domains of Digital Humanities, including, but
not limited to, language, literature, visual art, performance and
theatre studies, media studies, music, history, sociology, psychology,
language technologies, library studies, philosophy, methodologies,
software and computation, AI, and more. Our goal is to cultivate an
inclusive scientific community of practice within Digital Humanities.
RAIL WORKSHOP
Theme: Language resources in the age of large language models
The sixth Resources for African Indigenous Languages (RAIL) workshop
will be co-located with the Digital Humanities Association of Southern
Africa (DHASA) 2025 conference at the CSIR International Convention
Centre in Pretoria, South Africa, on 10 November 2025. The RAIL
workshop is an interdisciplinary platform for researchers working on
African indigenous languages resources such as natural languages
processing (NLP) tools, Human Language Technologies (HLT), data
collections, and annotations. This workshop aims to foster a scientific
community of practice that focuses on computational linguistic tools
and data that are designed for or applied to the indigenous languages
of Africa.
Many African languages are under-resourced while only a few are
considered to be somewhat better resourced. These languages often share
interesting properties such as writing systems, making them different
from most high-resourced languages. From a computational perspective,
these languages lack enough corpora to undertake high level development
of NLP and HLT tools, which in turn impedes the development of African
languages in these areas. During previous workshops, it was noted that
the problems and solutions presented were not only applicable to
African languages but were also relevant to many other low-resource
languages across the world. Because these languages share similar
challenges, this workshop provides researchers with opportunities to
work collaboratively on issues of language resource development and
learn from each other.
The RAIL workshop has several aims. First, the workshop brings together
researchers who work on African indigenous languages, forming a
community of practice for people working on indigenous languages.
Second, the workshop aims to reveal currently unknown or unpublished
existing resources (corpora, NLP tools, and applications), resulting in
a better overview of the current state-of-the-art, and also allows for
discussions on novel, desired resources for future research in this
area. Third, it enhances sharing of knowledge on the development of
low-resource languages. Finally, it enables discussions on how to
improve the quality as well as availability of the resources.
Organising Committees
DHASA conference
Aby Louw, Council for Scientific and Industrial Research
Franco Mak, Council for Scientific and Industrial Research
Franziska Pannach, Rijksuniversiteit Groningen
Ilana Wilken, Council for Scientific and Industrial Research
Johannes Sibeko, Nelson Mandela University
Juan Steyn, South African Centre for Digital Language Resources
Laurette Marais, Council for Scientific and Industrial Research
Marissa Griesel, South African Centre for Digital Language Resources
Menno van Zaanen, South African Centre for Digital Language Resources
Privolin Naidoo, Council for Scientific and Industrial Research
Sthembiso Mkhwanazi, Council for Scientific and Industrial Research
RAIL workshop
Rooweither Mabuya, South African Centre for Digital Language Resources
Muzi Matfunjwa, South African Centre for Digital Language Resources
Mmasibidi Setaka, South African Centre for Digital Language Resources
Menno van Zaanen, South African Centre for Digital Language Resources
--
Prof Menno van Zaanen menno.vanzaanen(a)nwu.ac.za
Professor in Digital Humanities
South African Centre for Digital Language Resources
https://www.sadilar.org
________________________________
NWU PRIVACY STATEMENT:
http://www.nwu.ac.za/it/gov-man/disclaimer.html
DISCLAIMER: This e-mail message and attachments thereto are intended solely for the recipient(s) and may contain confidential and privileged information. Any unauthorised review, use, disclosure, or distribution is prohibited. If you have received the e-mail by mistake, please contact the sender or reply e-mail and delete the e-mail and its attachments (where appropriate) from your system.
________________________________
******************************
PROPOR 2026: 17th International Conference on Computational Processing of
Portuguese
Salvador - BA, Brazil
April 13th to 16th 2026
https://propor2026.ufba.br/
CALL FOR PAPERS
******************************
The International Conference on Computational Processing of Portuguese
(PROPOR) is the main event in the area of human language processing that is
focused on theoretical and technological issues of written and spoken
Portuguese and Galician. The meeting has been a very rich forum for the
exchange of ideas and partnerships for the research and industry
communities dedicated to automated language processing, promoting the
development of methodologies, resources, and projects.
We call for papers describing work on any topic related to the
computational processing of Portuguese and Galician by researchers in
industry or academia. Topics of interest include, but are not limited to:
-
Natural language processing tasks (e.g., parsing, word sense
disambiguation, coreference resolution)
-
Natural language processing applications (e.g., question answering,
subtitling, summarization, sentiment analysis)
-
Natural language generation
-
Information extraction and information retrieval
-
Speech technologies (e.g., spoken language generation, speech and
speaker recognition, spoken language understanding)
-
Speech applications (e.g., spoken language interfaces, dialogue systems,
speech-to-speech translation)
-
Resources, standardization, and evaluation (e.g., corpora, ontologies,
lexicons, grammars)
-
NLP-oriented linguistic description or theoretical analysis
-
Distributional semantics and language modeling
-
Portuguese language varieties and dialect processing (including the
language varieties of Angola, Brazil, Cape Verde, East Timor, Galicia,
Guinea-Bissau, Macau, Mozambique, Portugal, and Sao Tome and Principe)
-
Multilingual studies, methods, applications, and resources, Portuguese
and/or Galician
PROPOR 2026 will take place from April 13th to 16th in Salvador - BA
(Brazil), a city that stands as a historical meeting point between the
Portuguese language, the Indigenous languages of Brazil, and the African
languages brought by enslaved peoples from Africa. This linguistic and
cultural contact profoundly shaped Brazilian Portuguese and Brazilian
culture.
PROPOR 2026 will be the 17th edition of the biannual PROPOR conference,
hosted alternately in Brazil and Portugal, and more recently also in
Galiza. Past meetings were held in Lisbon, PT (1993); Curitiba, BR
(1996); Porto Alegre, BR (1998); Évora, PT (1999); Atibaia, BR (2000);
Faro, PT (2003); Itatiaia, BR (2006); Aveiro, PT (2008); Porto Alegre, BR
(2010); Coimbra, PT (2012); São Carlos, BR (2014); Tomar, PT (2016);
Canela, BR (2018); Évora, PT (2020); Fortaleza, BR (2022); and Santiago de
Compostela, GZ (2026).
Submissions
Submissions should describe original, unpublished work. Authors are invited
to submit two kinds of papers:
-
Full papers reporting substantial and completed work, especially those
that may contribute in a significant way to the advancement of the area.
Wherever appropriate, concrete evaluation results should be included. Full
papers can have up to 8 content pages + 2 pages for references.
-
Short papers reporting small, focused contributions such as ongoing
work, position papers, potential ideas to be discussed, negative results,
or an interesting application nugget. Short papers can have up to 4
content pages + 1 page for references.
Each submission will be evaluated by at least two reviewers. As reviewing
will be double-blind, submitted papers must be anonymized. Submissions
should not include the authors’ names, affiliations, or any other
information that could be used to identify them. Authors must avoid
self-references that reveal identity, like “We previously showed (Freitas,
1991) …”. Instead, they should prefer citations such as “Freitas (1991)
previously showed …”. Separate author identification information will be
required as part of the submission process.
While recent editions have only accepted submissions in English, this year
we are pleased to also accept papers written in Portuguese, reaffirming our
commitment to promoting scientific exchange in our language.
At submission time, only PDF format is accepted. For the final versions,
authors of accepted papers will be given 1 extra content page to
incorporate the reviews’ suggestions. Authors of accepted papers will be
requested to send the source files for the production of the proceedings.
All submitted papers must conform to the ACL style guidelines and use the
LaTeX or MS Word stylesheets below:
LaTeX stylesheet
<https://github.com/acl-org/acl-style-files/tree/master/latex>
MS Word stylesheet
<https://github.com/acl-org/acl-style-files/tree/master/word>
Papers should be submitted via the following URL
https://cmt3.research.microsoft.com/PROPOR2026 by either selecting the
track
PROPOR2026 Long and short papers.
Multiple-submission policy
For submissions that have been or will be submitted to other meetings or
publications, this information must be provided at submission time. If a
submission is accepted, authors must notify the program chairs, indicating
which meeting they choose for presentation of their work. Papers that will
be (or have been) published elsewhere cannot be accepted for publication or
presentation.
Mandatory Reviewing Workload
As the pace of research in the field continues to increase, we need to
strengthen the commitment to reviewing for each paper submission. During
the submission process, authors will be required to specify which
co-authors are committing to cover reviewing in the event.
Publication
The proceedings of PROPOR 2026 will be published in the ACL Anthology. They
will be available online. To ensure publication, at least one author of
each accepted paper must complete a full registration for PROPOR 2026 by
the early registration deadline.
Ethics Policy
Authors are advised to follow the ACL Ethics Policy for submission, which
can be found at: https://aclrollingreview.org/cfp#ethics-policy
Authors are also strongly advised to follow the ACL guidelines for
generative AI assistance in authorship, which can be found at:
https://www.aclweb.org/adminwiki/index.php/ACL_Policy_on_Publication_Ethics…
Important dates
Full and short paper submission deadline: 16/11/2025 (23:59 GMT-12)
Notification of paper acceptance or rejection: 02/02/2026
Camera-ready papers due: 15/03/2026
Conference: April 13th - 16th, 2026
Kindest regards,
Iria de-Dios-Flores & Marlo Souza
PROPOR 2026 General Chairs
propor2026(a)ufba.br
Apologies for cross-posting
---------------------------------------------------------------------------
*The Ninth Workshop on Technologies for Machine Translation of Low-Resource
Languages (LoResMT 2026)*
*https://www.loresmt.org/ <https://www.loresmt.org/>*
*@ EACL 2026 (March 24-29, 2026)*
*Rabat, Morocco*
*SUBMISSION*
ARR submission link:
https://openreview.net/group?id=eacl.org/EACL/2026/Workshop/LoResMT
*TIMELINE*
- Submission deadline: December 19, 2025 (Anywhere on Earth)
- Pre-reviewed (ARR) submission deadline: January 2, 2026
- Notification of acceptance: January 23, 2026
- Camera-ready paper due: February 3, 2026 (Anywhere on Earth)
- Pre-recorded video due (hard deadline): February 24, 2026
- Workshop dates at EACL 2026: TBD
- EACL 202 Main Conference: March 24-29, 2026
*SCOPE*
Based on the success of past low-resource machine translation (MT)
workshops at AMTA 2018, MT Summit 2019, AACL-IJCNLP 2020, AMTA 2021, COLING
2022, EACL 2023, ACL 2024, NAACL 2025, we introduce LoResMT 2026 workshop
at EACL 2025. The workshop provides a discussion panel for researchers
working on MT systems/methods for low-resource and under-represented
languages in general. We would like to help review/overview the state of MT
for low-resource languages and define the most important directions.
Fundamental work on low-resource languages in MT and NLP is still crucial
and unavoidable. We also solicit papers dedicated to supplementary natural
language processing (NLP) tools that are used in any language and
especially in low-resource languages. Overview papers of these NLP tools
are very welcome. It will be beneficial if the evaluations of these tools
in research papers include their impact on the quality of MT output.
*TOPICS*
We are highly interested in (1) original research papers, (2)
review/opinion papers, and (3) online systems on the topics below; however,
we welcome all novel ideas that cover research on low-resource languages.
- Neural machine translation for low-resource languages
- Work that presents online systems for practical use by native speakers
- Word tokenizers/de-tokenizers for specific languages
- Word/morpheme segmenters for specific languages
- Alignment/Re-ordering tools for specific language pairs
- Use of morphology analyzers and/or morpheme segmenters in MT
- Multilingual/cross-lingual NLP tools for MT
- Corpora creation and curation technologies for low-resource languages
- COVID-related corpora, their translations and corresponding NLP/MT systems
- Review of available parallel corpora for low-resource languages
- Research and review papers of MT methods for low-resource languages
- MT systems/methods (e.g. rule-based, SMT, NMT) for low-resource languages
- Pivot MT for low-resource languages
- Zero-shot MT for low-resource languages
- Fast building of MT systems for low-resource languages
- Re-usability of existing MT systems for low-resource languages
- Machine translation for language preservation
*SUBMISSION INFORMATION*
We are soliciting two types of submissions: (1) research, review, and
position papers and (2) system demonstration papers. For research, review
and position papers, the length of each paper should be at least four (4)
and not exceed eight (8) pages, plus unlimited pages for references. For
system demonstration papers, the limit is four (4) pages. Submissions
should be formatted according to the official ACL style templates
(Overleaf). Please refer to the EACL submission guidelines for further
information <https://2026.eacl.org/calls/papers/>. Accepted papers will be
published online in the EACL 2026 proceedings and will be presented at the
conference.
Submissions must be anonymized and should be done using the provided
submission system. Scientific papers that have been or will be submitted to
other venues must be declared as such and must be withdrawn from the other
venues if accepted and published at LoResMT. The review will be
double-blind. Authors of an accepted paper should present their paper in
person at EACL 2026. Papers should be submitted in PDF to the LoResMT Open
Review.
We would like to encourage authors to cite papers written in ANY language
that are related to the topics, as long as both original bibliographic
items and their corresponding English translations are provided.
Registration is handled by the main conference (
https://2026.eacl.org/registration).
*ORGANIZING COMMITTEE (LISTED ALPHABETICALLY)*
Atul Kr. Ojha
Chao-Hong Liu
Ekaterina Vylomova
Flammie Pirinen
Jonathan Washington
Nathaniel Oco
Xiaobing Zhao
*PROGRAM COMMITTEE (To be confirmed)*
Abigail Walsh, ADAPT Centre, Dublin City University, Ireland
Alberto Poncelas, Rakuten, Singapore
Ali Hatami, University of Galway
Alina Karakanta, Leiden University
Amirhossein Tebbifakhr, Fondazione Bruno Kessler
Anna Currey, Amazon Web Services
Aswarth Abhilash Dara, Walmart Global Technology
Arturo Oncevay, University of Edinburgh
Atul Kr. Ojha, DSI, University of Galway
Barry Haddow, University of Edinburgh
Bogdan Babych, Heidelberg University
Chao-Hong Liu, Potamu Research Ltd
Constantine Lignos, Brandeis University, USA
Daan van Esch, Google
Diptesh Kanojia, University of Surrey, UK
Duygu Ataman, University of Zurich
Ekaterina Vylomova, University of Melbourne, Australia
Eleni Metheniti, CLLE-CNRS and IRIT-CNRS
Flammie Pirinen, UiT The Arctic University of Norway, Tromsø
Koel Dutta Chowdhury, Saarland University (Germany)
Jade Abbott, Retro Rabbit
Jasper Kyle Catapang, University of the Philippines
Jinliang Lu, Institute of Automation, Chinese Academy of Sciences
John P. McCrae, DSI, University of Galway
Liangyou Li, Noah’s Ark Lab, Huawei Technologies
Majid Latifi, University of York, York, UK
Maria Art Antonette Clariño, University of the Philippines Los Baños
Mathias Müller, University of Zurich
Milind Agarwal, George Mason University
Nathaniel Oco, De La Salle University (Philippines)
Pavel Rychlý, Masaryk University
Pengwei Li, Meta
Rico Sennrich, University of Zurich
Saliha Muradoglu, The Australian National University
Sangjee Dondrub, Qinghai Normal University
Santanu Pal, WIPRO AI
Sardana Ivanova, University of Helsinki
Sourabrata Mukherjee, Charles University
Surafel Melaku Lakew, Amazon AI
Thepchai Supnithi, National Electronics and Computer Technology Centre
Timothee Mickus, University of Helsinki
Wen Lai, Center for Information and Language Processing, LMU Munich
Xuebo Liu, Harbin Institute of Technolgy, Shenzhen
Yalemisew Abgaz, Dublin City University
Yasmin Moslem, ADAPT Centre, Dublin City University, Ireland
Zhanibek Kozhirbayev, National Laboratory Astana, Nazarbayev University
*CONTACT*
Please email loresmt(a)googlegroups.com if you have any
questions/comments/suggestions.
Dear Colleagues,
The SIGUL Board is pleased to invite nominations for the positions of
*Chair(s)* and *Secretary* of the /Special Interest Group on
Under-resourced Languages (SIGUL)/.
The newly elected Board will serve for the term *2026-2027*.
Each proposer may nominate *up to three candidates*, one for each position.
Please submit your nominations by *October 31* using the form below:
https://forms.gle/ctqWNLhmEhodFd8V7<https://forms.gle/ctqWNLhmEhodFd8V7>
You will be asked to provide details of the nominated person, together
with a short bio and a motivation. All nominations will be acknowledged
upon receipt.
For further details about SIGUL and its governance, please visit:
https://www.elra.info/en/about/sig/sigul/<https://www.elra.info/en/about/sig/sigul/>
Thank you for your participation and continued support of the SIGUL
community.
Warm regards,
The SIGUL Board (Sakriani Sakti, Claudia Soria, Maite Melero)*
*
Call for Participation
DHASA Conference and RAIL workshop 2025
https://dh2025.digitalhumanities.org.zahttps://sadilar.org/en/rail-2025/
DHASA conference dates: 11 November 2025-14 November 2025
RAIL workshop date: 10 November 2025
Conference venue: CSIR ICC, Pretoria, South Africa
Registration: https://dh2025.digitalhumanities.org.za/registration/
DHASA CONFERENCE
Theme: The role of humanities in digital humanities and artificial
intelligence
The Digital Humanities Association of Southern Africa (DHASA) is
pleased to announce its fifth conference, focusing on the theme The
role of humanities in digital humanities and artificial intelligence.
In a region where the field of Digital Humanities is still relatively
underdeveloped, this conference aims to address this gap and foster
growth and collaboration in the field. The conference offers an
opportunity for researchers interested in showcasing their work in the
broad field of Digital Humanities to come together. By doing so, the
conference provides a comprehensive overview of the current state-of-
the-art in Digital Humanities, particularly within the Southern Africa
region. As such, we welcome submissions related to Digital Humanities
research conducted by individuals from Southern Africa or research
focused on the geographical area of Southern Africa in the broad sense.
Furthermore, the conference serves as a platform for information
sharing and networking among researchers passionate about Digital
Humanities. By bringing together experts working on Digital Humanities
in Southern Africa or with a focus on Southern Africa, we aim to
promote collaboration and facilitate further research in this dynamic
field. In addition to the main conference, affiliated workshops and
tutorials will be organised, providing researchers with valuable
insights into novel technologies and tools. These supplementary events
are designed for researchers interested in specific aspects of Digital
Humanities or seeking practical information to enter or advance their
knowledge in the field.
The DHASA conference welcomes interdisciplinary contributions from
researchers in various domains of Digital Humanities, including, but
not limited to, language, literature, visual art, performance and
theatre studies, media studies, music, history, sociology, psychology,
language technologies, library studies, philosophy, methodologies,
software and computation, AI, and more. Our goal is to cultivate an
inclusive scientific community of practice within Digital Humanities.
RAIL WORKSHOP
Theme: Language resources in the age of large language models
The sixth Resources for African Indigenous Languages (RAIL) workshop
will be co-located with the Digital Humanities Association of Southern
Africa (DHASA) 2025 conference at the CSIR International Convention
Centre in Pretoria, South Africa, on 10 November 2025. The RAIL
workshop is an interdisciplinary platform for researchers working on
African indigenous languages resources such as natural languages
processing (NLP) tools, Human Language Technologies (HLT), data
collections, and annotations. This workshop aims to foster a scientific
community of practice that focuses on computational linguistic tools
and data that are designed for or applied to the indigenous languages
of Africa.
Many African languages are under-resourced while only a few are
considered to be somewhat better resourced. These languages often share
interesting properties such as writing systems, making them different
from most high-resourced languages. From a computational perspective,
these languages lack enough corpora to undertake high level development
of NLP and HLT tools, which in turn impedes the development of African
languages in these areas. During previous workshops, it was noted that
the problems and solutions presented were not only applicable to
African languages but were also relevant to many other low-resource
languages across the world. Because these languages share similar
challenges, this workshop provides researchers with opportunities to
work collaboratively on issues of language resource development and
learn from each other.
The RAIL workshop has several aims. First, the workshop brings together
researchers who work on African indigenous languages, forming a
community of practice for people working on indigenous languages.
Second, the workshop aims to reveal currently unknown or unpublished
existing resources (corpora, NLP tools, and applications), resulting in
a better overview of the current state-of-the-art, and also allows for
discussions on novel, desired resources for future research in this
area. Third, it enhances sharing of knowledge on the development of
low-resource languages. Finally, it enables discussions on how to
improve the quality as well as availability of the resources.
Organising Committees
DHASA conference
Aby Louw, Council for Scientific and Industrial Research
Franco Mak, Council for Scientific and Industrial Research
Franziska Pannach, Rijksuniversiteit Groningen
Ilana Wilken, Council for Scientific and Industrial Research
Johannes Sibeko, Nelson Mandela University
Juan Steyn, South African Centre for Digital Language Resources
Laurette Marais, Council for Scientific and Industrial Research
Marissa Griesel, South African Centre for Digital Language Resources
Menno van Zaanen, South African Centre for Digital Language Resources
Privolin Naidoo, Council for Scientific and Industrial Research
Sthembiso Mkhwanazi, Council for Scientific and Industrial Research
RAIL workshop
Rooweither Mabuya, South African Centre for Digital Language Resources
Muzi Matfunjwa, South African Centre for Digital Language Resources
Mmasibidi Setaka, South African Centre for Digital Language Resources
Menno van Zaanen, South African Centre for Digital Language Resources
--
Prof Menno van Zaanen menno.vanzaanen(a)nwu.ac.za
Professor in Digital Humanities
South African Centre for Digital Language Resources
https://www.sadilar.org
________________________________
NWU PRIVACY STATEMENT:
http://www.nwu.ac.za/it/gov-man/disclaimer.html
DISCLAIMER: This e-mail message and attachments thereto are intended solely for the recipient(s) and may contain confidential and privileged information. Any unauthorised review, use, disclosure, or distribution is prohibited. If you have received the e-mail by mistake, please contact the sender or reply e-mail and delete the e-mail and its attachments (where appropriate) from your system.
________________________________
𝗦𝗲𝗰𝗼𝗻𝗱 𝗜𝗻𝘁𝗲𝗿𝗻𝗮𝘁𝗶𝗼𝗻𝗮𝗹 𝗖𝗼𝗻𝗳𝗲𝗿𝗲𝗻𝗰𝗲 𝗼𝗻 𝗡𝗮𝘁𝘂𝗿𝗮𝗹 𝗟𝗮𝗻𝗴𝘂𝗮𝗴𝗲 𝗣𝗿𝗼𝗰𝗲𝘀𝘀𝗶𝗻𝗴 𝗮𝗻𝗱 𝗔𝗿𝘁𝗶𝗳𝗶𝗰𝗶𝗮𝗹 𝗜𝗻𝘁𝗲𝗹𝗹𝗶𝗴𝗲𝗻𝗰𝗲 𝗳𝗼𝗿 𝗖𝘆𝗯𝗲𝗿 𝗦𝗲𝗰𝘂𝗿𝗶𝘁𝘆 (𝗡𝗟𝗣𝗔𝗜𝗖𝗦’𝟮𝟬𝟮𝟲)
University of Alicante, Alicante, Spain
11 and 12 June 2026
https://nlpaics2026.gplsi.es/
𝗙𝗶𝗿𝘀𝘁 𝗖𝗮𝗹𝗹 𝗳𝗼𝗿 𝗣𝗮𝗽𝗲𝗿𝘀
Recent advances in Natural Language Processing (NLP), Deep Learning and Large Language Models (LLMs) have resulted in improved performance of applications. In particular, there has been a growing interest in employing AI methods in different Cyber Security applications.
In today's digital world, Cyber Security has emerged as a heightened priority for both individual users and organisations. As the volume of online information grows exponentially, traditional security approaches often struggle to identify and prevent evolving security threats. The inadequacy of conventional security frameworks highlights the need for innovative solutions that can effectively navigate the complex digital landscape for ensuring robust security. NLP and AI in Cyber Security have vast potential to significantly enhance threat detection and mitigation by fostering the development of advanced security systems for autonomous identification, assessment, and response to security threats in real-time. Recognising this challenge and the capabilities of NLP and AI approaches to fortify Cyber Security systems, the Second International Conference on Natural Language Processing (NLP) and Artificial Intelligence (AI) for Cyber Security (NLPAICS’2026) continues the tradition from NLPAICS’2024 to be a gathering place for researchers in NLP and AI methods for Cyber Security. We invite contributions that present the latest NLP and AI solutions for mitigating risks in processing digital information.
𝗖𝗼𝗻𝗳𝗲𝗿𝗲𝗻𝗰𝗲 𝘁𝗼𝗽𝗶𝗰𝘀
The conference invites submissions on a broad range of topics related to the employment of NLP and AI (and in general, language studies and models) for Cyber Security including but not limited to:
- 𝘚𝘰𝘤𝘪𝘦𝘵𝘢𝘭 𝘢𝘯𝘥 𝘏𝘶𝘮𝘢𝘯 𝘚𝘦𝘤𝘶𝘳𝘪𝘵𝘺 𝘢𝘯𝘥 𝘚𝘢𝘧𝘦𝘵𝘺
- Content Legitimacy and Quality
- Detection and mitigation of hate speech and offensive language
- Fake news, deepfakes, misinformation and disinformation
- Detection of machine generated language in multimodal context (text, speech and gesture)
- Trust and credibility of online information
- User Security and Safety
- Cyberbullying and identification of internet offenders
- Monitoring extremist fora
- Suicide prevention
- Clickbait and scam detection
- Fake profile detection in online social networks
- Technical Measures and Solutions
- Social engineering identification, phishing detection
- NLP for risk assessment
- Controlled languages for safe messages
- Prevention of malicious use of ai models
- Forensic linguistics
- Human Factors in Cyber Security
- 𝘚𝘱𝘦𝘦𝘤𝘩 𝘛𝘦𝘤𝘩𝘯𝘰𝘭𝘰𝘨𝘺 𝘢𝘯𝘥 𝘔𝘶𝘭𝘵𝘪𝘮𝘰𝘥𝘢𝘭 𝘐𝘯𝘷𝘦𝘴𝘵𝘪𝘨𝘢𝘵𝘪𝘰𝘯𝘴 𝘧𝘰𝘳 𝘊𝘺𝘣𝘦𝘳 𝘚𝘦𝘤𝘶𝘳𝘪𝘵𝘺
- Voice-based security: Analysis of voice recordings or transcripts for security threats
- Detection of machine generated language in multimodal context (text, speech and gesture)
- NLP and biometrics in multimodal context
- 𝘋𝘢𝘵𝘢 𝘢𝘯𝘥 𝘚𝘰𝘧𝘵𝘸𝘢𝘳𝘦 𝘚𝘦𝘤𝘶𝘳𝘪𝘵𝘺
- Cryptography
- Digital forensics
- Malware detection, obfuscation
- Models for documentation
- NLP for data privacy and leakage prevention (DLP)
- Addressing dataset “poisoning” attacks
- 𝘏𝘶𝘮𝘢𝘯-𝘊𝘦𝘯𝘵𝘳𝘪𝘤 𝘚𝘦𝘤𝘶𝘳𝘪𝘵𝘺 𝘢𝘯𝘥 𝘚𝘶𝘱𝘱𝘰𝘳𝘵
- Natural language understanding for chatbots: NLP-powered chatbots for user support and security incident reporting
- User behaviour analysis: analysing user-generated text data (e.g., chat logs and emails) to detect insider threats or unusual behaviour
- Human supervision of technology for Cyber Security
- 𝘈𝘯𝘰𝘮𝘢𝘭𝘺 𝘋𝘦𝘵𝘦𝘤𝘵𝘪𝘰𝘯 𝘢𝘯𝘥 𝘛𝘩𝘳𝘦𝘢𝘵 𝘐𝘯𝘵𝘦𝘭𝘭𝘪𝘨𝘦𝘯𝘤𝘦
- Text-Based Anomaly Detection
- Identification of unusual or suspicious patterns in logs, incident reports or other textual data
- Detecting deviations from normal behaviour in system logs or network traffic
- Threat Intelligence Analysis
- Processing and analysing threat intelligence reports, news, articles and blogs on latest Cyber Security threats
- Extracting key information and indicators of compromise (IoCs) from unstructured text
- 𝘚𝘺𝘴𝘵𝘦𝘮𝘴 𝘢𝘯𝘥 𝘐𝘯𝘧𝘳𝘢𝘴𝘵𝘳𝘶𝘤𝘵𝘶𝘳𝘦 𝘚𝘦𝘤𝘶𝘳𝘪𝘵𝘺
- Systems Security
- Anti-reverse engineering for protecting privacy and anonymity
- Identification and mitigation of side-channel attacks
- Authentication and access control
- Enterprise-level mitigation
- NLP for software vulnerability detection
- Malware Detection through Code Analysis
- Analysing code and scripts for malware
- Detection using NLP to identify patterns indicative of malicious code
- 𝘍𝘪𝘯𝘢𝘯𝘤𝘪𝘢𝘭 𝘊𝘺𝘣𝘦𝘳 𝘚𝘦𝘤𝘶𝘳𝘪𝘵𝘺
- Financial fraud detection
- Financial risk detection
- Algorithmic trading security
- Secure online banking
- Risk management in finance
- Financial text analytics
- 𝘌𝘵𝘩𝘪𝘤𝘴, 𝘉𝘪𝘢𝘴, 𝘢𝘯𝘥 𝘓𝘦𝘨𝘪𝘴𝘭𝘢𝘵𝘪𝘰𝘯 𝘪𝘯 𝘊𝘺𝘣𝘦𝘳 𝘚𝘦𝘤𝘶𝘳𝘪𝘵𝘺
- Ethical and Legal Issues
- Digital privacy and identity management
- The ethics of NLP and speech technology
- Explainability of NLP and speech technology tools
- Legislation against malicious use of AI
- Regulatory issues
- Bias and Security
- Bias in Large Language Models (LLMs)
- Bias in security related datasets and annotations
- 𝘋𝘢𝘵𝘢𝘴𝘦𝘵𝘴 𝘢𝘯𝘥 𝘙𝘦𝘴𝘰𝘶𝘳𝘤𝘦𝘴 𝘧𝘰𝘳 𝘊𝘺𝘣𝘦𝘳 𝘚𝘦𝘤𝘶𝘳𝘪𝘵𝘺 𝘈𝘱𝘱𝘭𝘪𝘤𝘢𝘵𝘪𝘰𝘯𝘴
- 𝘚𝘱𝘦𝘤𝘪𝘢𝘭𝘪𝘴𝘦𝘥 𝘚𝘦𝘤𝘶𝘳𝘪𝘵𝘺 𝘈𝘱𝘱𝘭𝘪𝘤𝘢𝘵𝘪𝘰𝘯𝘴 𝘢𝘯𝘥 𝘖𝘱𝘦𝘯 𝘛𝘰𝘱𝘪𝘤s
- Intelligence applications
- Emerging and innovative applications in Cyber Security
𝘚𝘱𝘦𝘤𝘪𝘢𝘭 𝘛𝘩𝘦𝘮𝘦 𝘛𝘳𝘢𝘤𝘬 - 𝘍𝘶𝘵𝘶𝘳𝘦 𝘰𝘧 𝘊𝘺𝘣𝘦𝘳 𝘚𝘦𝘤𝘶𝘳𝘪𝘵𝘺 𝘪𝘯 𝘵𝘩𝘦 𝘌𝘳𝘢 𝘰𝘧 𝘓𝘓𝘔𝘴 𝘢𝘯𝘥 𝘎𝘦𝘯𝘦𝘳𝘢𝘵𝘪𝘷𝘦 𝘈𝘐
NLPAICS 2026 will feature a special theme track with the goal of stimulating discussion around Large Language Models (LLMs), Generative AI and ensuring their safety. The latest generation of LLMs, such as CHATGPT, Gemini, DeepSeek, LLAMA and open-source alternatives, has showcased remarkable advancements in text and image understanding and generation. However, as we navigate through uncharted territory, it becomes imperative to address the challenges associated with employing these models in everyday tasks, focusing on aspects such as fairness, ethics, and responsibility. The theme track invites studies on how to ensure the safety of LLMs in various tasks and applications and what this means for the future of the field. The possible topics of discussion include (but are not limited to) the following:
• Detection of LLM-generated language in multimodal context (text, speech and gesture)
• LLMs for forensic linguistics
• Bias in LLMs
• Safety benchmarks for LLMs
• Legislation against malicious use of LLMs
• Tools to evaluate safety in LLMs
• Methods to enhance the robustness of language models
𝗦𝘂𝗯𝗺𝗶𝘀𝘀𝗶𝗼𝗻𝘀 𝗮𝗻𝗱 𝗣𝘂𝗯𝗹𝗶𝗰𝗮𝘁𝗶𝗼𝗻
NLPAICS welcomes high-quality submissions in English, which can take two forms:
• Regular long papers: These can be up to eight (8) pages long, presenting substantial, original, completed, and unpublished work.
• Short (poster) papers: These can be up to four (4) pages long and are suitable for describing small, focused contributions, ongoing research, negative results, system demonstrations, etc. Short papers will be presented as part of a poster session.
The conference will not consider and evaluate abstracts only.
Accepted papers, including both long and short papers, will be published as e-proceedings with ISBN will available online on the conference website at the time of the conference and are expected to be uploaded into the ACL Anthology.
Further details on the submission procedure will be made available in the Second Call for Papers due in October 2025.
The conference will feature a student workshop and awards will be offered to the authors of best papers.
𝗜𝗺𝗽𝗼𝗿𝘁𝗮𝗻𝘁 𝗱𝗮𝘁𝗲𝘀
• Submissions due: 16 March 2026
• Reviewing process: 1 April – 30 April 2026
• Notification of acceptance: 5 May 2026
• Camera-ready due: 19 May 2026
• Conference camera-ready proceedings ready 1 June 2026
• Conference: 11-12 June 2026
𝗢𝗿𝗴𝗮𝗻𝗶𝘀𝗮𝘁𝗶𝗼𝗻
𝙲̲𝚘̲𝚗̲𝚏̲𝚎̲𝚛̲𝚎̲𝚗̲𝚌̲𝚎̲ ̲𝙲̲𝚑̲𝚊̲𝚒̲𝚛̲𝚜̲ ̲
Ruslan Mitkov (University of Alicante)
Rafael Muñoz (University of Alicante)
𝙿̲𝚛̲𝚘̲𝚐̲𝚛̲𝚊̲𝚖̲𝚖̲𝚎̲ ̲𝙲̲𝚘̲𝚖̲𝚖̲𝚒̲𝚝̲𝚝̲𝚎̲𝚎̲ ̲𝙲̲𝚑̲𝚊̲𝚒̲𝚛̲𝚜̲
Elena Lloret (University of Alicante)
Tharindu Ranasinghe (Lancaster University)
𝙿̲𝚞̲𝚋̲𝚕̲𝚒̲𝚌̲𝚊̲𝚝̲𝚒̲𝚘̲𝚗̲ ̲𝙲̲𝚑̲𝚊̲𝚒̲𝚛̲
Ernesto Estevanell (University of Alicante)
𝚂̲𝚙̲𝚘̲𝚗̲𝚜̲𝚘̲𝚛̲𝚜̲𝚑̲𝚒̲𝚙̲ ̲𝙲̲𝚑̲𝚊̲𝚒̲𝚛̲
Andres Montoyo (University of Alicante)
𝚂̲𝚝̲𝚞̲𝚍̲𝚎̲𝚗̲𝚝̲ ̲𝚆̲𝚘̲𝚛̲𝚔̲𝚜̲𝚑̲𝚘̲𝚙̲ ̲𝙲̲𝚑̲𝚊̲𝚒̲𝚛̲
Salima Lamsiyah (University of Luxembourg)
𝙱̲𝚎̲𝚜̲𝚝̲ ̲𝙿̲𝚊̲𝚙̲𝚎̲𝚛̲ ̲𝙰̲𝚠̲𝚊̲𝚛̲𝚍̲ ̲𝙲̲𝚑̲𝚊̲𝚒̲𝚛̲
Saad Ezzini (King Fahd University of Petroleum & Minerals)
𝙿̲𝚞̲𝚋̲𝚕̲𝚒̲𝚌̲𝚒̲𝚝̲𝚢̲ ̲𝙲̲𝚑̲𝚊̲𝚒̲𝚛̲
Beatriz Botella (University of Alicante)
𝚂̲𝚘̲𝚌̲𝚒̲𝚊̲𝚕̲ ̲𝙿̲𝚛̲𝚘̲𝚐̲𝚛̲𝚊̲𝚖̲𝚖̲𝚎̲ ̲𝙲̲𝚑̲𝚊̲𝚒̲𝚛̲
Alba Bonet (University of Alicante)
𝗩𝗲𝗻𝘂𝗲
The Second International Conference on Natural Language Processing and Artificial Intelligence for Cyber Security (NLPAICS’2026) will take place at the University of Alicante and is organised by the University of Alicante GPLSI research group.
Further information and contact details
The follow-up calls will list keynote speakers and members of the programme committee once confirmed.
The conference website is https://nlpaics2026.gplsi.es/ and will be updated on a regular basis. For further information, please email nlpaics2026(a)dlsi.ua.es
Registration will open in February 2026.
Best Regards
Tharindu Ranasinghe
Dr Tharindu Ranasinghe | Lecturer in Security and Protection Science
School of Computing and Communications | Lancaster University
Contact me on Teams<https://teams.microsoft.com/l/chat/0/0?users=t.ranasinghe@lancaster.ac.uk>
www.lancaster.ac.uk<https://www.lancaster.ac.uk/>
ALPS 2026: The sixth Advanced Language Processing School
Dates: Sunday 29th March to Friday 3rd April, 2026
Location: Aussois, French Alps
Website: https://lig-alps.imag.fr/
Application: https://lig-alps.imag.fr/index.php/application/
Previous year website (for reference, e.g. program): http://alps-2025.imag.fr/
About ALPS
ALPS is co-organized by LIG (Univ. Grenoble Alpes), Naver Labs Europe, and Cohere, and consists in a week-long series of lectures by world-class NLP researchers, sessions where participants present their work, social sessions, lab sessions, and outdoor activities in the mountains. The school will take place in the Alps (the Vanoise massif) at Aussois (1500m) and close to France’s first national park.
What ALPS represents:
Advanced lectures by first class researchers.
An atmosphere that fosters connections and interactions.
A poster session for attendees to present their work, gather feedback and brainstorm future work ideas..
Target audience
The intended audience of ALPS 2026 are graduate students (PhD students or advanced master students) in natural language processing or related fields. We also welcome other NLP practitioners working in industry or academia.
Application
See more details at https://lig-alps.imag.fr/index.php/application/
You will need a resumé (maximum length: 2 pages) and a cover letter. For the cover letter, it should be in English and explain your motivation in attending this advanced research school. We are unfortunately not capable of accepting all candidates, and this letter is a key component that helps us to ensure that the participant’s profiles are balanced and diverse.
Important dates:
Application deadline: 12th October 2025
Acceptance notification: 14th November 2025
Registration deadline: 15th January 2026
Winter School: 29th March to 3th April
Fees
The registration fees for the event encompass accommodation and full board at the conference venue, the Centre Paul Langevin.
Fees:
students: 700 euros
academic non student: 900 euros
industry & independents: 1300 euros
fee waiver recipients: 0 euro
If you have any questions, please contact us at the email specified at the following page: https://lig-alps.imag.fr/index.php/organizers/
Call for Participation and late breaking submissions
DHASA Conference and RAIL workshop 2025
https://dh2025.digitalhumanities.org.zahttps://sadilar.org/en/rail-2025/
Late breaking submissions deadline: 10 October 2025
DHASA conference dates: 11 November 2025-14 November 2025
RAIL workshop date: 10 November 2025
Conference venue: CSIR ICC, Pretoria, South Africa
Registration: https://dh2025.digitalhumanities.org.za/registration/
Late breaking submission Guidelines
* Late breaking submissions: Authors can submit a late breaking
submission, limited to 1 page. Late breaking submissions accepted for
the conference will be presented as a short presentation during a
dedicated late breaking submission presentation slot. The late breaking
submissions will be published in a book of abstracts before the
conference.
We particularly encourage student submissions where the first author is
a student.
All submissions should adhere to the ACL style guide:
https://acl-org.github.io/ACLPUB/formatting.html
Submissions should be submitted in PDF format. Submissions that do not
adhere to the prescribed style guide will be rejected.
Follow this link to go to the submission platform:
https://dh2025.digitalhumanities.org.za/submission/
Authors are encouraged to upload their datasets to the SADiLaR
repository: https://repo.sadilar.org/. In case of difficulties
uploading the datasets, please reach out to Benito Trollip
(benito.trollip(a)nwu.ac.za).
Important dates for late breaking submissions
Submission deadline: 10 October 2025
Date of notification: 17 October 2025
Camera-ready copy deadline: 24 October 2025
Conference: 10 November 2025 – 14 November 2025
Conference venue: CSIR ICC, Pretoria, South Africa
DHASA CONFERENCE
Theme: The role of humanities in digital humanities and artificial
intelligence
The Digital Humanities Association of Southern Africa (DHASA) is
pleased to announce its fifth conference, focusing on the theme The
role of humanities in digital humanities and artificial intelligence.
In a region where the field of Digital Humanities is still relatively
underdeveloped, this conference aims to address this gap and foster
growth and collaboration in the field. The conference offers an
opportunity for researchers interested in showcasing their work in the
broad field of Digital Humanities to come together. By doing so, the
conference provides a comprehensive overview of the current state-of-
the-art in Digital Humanities, particularly within the Southern Africa
region. As such, we welcome submissions related to Digital Humanities
research conducted by individuals from Southern Africa or research
focused on the geographical area of Southern Africa in the broad sense.
Furthermore, the conference serves as a platform for information
sharing and networking among researchers passionate about Digital
Humanities. By bringing together experts working on Digital Humanities
in Southern Africa or with a focus on Southern Africa, we aim to
promote collaboration and facilitate further research in this dynamic
field. In addition to the main conference, affiliated workshops and
tutorials will be organised, providing researchers with valuable
insights into novel technologies and tools. These supplementary events
are designed for researchers interested in specific aspects of Digital
Humanities or seeking practical information to enter or advance their
knowledge in the field.
The DHASA conference welcomes interdisciplinary contributions from
researchers in various domains of Digital Humanities, including, but
not limited to, language, literature, visual art, performance and
theatre studies, media studies, music, history, sociology, psychology,
language technologies, library studies, philosophy, methodologies,
software and computation, AI, and more. Our goal is to cultivate an
inclusive scientific community of practice within Digital Humanities.
RAIL WORKSHOP
Theme: Language resources in the age of large language models
The sixth Resources for African Indigenous Languages (RAIL) workshop
will be co-located with the Digital Humanities Association of Southern
Africa (DHASA) 2025 conference at the CSIR International Convention
Centre in Pretoria, South Africa, on 10 November 2025. The RAIL
workshop is an interdisciplinary platform for researchers working on
African indigenous languages resources such as natural languages
processing (NLP) tools, Human Language Technologies (HLT), data
collections, and annotations. This workshop aims to foster a scientific
community of practice that focuses on computational linguistic tools
and data that are designed for or applied to the indigenous languages
of Africa.
Many African languages are under-resourced while only a few are
considered to be somewhat better resourced. These languages often share
interesting properties such as writing systems, making them different
from most high-resourced languages. From a computational perspective,
these languages lack enough corpora to undertake high level development
of NLP and HLT tools, which in turn impedes the development of African
languages in these areas. During previous workshops, it was noted that
the problems and solutions presented were not only applicable to
African languages but were also relevant to many other low-resource
languages across the world. Because these languages share similar
challenges, this workshop provides researchers with opportunities to
work collaboratively on issues of language resource development and
learn from each other.
The RAIL workshop has several aims. First, the workshop brings together
researchers who work on African indigenous languages, forming a
community of practice for people working on indigenous languages.
Second, the workshop aims to reveal currently unknown or unpublished
existing resources (corpora, NLP tools, and applications), resulting in
a better overview of the current state-of-the-art, and also allows for
discussions on novel, desired resources for future research in this
area. Third, it enhances sharing of knowledge on the development of
low-resource languages. Finally, it enables discussions on how to
improve the quality as well as availability of the resources.
Organising Committees
DHASA conference
Aby Louw, Council for Scientific and Industrial Research
Franco Mak, Council for Scientific and Industrial Research
Franziska Pannach, Rijksuniversiteit Groningen
Ilana Wilken, Council for Scientific and Industrial Research
Johannes Sibeko, Nelson Mandela University
Juan Steyn, South African Centre for Digital Language Resources
Laurette Marais, Council for Scientific and Industrial Research
Marissa Griesel, South African Centre for Digital Language Resources
Menno van Zaanen, South African Centre for Digital Language Resources
Privolin Naidoo, Council for Scientific and Industrial Research
Sthembiso Mkhwanazi, Council for Scientific and Industrial Research
RAIL workshop
Rooweither Mabuya, South African Centre for Digital Language Resources
Muzi Matfunjwa, South African Centre for Digital Language Resources
Mmasibidi Setaka, South African Centre for Digital Language Resources
Menno van Zaanen, South African Centre for Digital Language Resources
--
Prof Menno van Zaanen menno.vanzaanen(a)nwu.ac.za
Professor in Digital Humanities
South African Centre for Digital Language Resources
https://www.sadilar.org
________________________________
NWU PRIVACY STATEMENT:
http://www.nwu.ac.za/it/gov-man/disclaimer.html
DISCLAIMER: This e-mail message and attachments thereto are intended solely for the recipient(s) and may contain confidential and privileged information. Any unauthorised review, use, disclosure, or distribution is prohibited. If you have received the e-mail by mistake, please contact the sender or reply e-mail and delete the e-mail and its attachments (where appropriate) from your system.
________________________________
𝗝𝗼𝘂𝗿𝗻𝗮𝗹 𝗡𝗮𝘁𝘂𝗿𝗮𝗹 𝗟𝗮𝗻𝗴𝘂𝗮𝗴𝗲 𝗣𝗿𝗼𝗰𝗲𝘀𝘀𝗶𝗻𝗴 - 𝗦𝗽𝗲𝗰𝗶𝗮𝗹 𝗜𝘀𝘀𝘂𝗲 𝗼𝗻 𝗟𝗮𝗻𝗴𝘂𝗮𝗴𝗲 𝗠𝗼𝗱𝗲𝗹𝘀 𝗳𝗼𝗿 𝗟𝗼𝘄-𝗥𝗲𝘀𝗼𝘂𝗿𝗰𝗲 𝗟𝗮𝗻𝗴𝘂𝗮𝗴𝗲𝘀
URL - https://loreslm.github.io/specialissue
Neural language models have revolutionised natural language processing (NLP) and have provided state-of-the-art results for many tasks. However, their effectiveness is largely dependent on the pre-training resources. Therefore, language models (LMs) often struggle with low-resource languages in both training and evaluation. Recently, there has been a growing trend in developing and adopting LMs for low-resource languages. This special issue aims to provide a forum for researchers to share and discuss their ongoing work on LMs for low-resource languages.
𝗧𝗼𝗽𝗶𝗰𝘀
We invite submissions on a broad range of topics related to the development and evaluation of neural language models for low-resource languages, including but not limited to the following.
- Building language models for low-resource languages.
- Adapting/extending existing language models/large language models for low-resource languages.
- Corpora creation and curation technologies for training language models/large language models for low-resource languages.
- Benchmarks to evaluate language models/large language models in low-resource languages.
- Prompting/in-context learning strategies for low-resource languages with large language models.
- Review of available corpora to train/fine-tune language models/large language models for low-resource languages.
- Multilingual/cross-lingual language models/large language models for low-resource languages.
- Applications of language models/large language models for low-resource languages (i.e. machine translation, chatbots, content moderation, etc.)
𝗜𝗺𝗽𝗼𝗿𝘁𝗮𝗻𝘁 𝗗𝗮𝘁𝗲𝘀
Paper submission: December 31, 2025
First decision: March 31, 2026- April 30, 2026
Revised version submission: May 1, 2026- June 1, 2026
Final decision: August 30, 2026
𝗦𝘂𝗯𝗺𝗶𝘀𝘀𝗶𝗼𝗻
Submissions should be formatted according to the journal guidelines available - https://www.cambridge.org/core/journals/natural-language-processing/informa… and submitted through the manuscript submission system - https://mc.manuscriptcentral.com/nlp. To ensure your manuscript is considered for this special issue, please select “Language Models for Low-Resource Languages” under Special Issue Designation when uploading your manuscript.
Guest Editors
Hansi Hettiarachchi, Lancaster University, UK
Tharindu Ranasinghe, Lancaster University, UK
Paul Rayson, Lancaster University, UK
Ruslan Mitkov, Lancaster University, UK
Mohamed Gaber, Queensland University of Technology, Australia
Guest Editorial Board
Gábor Bella - IMT Atlantique, France
Ana-Maria Bucur - University of Bucharest, Romania
Çağrı Çöltekin - University of Tübingen, Germany
Vera Danilova - Uppsala University, Sweden
Ona de Gibert - University of Helsinki, Finland
Ignatius Ezeani - Lancaster University, UK
Amal Htait - Aston University, UK
Ali Hürriyetoğlu - Wageningen University & Research, Netherlands
Danka Jokic - University of Belgrade, Serbia
Diptesh Kanojia - University of Surrey, UK
Taro Watanabe - Nara Institute of Science and Technology, Japan
Muhidin Mohamed - Aston University, UK
Alistair Plum - University of Luxembourg, Luxembourg
Damith Premasiri - Lancaster University, UK
Guokan Shang - Mohamed bin Zayed University of Artificial Intelligence, France
Ravi Shekhar - University of Essex, UK
Best Regards
Tharindu Ranasinghe on behalf of the Guest Editors