****Call for Papers WASP @ IJCNLP-AACL 2025****
*https://ui.adsabs.harvard.edu/WIESP/2025/
<https://ui.adsabs.harvard.edu/WIESP/2025/>*
Building on the success of the First Workshop on Information Extraction
from Scientific Publications (WIESP) at AACL-IJCNLP 2022 and the Second
WIESP at IJCNLP-AACL 2023, the Third Workshop on Artificial intelligence
for Scientific Publications (WASP) at IJCNLP-AACL 2025 aims to establish
itself as a pivotal platform for promoting discussions and research in the
field of Natural Language Processing (NLP) and Artificial Intelligence
(AI). This gathering will bring together esteemed experts and renowned
organizations with students and early-career researchers who are interested
and invested in efforts to extract and mine the worldโs scientific
knowledge from research papers. Their collaboration will be focused on
developing advanced algorithms, models, and tools that will lay the
foundation for future machine comprehension of scientific literature. The
third iteration of WASP will specifically concentrate on various topics
related to Artificial Intelligence research for scientific publications:
****Topics (not limited to)****
- *Scientific document parsing and structured information extraction*
- *Scientific named-entity recognition and concept identification*
- *Citation context/span extraction and citation-based knowledge mining*
- *Argument extraction and scientific discourse analysis*
- *Scientific article summarization and headline generation*
- *Question-answering and fact retrieval from scientific literature*
- *Prompt engineering and retrieval-augmented generation (RAG) for
science Q&A*
- *Chain-of-thought reasoning and scientific problem-solving with LLMs*
- *LLM-powered information extraction from scientific texts*
- *Pretraining and fine-tuning LLMs on scientific corpora*
- *Evaluation and alignment of LLMs for scientific understanding*
- *AI-assisted scientific discovery and hypothesis generation*
- *Ethical and responsible use of LLMs in scientific publishing*
- *Large Language Reasoning Models for Scientific Discovery*
- *LLM hallucinations and impact on scientific knowledge, publications*
- *Challenges, Future of AI in Scientific Publishing*
- *AI, Peer Review, and Scientific Publishing*
- *Impact of Generative AI on Scientific Publishing*
*In addition to papers, WASP will also host a shared task. *
****Telescope Reference and Astronomy Categorization Shared (TRACS)****
https://ui.adsabs.harvard.edu/WIESP/2025/shared_task
We will publish a separate CfP on the shared task. Shared task authors will
be invited to write their system descriptions, which will then undergo
light peer review.
All accepted papers and shared task system papers will be published in the
WASP proceedings as part of IJCNLP-AACL 2025 and indexed in the ACL
Anthology.
****Important Dates****
- *NEW Paper submission deadline (WASP): October 13, 2025*
- *ARR commitment deadline: October 27, 2025*
- Notification of paper acceptance (WASP+TRACS): November 3, 2025
- Camera-ready submission deadline (WASP+TRACS): November 11, 2025
- Workshop: December 23, 2025 (hybrid)
- All submission deadlines are 11.59 pm UTC -12h (โAnywhere on Earthโ)
****Paper Submission Site****
*https://openreview.net/group?id=aclweb.org/AACL-IJCNLP/2025/Workshop/WASP
<https://openreview.net/group?id=aclweb.org/AACL-IJCNLP/2025/Workshop/WASP>*
Submission will be via OpenReview. Submissions should follow the ACLPUB
formatting guidelines and use the provided template files. Paper formatting
guidelines - ACLPUB <https://acl-org.github.io/ACLPUB/formatting.html>
Submissions (Long and Short Papers) will be subject to a double-blind
peer-review process. We follow the same policies as IJCNLP-AACL 2025
regarding anonymity, preprints, and double submissions.
Please reach out to the organizers (cc'ed) for any queries.
Thank you!
--
+++++++++++++++++++++++++++++++++++
Dr. Tirthankar Ghosal <https://www.tirthankarghosal.com/>
Scientist (NLP/AI and HPC)
<https://www.ornl.gov/staff-profile/tirthankar-ghosal>
National Center for Computational Sciences (NCCS)
Oak Ridge National Laboratory, United States
&
Affiliate Faculty (NLP/AI)
<https://bredesencenter.utk.edu/faculty/tirthankar-ghosal/>
University of Tennessee Knoxville
United States
++++++++++++++++++++++++++++++++++++
๐ฆ๐ฒ๐ฐ๐ผ๐ป๐ฑ ๐๐ป๐๐ฒ๐ฟ๐ป๐ฎ๐๐ถ๐ผ๐ป๐ฎ๐น ๐๐ผ๐ป๐ณ๐ฒ๐ฟ๐ฒ๐ป๐ฐ๐ฒ ๐ผ๐ป ๐ก๐ฎ๐๐๐ฟ๐ฎ๐น ๐๐ฎ๐ป๐ด๐๐ฎ๐ด๐ฒ ๐ฃ๐ฟ๐ผ๐ฐ๐ฒ๐๐๐ถ๐ป๐ด ๐ฎ๐ป๐ฑ ๐๐ฟ๐๐ถ๐ณ๐ถ๐ฐ๐ถ๐ฎ๐น ๐๐ป๐๐ฒ๐น๐น๐ถ๐ด๐ฒ๐ป๐ฐ๐ฒ ๐ณ๐ผ๐ฟ ๐๐๐ฏ๐ฒ๐ฟ ๐ฆ๐ฒ๐ฐ๐๐ฟ๐ถ๐๐ (๐ก๐๐ฃ๐๐๐๐ฆโ๐ฎ๐ฌ๐ฎ๐ฒ)
University of Alicante, Alicante, Spain
11 and 12 June 2026
https://nlpaics2026.gplsi.es/
๐๐ถ๐ฟ๐๐ ๐๐ฎ๐น๐น ๐ณ๐ผ๐ฟ ๐ฃ๐ฎ๐ฝ๐ฒ๐ฟ๐
Recent advances in Natural Language Processing (NLP), Deep Learning and Large Language Models (LLMs) have resulted in improved performance of applications. In particular, there has been a growing interest in employing AI methods in different Cyber Security applications.
In today's digital world, Cyber Security has emerged as a heightened priority for both individual users and organisations. As the volume of online information grows exponentially, traditional security approaches often struggle to identify and prevent evolving security threats. The inadequacy of conventional security frameworks highlights the need for innovative solutions that can effectively navigate the complex digital landscape for ensuring robust security. NLP and AI in Cyber Security have vast potential to significantly enhance threat detection and mitigation by fostering the development of advanced security systems for autonomous identification, assessment, and response to security threats in real-time. Recognising this challenge and the capabilities of NLP and AI approaches to fortify Cyber Security systems, the Second International Conference on Natural Language Processing (NLP) and Artificial Intelligence (AI) for Cyber Security (NLPAICSโ2026) continues the tradition from NLPAICSโ2024 to be a gathering place for researchers in NLP and AI methods for Cyber Security. We invite contributions that present the latest NLP and AI solutions for mitigating risks in processing digital information.
๐๐ผ๐ป๐ณ๐ฒ๐ฟ๐ฒ๐ป๐ฐ๐ฒ ๐๐ผ๐ฝ๐ถ๐ฐ๐
The conference invites submissions on a broad range of topics related to the employment of NLP and AI (and in general, language studies and models) for Cyber Security including but not limited to:
- ๐๐ฐ๐ค๐ช๐ฆ๐ต๐ข๐ญ ๐ข๐ฏ๐ฅ ๐๐ถ๐ฎ๐ข๐ฏ ๐๐ฆ๐ค๐ถ๐ณ๐ช๐ต๐บ ๐ข๐ฏ๐ฅ ๐๐ข๐ง๐ฆ๐ต๐บ
- Content Legitimacy and Quality
- Detection and mitigation of hate speech and offensive language
- Fake news, deepfakes, misinformation and disinformation
- Detection of machine generated language in multimodal context (text, speech and gesture)
- Trust and credibility of online information
- User Security and Safety
- Cyberbullying and identification of internet offenders
- Monitoring extremist fora
- Suicide prevention
- Clickbait and scam detection
- Fake profile detection in online social networks
- Technical Measures and Solutions
- Social engineering identification, phishing detection
- NLP for risk assessment
- Controlled languages for safe messages
- Prevention of malicious use of ai models
- Forensic linguistics
- Human Factors in Cyber Security
- ๐๐ฑ๐ฆ๐ฆ๐ค๐ฉ ๐๐ฆ๐ค๐ฉ๐ฏ๐ฐ๐ญ๐ฐ๐จ๐บ ๐ข๐ฏ๐ฅ ๐๐ถ๐ญ๐ต๐ช๐ฎ๐ฐ๐ฅ๐ข๐ญ ๐๐ฏ๐ท๐ฆ๐ด๐ต๐ช๐จ๐ข๐ต๐ช๐ฐ๐ฏ๐ด ๐ง๐ฐ๐ณ ๐๐บ๐ฃ๐ฆ๐ณ ๐๐ฆ๐ค๐ถ๐ณ๐ช๐ต๐บ
- Voice-based security: Analysis of voice recordings or transcripts for security threats
- Detection of machine generated language in multimodal context (text, speech and gesture)
- NLP and biometrics in multimodal context
- ๐๐ข๐ต๐ข ๐ข๐ฏ๐ฅ ๐๐ฐ๐ง๐ต๐ธ๐ข๐ณ๐ฆ ๐๐ฆ๐ค๐ถ๐ณ๐ช๐ต๐บ
- Cryptography
- Digital forensics
- Malware detection, obfuscation
- Models for documentation
- NLP for data privacy and leakage prevention (DLP)
- Addressing dataset โpoisoningโ attacks
- ๐๐ถ๐ฎ๐ข๐ฏ-๐๐ฆ๐ฏ๐ต๐ณ๐ช๐ค ๐๐ฆ๐ค๐ถ๐ณ๐ช๐ต๐บ ๐ข๐ฏ๐ฅ ๐๐ถ๐ฑ๐ฑ๐ฐ๐ณ๐ต
- Natural language understanding for chatbots: NLP-powered chatbots for user support and security incident reporting
- User behaviour analysis: analysing user-generated text data (e.g., chat logs and emails) to detect insider threats or unusual behaviour
- Human supervision of technology for Cyber Security
- ๐๐ฏ๐ฐ๐ฎ๐ข๐ญ๐บ ๐๐ฆ๐ต๐ฆ๐ค๐ต๐ช๐ฐ๐ฏ ๐ข๐ฏ๐ฅ ๐๐ฉ๐ณ๐ฆ๐ข๐ต ๐๐ฏ๐ต๐ฆ๐ญ๐ญ๐ช๐จ๐ฆ๐ฏ๐ค๐ฆ
- Text-Based Anomaly Detection
- Identification of unusual or suspicious patterns in logs, incident reports or other textual data
- Detecting deviations from normal behaviour in system logs or network traffic
- Threat Intelligence Analysis
- Processing and analysing threat intelligence reports, news, articles and blogs on latest Cyber Security threats
- Extracting key information and indicators of compromise (IoCs) from unstructured text
- ๐๐บ๐ด๐ต๐ฆ๐ฎ๐ด ๐ข๐ฏ๐ฅ ๐๐ฏ๐ง๐ณ๐ข๐ด๐ต๐ณ๐ถ๐ค๐ต๐ถ๐ณ๐ฆ ๐๐ฆ๐ค๐ถ๐ณ๐ช๐ต๐บ
- Systems Security
- Anti-reverse engineering for protecting privacy and anonymity
- Identification and mitigation of side-channel attacks
- Authentication and access control
- Enterprise-level mitigation
- NLP for software vulnerability detection
- Malware Detection through Code Analysis
- Analysing code and scripts for malware
- Detection using NLP to identify patterns indicative of malicious code
- ๐๐ช๐ฏ๐ข๐ฏ๐ค๐ช๐ข๐ญ ๐๐บ๐ฃ๐ฆ๐ณ ๐๐ฆ๐ค๐ถ๐ณ๐ช๐ต๐บ
- Financial fraud detection
- Financial risk detection
- Algorithmic trading security
- Secure online banking
- Risk management in finance
- Financial text analytics
- ๐๐ต๐ฉ๐ช๐ค๐ด, ๐๐ช๐ข๐ด, ๐ข๐ฏ๐ฅ ๐๐ฆ๐จ๐ช๐ด๐ญ๐ข๐ต๐ช๐ฐ๐ฏ ๐ช๐ฏ ๐๐บ๐ฃ๐ฆ๐ณ ๐๐ฆ๐ค๐ถ๐ณ๐ช๐ต๐บ
- Ethical and Legal Issues
- Digital privacy and identity management
- The ethics of NLP and speech technology
- Explainability of NLP and speech technology tools
- Legislation against malicious use of AI
- Regulatory issues
- Bias and Security
- Bias in Large Language Models (LLMs)
- Bias in security related datasets and annotations
- ๐๐ข๐ต๐ข๐ด๐ฆ๐ต๐ด ๐ข๐ฏ๐ฅ ๐๐ฆ๐ด๐ฐ๐ถ๐ณ๐ค๐ฆ๐ด ๐ง๐ฐ๐ณ ๐๐บ๐ฃ๐ฆ๐ณ ๐๐ฆ๐ค๐ถ๐ณ๐ช๐ต๐บ ๐๐ฑ๐ฑ๐ญ๐ช๐ค๐ข๐ต๐ช๐ฐ๐ฏ๐ด
- ๐๐ฑ๐ฆ๐ค๐ช๐ข๐ญ๐ช๐ด๐ฆ๐ฅ ๐๐ฆ๐ค๐ถ๐ณ๐ช๐ต๐บ ๐๐ฑ๐ฑ๐ญ๐ช๐ค๐ข๐ต๐ช๐ฐ๐ฏ๐ด ๐ข๐ฏ๐ฅ ๐๐ฑ๐ฆ๐ฏ ๐๐ฐ๐ฑ๐ช๐คs
- Intelligence applications
- Emerging and innovative applications in Cyber Security
๐๐ฑ๐ฆ๐ค๐ช๐ข๐ญ ๐๐ฉ๐ฆ๐ฎ๐ฆ ๐๐ณ๐ข๐ค๐ฌ - ๐๐ถ๐ต๐ถ๐ณ๐ฆ ๐ฐ๐ง ๐๐บ๐ฃ๐ฆ๐ณ ๐๐ฆ๐ค๐ถ๐ณ๐ช๐ต๐บ ๐ช๐ฏ ๐ต๐ฉ๐ฆ ๐๐ณ๐ข ๐ฐ๐ง ๐๐๐๐ด ๐ข๐ฏ๐ฅ ๐๐ฆ๐ฏ๐ฆ๐ณ๐ข๐ต๐ช๐ท๐ฆ ๐๐
NLPAICS 2026 will feature a special theme track with the goal of stimulating discussion around Large Language Models (LLMs), Generative AI and ensuring their safety. The latest generation of LLMs, such as CHATGPT, Gemini, DeepSeek, LLAMA and open-source alternatives, has showcased remarkable advancements in text and image understanding and generation. However, as we navigate through uncharted territory, it becomes imperative to address the challenges associated with employing these models in everyday tasks, focusing on aspects such as fairness, ethics, and responsibility. The theme track invites studies on how to ensure the safety of LLMs in various tasks and applications and what this means for the future of the field. The possible topics of discussion include (but are not limited to) the following:
โข Detection of LLM-generated language in multimodal context (text, speech and gesture)
โข LLMs for forensic linguistics
โข Bias in LLMs
โข Safety benchmarks for LLMs
โข Legislation against malicious use of LLMs
โข Tools to evaluate safety in LLMs
โข Methods to enhance the robustness of language models
๐ฆ๐๐ฏ๐บ๐ถ๐๐๐ถ๐ผ๐ป๐ ๐ฎ๐ป๐ฑ ๐ฃ๐๐ฏ๐น๐ถ๐ฐ๐ฎ๐๐ถ๐ผ๐ป
NLPAICS welcomes high-quality submissions in English, which can take two forms:
โข Regular long papers: These can be up to eight (8) pages long, presenting substantial, original, completed, and unpublished work.
โข Short (poster) papers: These can be up to four (4) pages long and are suitable for describing small, focused contributions, ongoing research, negative results, system demonstrations, etc. Short papers will be presented as part of a poster session.
The conference will not consider and evaluate abstracts only.
Accepted papers, including both long and short papers, will be published as e-proceedings with ISBN will available online on the conference website at the time of the conference and are expected to be uploaded into the ACL Anthology.
Further details on the submission procedure will be made available in the Second Call for Papers due in October 2025.
The conference will feature a student workshop and awards will be offered to the authors of best papers.
๐๐บ๐ฝ๐ผ๐ฟ๐๐ฎ๐ป๐ ๐ฑ๐ฎ๐๐ฒ๐
โข Submissions due: 16 March 2026
โข Reviewing process: 1 April โ 30 April 2026
โข Notification of acceptance: 5 May 2026
โข Camera-ready due: 19 May 2026
โข Conference camera-ready proceedings ready 1 June 2026
โข Conference: 11-12 June 2026
๐ข๐ฟ๐ด๐ฎ๐ป๐ถ๐๐ฎ๐๐ถ๐ผ๐ป
๐ฒฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒโฬฒ๐ฒฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒโฬฒ
Ruslan Mitkov (University of Alicante)
Rafael Muรฑoz (University of Alicante)
๐ฟฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒโฬฒ๐ฒฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒโฬฒ๐ฒฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ
Elena Lloret (University of Alicante)
Tharindu Ranasinghe (Lancaster University)
๐ฟฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒโฬฒ๐ฒฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ
Ernesto Estevanell (University of Alicante)
๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒโฬฒ๐ฒฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ
Andres Montoyo (University of Alicante)
๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒโฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒโฬฒ๐ฒฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ
Salima Lamsiyah (University of Luxembourg)
๐ฑฬฒ๐ฬฒ๐ฬฒ๐ฬฒโฬฒ๐ฟฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒโฬฒ๐ฐฬฒ๐ ฬฒ๐ฬฒ๐ฬฒ๐ฬฒโฬฒ๐ฒฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ
Saad Ezzini (King Fahd University of Petroleum & Minerals)
๐ฟฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ขฬฒโฬฒ๐ฒฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ
Beatriz Botella (University of Alicante)
๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒโฬฒ๐ฟฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒโฬฒ๐ฒฬฒ๐ฬฒ๐ฬฒ๐ฬฒ๐ฬฒ
Alba Bonet (University of Alicante)
๐ฉ๐ฒ๐ป๐๐ฒ
The Second International Conference on Natural Language Processing and Artificial Intelligence for Cyber Security (NLPAICSโ2026) will take place at the University of Alicante and is organised by the University of Alicante GPLSI research group.
Further information and contact details
The follow-up calls will list keynote speakers and members of the programme committee once confirmed.
The conference website is https://nlpaics2026.gplsi.es/ and will be updated on a regular basis. For further information, please email nlpaics2026(a)dlsi.ua.es
Registration will open in February 2026.
Best Regards
Tharindu Ranasinghe
Dr Tharindu Ranasinghe | Lecturer in Security and Protection Science
School of Computing and Communications | Lancaster University
Contact me on Teams<https://teams.microsoft.com/l/chat/0/0?users=t.ranasinghe@lancaster.ac.uk>
www.lancaster.ac.uk<https://www.lancaster.ac.uk/>
**3rdย CALL FOR PARTICIPATION**
Two peas in a pod:PARSEME 2.0 and AdMiRe 2.0
multilingual UniDive shared tasks
on idiomaticity and multiword expressions
https://unidive.lisn.upsaclay.fr/doku.php?id=other-events:parseme-admire-stโฆ
<https://unidive.lisn.upsaclay.fr/doku.php?id=other-events:parseme-admire-stโฆ>
Expression of interest:
https://forms.gle/rwSfUmNR1sTsHDfx6
<https://forms.gle/rwSfUmNR1sTsHDfx6>
====================================================================
The UniDive COST Action
<https://unidive.lisn.upsaclay.fr/>is happy to
announce ADMIRE 2 and the PARSEME 2.0 shared tasks
dedicated to detecting and interpreting
idiomaticity and multiword expressions(MWEs). MWEs
are groups of words that have non-compositional
semantics, i.e. their meanings cannot be
straightforwardly deduced from the meanings of
their components. For instance, a bad appleis a
person who has a bad influence on others.
Both shared tasks will take place together and we
hope to co-organise the workshop with SIGLEX-MWE
section <https://multiword.org/>and co-locate it
with EACL 2026 in Morocco(24-29 March 2026).
The participating teams are to submit the results
of their systems on CodaBench
<https://www.codabench.org/>. The submission links
will be published at the same time as the test data.
We are delighted to confirm that UniDive
<https://unidive.lisn.upsaclay.fr/>will provide
funding
<https://unidive.lisn.upsaclay.fr/doku.php?id=other-events:parseme-admire-stโฆ>for
selected system presenters.
Important dates
-----------------
*
[1 OCTOBER] Training data and baseline systems
released
*
[3 DECEMBER] Publication of test blind data
*
[8 DECEMBER] Submission of system predictions
*
[19 DECEMBER] Systems evaluated
*
[5 JANUARY] Submission deadline for system
description papers
*
[9-23 JANUARY] Reviewing period (system teams
will participate as reviewers)
*
[3 FEBRUARY] Submission deadline for
camera-ready papers
*
[24-29 MARCH 2026] EACL, including the MWE
workshop(to confirm)
PARSEME 2.0is a shared task whose main objective
is to identify and paraphrase multiword
expressions (MWEs) in written text. We propose two
subtasks: the first corresponds to the classical
identification task in running text. The second
consists in paraphrasing a sentence containing a
MWE, so as to remove idiomaticity. Data annotation
is finished and 17 languages are covered: Dutch,
Egyptian (ca. 2700-2000 BC), French, Georgian,
Greek (Ancient), Greek (Modern), Hebrew, Japanese,
Latvian, Persian, Polish, Portuguese (Brazilian),
Romanian, Serbian, Slovene, Swedish, and
Ukrainian. Subtask 1 is on MWEs identification and
Subtask 2 on paraphrasing MWEs.
AdMIRe 2.0 (Advancing Multimodal Idiomaticity
Representation) addresses the challenge of
multilingual and multimodal idiomatic language
understanding by evaluating how well models
interpret potentially idiomatic expressions (PIEs)
across languages and across modalities using both
text and images. This new edition extends the
AdMIRe 1 task
<https://arxiv.org/pdf/2503.15358>adding more
languages from the UNIDIVE network and beyond.
Given a context sentence containing a PIE and a
set of five images, the task is to rank the images
based on how accurately they depict the meaning of
the PIE used in that sentence. The task will be
zero-shot for newly introduced languages. While
the task is designed to encourage participation
from teams working on multilingual and multimodal
technologies, it also accommodates approaches
focused only on a subset of the languages and on a
single modality (text) with automatically
generated descriptive captions for each image,
allowing models to rely exclusively on text input
if desired.
Data
-----------------
*
Training data for AdMiRe 2.0:
https://semeval2025-task1.github.io/data/training/training_data.html
<https://semeval2025-task1.github.io/data/training/training_data.html>
*
Training data for PARSEME 2.0:
https://gitlab.com/parseme/sharedtask-data/-/tree/master/2.0
<https://gitlab.com/parseme/sharedtask-data/-/tree/master/2.0>
Organizing team
---------------
PARSEME 2.0:
*
Manon Scholivet, Universitรฉ Paris Saclay, LISN, FR
*
Takuya Nakamura, Universitรฉ Paris Saclay, LISN, FR
*
Agata Savary, Universitรฉ Paris Saclay, LISN, FR
*
รric Bilinski, Universitรฉ Paris Saclay, LISN, FR
*
Carlos Ramisch, Aix-Marseille Universitรฉ, LIS, FR
ADMIRE 2 :
*
Adriana Pagano
<https://secure-web.cisco.com/1YAGKjWKddhtqiA-9wwpTzBrRHqMWqraLLDCi63yoSQPHpโฆ>,
Universidade Federal de Minas Gerais, BR
*
Aline Villavicencio
<https://secure-web.cisco.com/17hRYtc48CxUTuQ_Lm5LvtIDhREp6JpTTNFu3smb4Yyjp1โฆ>,
University of Exeter, UK
*
Dilara Torunoฤlu Selamet
<https://secure-web.cisco.com/1U3Kz5oRS8032U7C3ikqTwLrLuHnRujaiXILauGPxfilqdโฆ>,
Istanbul Technical University, TR
*
Doฤukan Arslan
<https://secure-web.cisco.com/1iCwzaJ-FPO1KSa7r0luNlHcUTrCy6K9Wm8I3pk9d_-iG8โฆ>,
Istanbul Technical University, TR
*
Gรผlลen Eryiฤit
<https://secure-web.cisco.com/1nP_yCeZKo55vzyCSp6J79GtmP_8EODYLoOnic4AHIQmdVโฆ>,
Istanbul Technical University, TR
*
Rodrigo Wilkens
<https://secure-web.cisco.com/1zTIs9aO7VfKy_Sg1CYj8xiCPOZhZHkSPR2xYyMQE456pFโฆ>,
University of Exeter, UK
*
Tom Pickard
<https://secure-web.cisco.com/1AbiBJ6cGhN9SrjpkIlBYQeo08-8YJIDUds7Qfs3H5_KpLโฆ>,
University of Sheffield, UK
*
Wei He
<https://secure-web.cisco.com/1HrUa3BUU6pl9p4Ia2mMqmEqrPU834VAhFFDUvAV6PbjrPโฆ>,
University of Exeter, UK
Mozilla Data Collective (the new platform where Mozilla Common Voice
datasets, among other datasets, are hosted)
just kicked off a Shared Task on Spontaneous Speech ASR. It targets 21
underrepresented languages (from Africa, the Americas, Europe, and
Asia),
brand-new datasets, and prizes for the best systems in each task.
For more information, visit
https://community.mozilladatacollective.com/shared-task-mozilla-common-voicโฆ
Robert Pugh
Senior Community Manager
mozillafoundation.org
(UTC-7)
Monthly online ILFC Seminar: interactions between formal and computational
linguistics
https://gdr-lift.loria.fr/monthy-online-ilfc-seminar/
The LIFT 2 research group is happy to announce the forthcoming sessions of
the ILFC seminar on the interactions between formal and computational
linguistics.
The seminar is held on Zoom. To attend the seminar and get updates, please
subscribe to our mailing list (we now only rarely communicate through other
mailing lists): https://sympa.inria.fr/sympa/subscribe/seminaire_ilfc
- 2025/10/15 16:30-17:30 UTC+2: *Noga Zaslavsky* (New York University)
Title: *Cultural evolution of efficient semantic systems in humans and
AI*
Abstract: *Human languages efficiently compress meanings into words, but
how did our semantic systems evolve to be that way? Are AI systems capable
of evolving efficient semantic systems and representing meaning as we do?
In this talk, I address these open questions from cognitive, cultural, and
computational perspectives. First, I show that individual human learners
favor efficiently compressed semantic representations. This inductive
learning bias, when amplified via cultural transmission, drives the
evolution of near-optimally efficient semantic systems. Second, I consider
large language models (LLMs) and show that while they vary widely in their
semantic alignment with humans, they nevertheless exhibit a similar
tendency toward efficient compression: when simulating cultural evolution
with LLMs, they iteratively restructure initially random semantic systems
towards greater efficiency. Finally, I show that introducing an explicit
pressure for efficient compression, grounded in the information bottleneck
principle, enables multi-agent reinforcement learning systems to evolve
efficient, human-like semantic systems without any human supervision. Taken
together, these results demonstrate how humans and AI can evolve efficient
semantic systems through social interaction and cultural transmission, and
more broadly, they suggest that efficient compression may be a fundamental
principle of intelligence.*
- 2025/11/26 16:30-17:30 UTC+1: *Ece Takmaz* (Utrecht University)
Title: [TBA]
Abstract: [TBA]
- 2025/12/17 16:30-17:30 UTC+1: *Ethan Wilcox* (Georgetown University)
Title: [TBA]
Abstract: [TBA]
- 2026/01/21 16:30-17:30 UTC+1: *Gemma Boleda* (Universitat Pompeu Fabra)
Title: [TBA]
Abstract: [TBA]
- 2026/03/18 16:30-17:30 UTC+1: *Adele Goldberg* (Princeton University)
Title: [TBA]
Abstract: [TBA]
Dear all,
We are pleased to announce that the submission deadline for the
16th International Workshop on Spoken Dialogue Systems (IWSDS 2026) has been extended:
๐ Important Dates (Extended):
๐ Paper Submission Deadline: October 12 โ October 22, 2025
๐ Paper Update Deadline: October 18 โ October 28, 2025
โ Acceptance Notification: December 10, 2025
๐ค Workshop Dates: February 26 โ March 1, 2026
We invite submissions of long papers, short papers, position papers, industry track papers, and
demonstrations on a broad range of topics related to the Theoretical Foundations, Systems and Methods, and
Applications of spoken and multimodal dialogue systems.
Accepted papers will be included in the ACL Anthology.
This yearโs theme is:
๐ฏโHuman-Machine Dialogue in the Era of Multimodal Foundation Modelsโ
Location: Trento, Italy โ the gateway to the Dolomites, right after the Milano Cortina 2026 Winter Olympics
Website & CfP: https://sites.google.com/unitn.it/iwsds26/
Twitter/X: https://x.com/iwsdsmeeting
Bluesky: https://bsky.app/profile/iwsdsmeeting.bsky.social
---
Prof. Dr.-Ing. Giuseppe Riccardi
Founder and Director of the Signals and Interactive Systems Lab
Department of Computer Science and Information Engineering
University of Trento
Room D206, via Sommarive 5
38123 Povo di Trento, Italy
Home Page: http://disi.unitn.it/~riccardi/
๏ฟผ
October is back and so are HPLT datasets (we've been doing this for
three consecutive years now!)
This time, we announce the release of the massive HPLT v3.0 multilingual
dataset which can be considered a major upgrade for large-scale
multilingual corpora.
Accounting for 29 billion documents, 198 language-script combinations
and 112 trillion characters, v3.0 shows significant gains over v2,
driven by several improvements, including a new global deduplication
process:
- Unique content boosted from 52% to 73% on average.
- Data substance and robustness remains high with better extraction and
improved language identification.
- Shows increased variety and better representativeness of natural web
content.
This release provides a cleaner, more robust dataset for building
powerful LLMs and machine translation systems, including a myriad of
low- to medium-resourced languages. And we have not said our last word:
wait for more data soon because we are already working on it.
Special thanks to all the collaborators and funding bodies, including
the European Union's Horizon Europe program and UK Research and Innovation.
Explore the data and see the full analysis and evaluation highlights on
our website:
https://hplt-project.org/datasets/v3.0
--
Andrey
Language Technology Group (LTG)
University of Oslo
Dear colleagues,
We are sharing the following details about the online event Corpus linguistics & applied linguistics research 2025, hosted by the University of Murcia from 3 to 27 November 2025.
This yearโs talks will focus on the impact of AI on corpus linguistics.
Speakers:
* Dr Lisa Cheung, The University of Hong Kong, & Dr Peter Crosthwaite, The University of Queensland โ 3 Nov
* Prof Qing Ma, The Education University of Hong Kong โ 11 Nov
* Prof Laurence Anthony, Waseda University โ 25 Nov
* Prof Atsushi Mizumoto, Kansai University, Osaka โ 27 Nov
Info: https://www.um.es/languagecorpora/clresearch2025/
The talks will take place via ZOOM at 11:00 a.m. (Madrid) / 5:00 p.m. (Hong Kong).
Registration link: https://umurcia.zoom.us/webinar/register/WN_NAQ8nFTgSCO2obOFNNgc1A#/registrโฆ
For this edition, attendees who request it will receive a certificate of participation.
Corpus linguistics & applied linguistics research 2025 is organized by the University of Murcia research group E020-07 โLenguajes de especialidad, corpus lingรผรญsticos y lingรผรญstica inglesa aplicada a la ingenierรญa del conocimientoโ, with support from the Faculty of Arts, the Department of English Philology at the University of Murcia, and The Education University of Hong Kong.
Follow updates on X: @languagecorpora
Watch talks from previous editions: https://www.youtube.com/@corporaappliedlinguistics8358/playlists
Kind regards from the organizing committee,
Pascual Pรฉrez-Paredes
https://webs.um.es/pascualf
***Apologies for possible cross-posting ***
First Call for Papers: 6th International Workshop on Computational
Approaches to Historical Language Change (LChangeโ26)
Co-located with EACL 2026, Rabat, Morocco & Online | March 24โ29, 2026
๐ Website: https://www.changeiskey.org/event/2026-eacl-lchange/
๐ง Contact: lchange(a)changeiskey.org
== About the Workshop ==
The LChange workshop brings together researchers interested in
computational modeling of language change โ both historical and
synchronic. Following the success of LChange in 2019, 2021, 2022, 2023
and 2024, this sixth edition will be held as a hybrid half-day workshop
at EACL 2026 conference in Rabat.
We welcome contributions addressing all aspects of computational
approaches to language change. Our goal is to foster dialogue on
state-of-the-art computational methodologies, resources, and theories
that explore the dynamic, time-varying nature of language.
In addition to paper presentations and keynotes, we offer a mentorship
program for students to engage with experienced researchers, regardless
of whether they are submitting a paper or not.
== Important Dates (tentative) ==
Direct Submission deadline: December 19, 2025
Pre-reviewed (ARR) submission deadline: January 2, 2026
Notification of acceptance: January 23, 2026
Camera-ready paper due: February 3, 2026
Workshop dates: March 24-29, 2026
== Submission Information ==
We accept the following types of submissions:
- Long papers: up to 8 pages (+ references)
- Short papers: up to 4 pages (+ references). Dataset and model
release papers should be submitted as short papers.
Final versions will be given one additional page of content so that
reviewers' comments can be taken into account.
== Review Process ==
Papers must be submitted anonymously.
All submissions will undergo double-blind peer review by at least three
reviewers, with final acceptance decisions made by the workshop organizers.
Accepted papers will be published in the workshop proceedings and
presented orally or as posters.
Call for reviewers: If you have published in the field previously and
are interested in helping out in the program committee to review papers,
please email us at lchange(a)changeiskey.org!
== Topics of Interest ==
We invite original research papers on (but not limited to):
- Novel methods for detecting diachronic semantic change and lexical
replacement
- Automatic discovery and quantitative evaluation of laws of language
change
- Computational theories and generative models of language change
- Sense-aware (semantic) change analysis
- Diachronic word sense disambiguation
- Novel methods for diachronic analysis of low-resource languages
- Novel methods for diachronic linguistic data visualization
- Novel applications and implications of language change detection
- Quantification of sociocultural influences on language change
- Cross-linguistic, phylogenetic, and developmental approaches to
language change
- Novel datasets for cross-linguistic and diachronic analyses of language
== Organizers ==
Nina Tahmasebi, University of Gothenburg
Pierluigi Cassotti, University of Gothenburg
Syrielle Montariol, UC Berkeley, รcole polytechnique fรฉdรฉrale de Lausanne
Andrey Kutuzov, University of Oslo
Netta Huebscher, University of Gothenburg
Elena Spaziani, Sapienza University of Rome
Naomi Baes, University of Melbourne
--
Andrey
Language Technology Group (LTG)
University of Oslo
Dear Colleagues,
We are looking for a PhD student in Multimodal AI for Proactive Herd Health and Dairy Farm Management (CLรR Project) at the School of Electronics, Electrical Engineering and Computer Science, Queenโs University Belfast, UK. This fully funded PhD studentship is supported by SUSTAIN (https://www.sustain-cdt.ai/), the UKRI Centre for Doctoral Training in Sustainable Understandable agri-food Systems Transformed by Artificial Intelligence. SUSTAIN empowers the next generation of AI scientists to invent, develop, and deploy technologies co-created with growers and agri-food practitioners, facilitating meaningful partnerships between academia and industry.
The successful candidate will work with our multidisciplinary supervisory team and our industry partner, CattleEye, gaining access to global real-world data and industry expertise. This project offers a unique opportunity to develop a skill set in AI applied to globally significant sustainability challenges in dairy farming.
If you have a background in NLP, Data Science, ML/DL, or Computer Vision and are motivated to apply your expertise to real-world sustainability challenges in agriculture, this could be a fantastic opportunity for you.
[??] Location: Queenโs University Belfast, UK
[??] Eligibility: Open to students worldwide
[??] Funding: Fully funded PhD studentship (tuition fees, tax-free stipend, Research Training Support Grant, and additional development support)
[??] Deadline: Friday, October 17, 2025
[??] More details: https://www.findaphd.com/phds/project/project-q3138-cl-r-multimodal-ai-for-โฆ
For enquiries, contact m.hasanuzzaman(a)qub.ac.uk<mailto:m.hasanuzzaman@qub.ac.uk> with your CV.
Best,
M
[https://mail.google.com/mail/u/0?ui=2&ik=83b4a8df94&attid=0.1&permmsgid=msgโฆ]