Call for Papers: Second International Workshop on Construction Grammars and NLP (CxGs+NLP 2025)
Workshop Website: https://sites.google.com/view/2ndcxgsnlpworkshop/home
Please join the workshop’s Google Group for the latest updates and to post any questions you might have: https://groups.google.com/g/cxgsnlp-workshop
Overview
Constructionist approaches to language posit that all linguistic knowledge needed for language comprehension and production can be captured as a network of form-meaning mappings, called constructions. Construction Grammars (CxGs) do not distinguish between words and grammar rules, but allow for mappings between forms and meanings of arbitrary complexity and degree of abstraction. CxGs are thereby able to uniformly capture the compositional and non-compositional aspects of language use, making the theory particularly attractive to researchers in the field of Natural Language Processing (NLP). CxG theories, for example, can serve as a valuable ‘lens’ to assess and investigate the abilities of today’s large language models, which lack explicit, theoretically grounded linguistic insights. At the same time, techniques from the field of NLP are often employed for the further development and scaling of CxG theories and applications.
This workshop aims to bring together researchers across theory and practice from the two complementary perspectives of Construction Grammar and NLP to explore how CxG approaches can both inform and benefit from NLP methods, with an emphasis on LLMs. Therefore, we invite original research papers from a broad spectrum of topics, including but not limited to:
Contributions to Construction Grammar theory
Construction Grammar Formalisms
Computational Construction Grammar Implementations
Natural Language Understanding (NLU)
Opinion pieces on the interplay between Construction Grammar and NLP
Constructions and Language Models (Mechanistic interpretability, probing (e.g., BERTology), and evaluation of LLMs)
Resources: Constructicons and corpora annotated for Construction Grammar
Construction Grammar learning and adaptation
Applications at the intersection of Construction Grammar and NLP
Invited Speakers
Adele Goldberg, Professor of Psychology, Princeton University
Thomas Hoffmann, Professor of English Language and Linguistics, Catholic University of Eichstätt-Ingolstadt
Laura Michaelis, Professor of Linguistics, University of Colorado Boulder
Venue & Workshop Details
The 2nd CxGs+NLP workshop will be co-located with the 16th International Conference on Computational Semantics (IWCS), organized by the Heinrich Heine University (HHU) in Düsseldorf, Germany. The workshop will be a full day on 24 September 2025. Additionally, we will be hosting a community-building event in Düsseldorf on 25 September 2025, including panel discussions and breakout sessions on how to organize CxG community resources.
We are expecting the workshop to be in-person only, but are awaiting details on the possibility of a hybrid presentation option.
Important Dates
Jun 06: submission deadline
Aug 01: notification of acceptance, registration opens
Aug 22: camera-ready papers due
Sep 22-23: IWCS main conference
Sep 24: workshop
Sep 25: community-building event
Submission information
Two types of submission are solicited: long papers and short papers. Long papers should describe original research and must not exceed 8 pages. Short papers (typically system or project descriptions, or ongoing research) must not exceed 4 pages. Acknowledgments, references, a limitations section (optional), an ethics statement (optional), and a technical appendix (optional, not subject to reviewing) do not count towards the page limit.
Accepted papers get an extra page in the camera-ready version and will be published in the conference proceedings in the ACL Anthology. Additionally, non-archival publications will be considered for acceptance into the workshop as in-person poster presentations only.
CxGs+NLP 2 papers should be formatted following the common two-column structure as used by IWCS 2021 (borrowed from ACL 2021). Please use these specific style-files or the Overleaf template.
Style files: https://iwcs2021.github.io/download/iwcs2021-templates.zip
Overleaf template: https://www.overleaf.com/latex/templates/instructions-for-iwcs-2021-proceed…
Double submission policy: We will accept submissions that have been submitted elsewhere, but we require that authors notify us, state where else they are submitting, and let us know if the work is accepted for publication elsewhere.
Submission site TBA.
Instructions for Double-Blind Review
As reviewing will be double blind, papers must not include authors’ names and affiliations. Furthermore, self-references or links (such as GitHub) that reveal the authors’ identity, e.g., “We previously showed (Smith, 1991) …”, must be avoided. Instead, use citations such as “Smith previously showed (Smith, 1991) …”. Papers that do not conform to these requirements will be rejected without review. Papers should not refer, for further detail, to documents that are not available to the reviewers. For example, do not omit or redact important citation information to preserve anonymity. Instead, use third person or named reference to this work, as described above (“Smith showed” rather than “we showed”). If important citations are not available to reviewers (e.g., awaiting publication), these papers should be anonymised and included in the appendix. They can then be referenced from the submission without compromising anonymity. Papers may be accompanied by a resource (software and/or data) described in the paper, but these resources should also be anonymized.
Workshop Chairs
Claire Bonial (U.S. Army Research Lab)
Harish Tayyar Madabushi (The University of Bath)
Workshop Organizing Committee
Melissa Torgbi (The University of Bath)
Leonie Weissweiler (University of Texas at Austin)
Austin Blodgett (U.S. Army Research Lab)
Katrien Beuls (University of Namur, Belgium)
Paul Van Eecke (Vrije Universiteit Brussel, Belgium)
Contact: Please join the workshop’s Google Group for the latest updates and to post any questions you might have: https://groups.google.com/g/cxgsnlp-workshop
We are pleased to announce a brand new Model Compression track
<https://www2.statmt.org/wmt25/model-compression.html> at WMT 2025
<https://www2.statmt.org/wmt25/index.html>.
This shared task aims to evaluate the potential of model compression
techniques in reducing the size of general-purpose large language
models, with the goal of achieving an optimal balance between practical
deployability and high translation quality in specific machine translation
(MT) scenarios. The task’s broader objectives include fostering research
into efficient, accessible, and sustainable deployment of LLMs for MT,
establishing a common evaluation framework to monitor progress in model
compression across a wide range of languages, and enabling meaningful
comparisons with state-of-the-art MT systems through standardized
evaluation protocols aimed at assessing not only translation quality but
also efficiency.
Although the focus is on model compression, the task is closely aligned
with the General MT shared task
<https://www2.statmt.org/wmt25/translation-task.html>, sharing language
directions, test data, and protocols for automatic MT quality evaluation.
Additionally, the task follows the same timeline as the flagship WMT task.
We warmly invite participation from academic teams and industry players
interested in applying existing compression methods to MT or exploring
innovative, cutting-edge approaches.
THE TASK IN A NUTSHELL
- Goal: Reduce the size of a general-purpose LLM while maintaining a
  balance between model compactness and MT performance.
- Languages: The first round will focus on the same language pairs as the
  General MT track.
- Conditions:
  - Constrained: Participants work within a predefined model and language
    setting for directly comparable results.
  - Unconstrained: Participants are free to compress any model across
    language directions of their choice.
- Evaluation Criteria:
  - Translation quality: Automatically measured using the LLM-as-a-judge
    framework from the General MT task
  - Model size: Defined by the memory usage
  - Inference speed: Measured by total processing time over the test set
IMPORTANT DATES
- Test data released: 26th June 2025
- Translation submission deadline: 3rd July 2025
- System description abstract paper: 10th July 2025
- System description submission: 14th August 2025
WEBSITE: https://www2.statmt.org/wmt25/model-compression.html
ORGANIZERS:
- Marco Gaido, Fondazione Bruno Kessler
- Matteo Negri, Fondazione Bruno Kessler
- Roman Grundkiewicz, Microsoft Translator
- TG Gowda, Microsoft Translator
CONTACTS:
- Marco Gaido: mgaido(a)fbk.eu
- Matteo Negri: negri(a)fbk.eu
Touché @ CLEF 2025: Shared Tasks on Argumentation Systems (Classification, Detection, Retrieval, Generation)
Call for Participation
We'd like to invite you to participate in the following shared tasks at Touché 2025 held in conjunction with the CLEF conference in Madrid, Spain.
We extended the submission deadline to May 23rd.
1. Retrieval-Augmented Debating.
Sub-Task 1: Generate responses to argue against a simulated debate partner.
Sub-Task 2: Evaluate systems of sub-task 1.
https://touche.webis.de/clef25/touche25-web/retrieval-augmented-debating.ht…
2. Ideology and Power Identification in Parliamentary Debates.
Sub-Task 1: Given a parliamentary speech in one of several languages, identify the ideology of the speaker's party.
Sub-Task 2: Given a parliamentary speech in one of several languages, identify whether the speaker's party is currently governing or in opposition.
Sub-Task 3: Given a parliamentary speech, identify the position of the speaker's party on the populist-pluralist scale.
https://touche.webis.de/clef25/touche25-web/ideology-and-power-identificati…
3. Image Retrieval/Generation for Arguments.
Given an argument, find (retrieve or generate) images that help to convey the argument's premise.
https://touche.webis.de/clef25/touche25-web/image-retrieval-for-arguments.h…
4. Advertisement in Retrieval-Augmented Generation.
Sub-Task 1: Create relevant responses for a given query, based on a set of document segments.
Sub-Task 2: Given a query and a response, classify whether the response contains an advertisement or not.
https://touche.webis.de/clef25/touche25-web/advertisement-detection.html
Find out more at https://touche.webis.de/clef25/touche25-web/
and join our mailing list at https://groups.google.com/g/touche-lab to stay up to date.
Important Dates
--------------------------
2025-05-23: Approaches submission deadline
2025-05-30: Participant paper submission
2025-06-10: Peer review notification
2025-07-07: Camera-ready participant papers submission
2025-09-09 to 2025-09-12: CLEF Conference in Madrid and Touché Workshop
Links
--------------------------
Touché: https://touche.webis.de
Contact: touche(a)webis.de
We are looking forward to your submission!
The Touché team
This coming Monday 12 May ReproducibiliTea in the HumaniTeas is
delighted to welcome Nathan Dykes (FAU Erlangen) for a short input talk
(20 minutes) entitled "Beyond the gold standard: Transparency in
qualitative corpus analysis" followed by a 60-minute discussion on the
application of Open Science practices in qualitative research.
ReproducibiliTea in the HumaniTeas is an informal place to network with
linguists and other humanities scholars to learn more about
reproducibility, Open Science, and good scientific practice. We meet on
selected Mondays, 16:00-17:30 CEST. Our programme this semester also
includes a session on "Reproducibility when working with large language
models: A hallucination?" with Nils Reiter and on "Language and its role
for replicability" with Xenia Schmalz, Anna Yi Leung and Johannes
Breuer. We aim to be as inclusive as possible: from B.A. students to
full professors, everyone is welcome and there are no silly questions!
Details can be found here:
https://ub.uni-koeln.de/kurse-beratung/specials/reproducibilitea-in-the-hum….
You can join us in person at the University Library in Cologne where we
serve tea and biscuits or online via Zoom. Please join our mailing list
to get the Zoom links:
https://lists.uni-koeln.de/mailman/listinfo/reproducibilitea-humaniteas.
--
Dr. Elen Le Foll
Post-Doctoral Researcher & Lecturer
Department of Romance Studies
<https://romanistik.phil-fak.uni-koeln.de/> • Data Center for the
Humanities <https://dch.phil-fak.uni-koeln.de/> • University of Cologne
<https://portal.uni-koeln.de/en/uoc-home>
Applied Linguistics • Corpus Linguistics • Language Teaching & Learning
ORCID <https://orcid.org/0000-0002-5839-8010> • HAL Science
<https://cv.hal.science/elenlefoll>
IndiREAD Workshop 2025: 2nd Call for Papers
Saarbrücken, Germany, November 26-27, 2025
IndiREAD is a workshop jointly organized by the ERC Project
"Individualized Interaction in Discourse" IDDISC [1] and the MultiplEYE
COST [2] action "Enabling multilingual eye-tracking data collection for
human and machine language processing research".
While experimental research in reading has a long tradition in
identifying key factors that influence reading patterns--including text
properties such as font difficulty, word and structure frequency, word
predictability, and dependency length--recent studies have emphasized
the importance of individual variability in reading behaviour (e.g.,
Haeuser & Kray, 2024; Kuperman et al., 2018; Nicenboim et al., 2016;
Staub, 2021). This work has linked individual variability in reading
patterns to differences in working memory capacity, reading skills,
linguistic experience, and domain expertise among readers. This informs
our understanding of how text characteristics and individual reader
attributes interact to shape eye movements during reading.
IndiREAD aims to bring together researchers interested in investigating
individual differences in reading using both experimental and
computational approaches. This workshop will focus on methods such as
eye-tracking, self-paced reading, and the Maze task, with particular
interest in how reading behaviour is correlated with individual
differences. We also encourage submissions of computational models for
eye movements or reading behaviour that shed light on the mechanisms
behind these differences. The goal is to foster collaboration between
experimental and computational researchers to better understand
individual variability among readers. We especially welcome submissions
of reading time experiments and modelling of languages beyond English.
The IndiREAD Workshop invites submissions of abstracts addressing the
following questions:
* How do individual differences impact the way people read?
* How do reading patterns vary across different languages,
particularly in bilinguals?
* How do reading patterns change across the lifespan?
* Which individual difference measures are most suitable for capturing
variability in reading patterns?
* How can we evaluate psycholinguistic theories of reading and
sentence processing across languages?
* How can computational models account for individual differences in
reading?
* How does text adaptation influence reading patterns and
comprehension among different individuals?
* What statistical methods are best suited for reliably identifying
latent groups and relating individual differences to reading
performance?
Workshop dates: November 26-27, 2025
Workshop format: The workshop will be held in-person in Saarbrücken,
Germany. It will feature presentations from invited speakers, as well as
contributions based on workshop submissions. The format of the
presentations (oral or poster) will be determined based on the number of
submissions we receive.
Submission deadline: July 23, 2025.
Submissions: The abstracts must not exceed 1000 words for the text
(excl. captions), 10000 characters for references, and a maximum of 2
tables or figures. Abstracts should be submitted in PDF format, with
2.54 cm margins on all sides and 12 point font size, single-spaced.
Please indicate up to three appropriate keywords for your abstract,
which will be used for session planning.
Abstracts must be written in English and should include a clear title
but no information revealing the author(s).
We welcome submissions for work that is being considered by other
conferences, workshops, or journals. Templates for formatting in LaTeX
and Word are provided on the conference website.
Submission platform: https://openreview.net/group?id=IndiREAD/2025
Volunteer reviewers: We also invite all interested parties with relevant
research experience to volunteer to help review abstracts for the
workshop. All reviewers should hold a PhD. Please indicate your interest
using the following form: https://forms.office.com/e/0fGmHW7q11
Conference website: https://www.uni-saarland.de/indiread [3]
Contact email: indiread(a)lst.uni-saarland.de
Travel grants: This workshop is sponsored by the MultiplEYE COST Action,
which will provide financial support to cover travel expenses for a
limited number of participants. Authors will be invited to apply for
travel funding upon abstract acceptance. Funding may be partial, and
priority will be given to junior researchers.
Best,
Iza Škrjanec
IndiREAD Organizing Committee
Links:
------
[1]
https://www.uni-saarland.de/lehrstuhl/demberg/individualized-interaction-in…
[2] https://multipleye.eu/
[3] https://www.uni-saarland.de/indiread
(Apologies for cross-posting)
The deadline for the Slavic NLP workshop has been postponed to May 10 AoE.
Note the new possibility to commit papers via ARR.
Details on uploading papers and reviews from ARR to START will appear soon
on the Workshop Home page <http://bsnlp.cs.helsinki.fi/>.
Call for Papers:
Slav-NLP: 10th Workshop on NLP for Slavic languages
At ACL-2025, Vienna, Austria
31 July 2025
bsnlp.cs.helsinki.fi <http://bsnlp.cs.helsinki.fi/>
Submission Deadline: 10 May
WORKSHOP DESCRIPTION
The 10th edition of the Slav-NLP Workshop — at ACL 2025. Sponsored by
SIGSLAV: ACL Special Interest Group on Slavic NLP.
Slavic languages play a crucial role due to their diverse cultural
heritage and wide use — over 400M speakers worldwide. Current political
and economic developments in Central/Eastern Europe thrust the Slavic
languages into sharp focus, especially in light of rapid technological
advancements, and evolving consumer markets.
Research on applied and theoretical NLP in the context of Slavic
languages is still lagging. Linguistic phenomena that are common to the
Slavic languages — rich morphology, free word order, etc. — make NLP for
these languages challenging. Slav-NLP Workshops gather researchers from
academia and industry, aiming to stimulate research in Slavic NLP, and
foster the creation of tools and resources. The Workshops welcome the
exchange of ideas and experience, discussing current challenges, and
promoting the available resources. The structural similarity, as well as
the easily recognizable core vocabulary and inflectional inventory
spanning this large language group, creates a special environment where
researchers can appreciate the shared problems and communicate naturally.
We are happy again to organize Slav-NLP in Central Europe.
This Workshop addresses Natural Language Processing (NLP) for the Slavic
languages. NLP tasks in urgent need of attention include:
* language modeling,
* morphological, syntactic and semantic analysis,
* lexical semantics,
* named-entity recognition,
* text normalization and processing non-standard language,
* co-reference resolution,
* information extraction,
* question answering,
* text summarization,
* machine translation,
* development of linguistic resources,
* development and assessment of large language models,
* text classification,
* text generation,
* disinformation detection,
* fact verification,
* sentiment analysis.
The Workshop continues the proud tradition established by the 9 previous
(B)SNLP Workshops.
IMPORTANT DATES
* Submission deadline: 10 May 2025
* Pre-reviewed ARR commitment: 20 May 2025
* Notification of acceptance: 1 June 2025
* Camera-ready papers due: 15 June 2025
* Workshop: 31 July 2025
SHARED TASK
This year the Slav-NLP Workshop features a Shared Task on Detection and
Classification of Persuasion Techniques in two types of texts: (a)
parliamentary debates on highly contested topics, and (b) social media
posts related to the spread of propaganda and disinformation.
Read about the Shared Task on the Workshop’s Web page.
SUBMISSION
At the Workshop’s Web page: bsnlp.cs.helsinki.fi
<http://bsnlp.cs.helsinki.fi/call-for-papers.html>
Workshop Contact: bsnlp(a)cs.helsinki.fi
--
Roman Yangarber
Professor, University of Helsinki, Finland
The 2nd Large Language Models for Ontology Learning Challenge
Co-located with the 24th International Semantic Web Conference (ISWC 2025)
November 2-6, 2025
Nara, Japan
https://sites.google.com/view/llms4ol2025/home
Challenge Overview
The 2nd LLMs4OL Challenge@ISWC 2025 invites researchers and practitioners to explore the capabilities of Large Language Models (LLMs) in automating Ontology Learning (OL). As the Semantic Web evolves, automating the extraction and structuring of knowledge becomes paramount. This challenge focuses on leveraging LLMs to enhance OL processes, contributing to more intelligent and interoperable web systems. Building upon the success of the 1st LLMs4OL Challenge at ISWC 2024, this second edition aims to further the community's understanding and development of LLM-driven OL methodologies.
Challenge Tasks
Participants can engage in one or more of the following tasks:
* Task A - Text2Onto: Extract ontological terminologies and types from a raw text.
* Task B - Term Typing: Discover the generalized type for a lexical term.
* Task C - Taxonomy Discovery: Discover the taxonomic hierarchy between type pairs.
* Task D - Non-Taxonomic Relation Extraction: Identify non-taxonomic, semantic relations between types.
Each task is designed to address specific aspects of OL, encouraging innovative approaches and solutions.
Important Dates
* Test dataset release: June 1st, 2025
* Begin accepting system submissions: June 2nd, 2025
* End accepting system submissions: June 22nd, 2025
* Participants' Papers Submissions Due: July 5th, 2025
* Notification of Acceptance: July 19th, 2025
* Camera-ready due: July 30th, 2025
* ISWC 2025, Nara, Japan: November 2-6, 2025
================================
** FoIKS 2026: Second call for papers **
================================
Apologies if you received multiple copies of this CFP
The 14th International Symposium on Foundations of Information and
Knowledge Systems (FoIKS'26) invites contributions from theoretical and
applied research on information and knowledge systems.
FoIKS 2026 (https://foiks2026.github.io/)
will be held on 23rd-26th March 2026 in Hannover, Germany.
===========
** Scope **
===========
The suggested topics include, but are not limited to:
* Mathematical Foundations of Information and Knowledge Systems:
  Discrete structures and algorithms, graphs, and formal languages.
* Database Design and Management:
  Formal models, (in)dependencies and models of transactions, concurrency
  control.
* Logics in Databases and AI:
  Classical and non-classical logics, logic programming, description
  logics, spatial and temporal logics, argumentation, probability logic,
  fuzzy logic.
* Knowledge Representation and Reasoning:
  Logical reasoning, non-monotonic reasoning (reasoning under inconsistency),
  reasoning under vagueness or uncertainty.
* Foundations of Neuro-Symbolic Reasoning:
  Embedding methods for structured information, such as knowledge graphs,
  mathematical expressions, grammars, logical theories.
* Intelligent Agents:
  Multi-agent systems, autonomous agents, formal models of interactions,
  Boolean games, coalition formation, reputation systems, epistemic reasoning.
* Knowledge Discovery and Information Retrieval:
  Machine learning, data mining, formal concept analysis and association
  rules, information extraction.
* Security in Information and Knowledge Systems:
  Identity theft, privacy, trust, intrusion detection, access control,
  inference control, secure Web services, secure Semantic Web, risk
  management.
* Integrity and Constraint Management:
  Verification, validation, consistent query answering, and information
  cleaning.
* Knowledge Graphs and Semi-Structured Data:
  Data modelling, data processing, data compression, and data exchange.
======================
** Submission Guidelines **
======================
Papers must be typeset using the Springer LaTeX2e style llncs for
Lecture Notes in Computer Science (for guidelines and templates, see
https://www.springer.com/gp/computer-science/lncs/conference-proceedings-gu…).
Submissions that deviate substantially from these guidelines may be
rejected without review. The following page limits apply by paper type:
* Long papers: 16 pages, plus additional pages for references.
* Short papers: 10 pages, plus additional pages for references.
Missing proofs or details can be added as an additional appendix of up
to 15 pages article style and read at the discretion of the program
committee. All papers must be original and not simultaneously submitted
to another journal or conference. Initial submissions must be in PDF
format, but authors should keep in mind that the LaTeX2e source must be
submitted for the final versions of accepted papers. Submissions in
alternate formats, such as Microsoft Word, cannot be accepted for either
initial or final versions. The submissions will be judged for scientific
quality and for suitability as a basis for broader discussion.
Submission is via EasyChair:
https://easychair.org/conferences/?conf=foiks2026.
All questions about submissions should be emailed to
foiks2026(a)easychair.org.
=============
** Publication **
=============
The proceedings are planned to be published by Springer-Verlag in the
Lecture Notes in Computer Science series. After the symposium, authors
of selected papers will be invited to submit extended journal versions
of their papers for a FoIKS 2026 special issue.
=================
** Important dates **
=================
* Submission of abstracts: September 18, 2025
* Submission of paper: September 25, 2025
* Notification: December 13, 2025
* Final version due: January 08, 2026
* Conference: March 23-26, 2026
==================
** Invited Speakers **
==================
We are excited to announce the invited speakers for FoIKS 2026:
* Giuseppe De Giacomo (University of Oxford)
* Floris Geerts (University of Antwerp)
* Wolfgang Nejdl (Leibniz Universität Hannover)
* Ana Ozaki (University of Oslo)
==============
** Organization **
==============
PC Chairs:
* Anni-Yasmin Turhan (University of Paderborn, Germany)
* Jonni Virtema (University of Sheffield, UK)
Local Chair:
* Arne Meier (Leibniz University Hannover, Germany)
We are pleased to invite applications for a fully funded PhD
position (monthly gross salary: €3,400 + €600 mobility allowance +
family allowance) at the Jožef Stefan Institute
<https://www.ijs.si/ijsw/V001/JSI>, Slovenia’s leading scientific
research institution, located in Ljubljana, Slovenia. The position
includes enrollment in a PhD programme at Jožef Stefan International
Postgraduate School <https://mps.si/en/> in Ljubljana, Slovenia, and
secondments at the NGO Danes je nov dan
<https://danesjenovdan.si> (Slovenia) and Università della Svizzera
italiana <https://www.usi.ch/en> (Switzerland). This position is part
of the Horizon Europe MSCA Doctoral Network “Data2Action”.
The successful candidate will conduct research under the supervision
of Dr. Nikola Ljubešić, focusing on developing AI tools to enhance
political participation by improving transparency and accessibility
of parliamentary information from Slovenia, Croatia, Bosnia, Serbia
or any of the ParlaMint <https://www.clarin.eu/parlamint> countries,
collaborating with NGOs like Danes je nov dan
<https://danesjenovdan.si>, and engaging stakeholders to ensure
trustworthy and explainable AI systems.
Eligibility: The candidates must not have resided or carried out
their main activity (work, studies, etc.) in Slovenia for more than
12 months in the 36 months immediately before the recruitment date.
Application deadline: May 18, 2025
Expected start date: September 1, 2025
For more details, please visit:
https://euraxess.ec.europa.eu/jobs/332073
TL;DR:
Help to shape a research agenda for IR for climate change impacts:
* 2-4 pages
* Deadline: May 7, 2025 (AOE)
* Workshop: https://sites.google.com/view/ir-for-climate-impact/home
* Submissions: https://openreview.net/group?id=ACM.org/SIGIR/2025/Workshop/MANILA#tab-your…
- - - - - - - -
Final call for submissions - MANILA25: SIGIR 2025 Workshop on Information Retrieval for Climate Impact
Climate change is a far-reaching, global phenomenon that will impact many aspects of our society. The evidence base for observed climate impacts is expanding, and the wider climate literature (white and grey) is growing exponentially. How can effective access be provided to the growing body of peer-reviewed literature on climate change impacts? This year we are particularly interested in tracking climate adaptation literature (white and grey) and welcome contributions on this topic.
Purpose
The emphasis of MANILA25 will be on discussion: not a mini-conference but a dynamic sharing of ideas. The workshop will be organized along three areas of interest: (i) addressing information needs concerning climate change impacts; (ii) updates to the IR for Climate Change agenda that resulted from the MANILA24 workshop (see https://arxiv.org/abs/2504.01162); and (iii) tracking climate adaptation. During the workshop, we will work towards updating the MANILA24 agenda and creating an actionable technical research agenda for tracking climate adaptation.
What we’re looking for
To help shape a research agenda for information retrieval for climate impact, we welcome technical contributions and position papers as extended abstracts (2-4 pages) on a wide range of topics related to information retrieval for climate change impacts, including but not limited to very large-scale systematic reviews, climate language models, geolocated literature with climate information, and evidence synthesis. Be sure to emphasize how your ideas connect to IR for climate change impacts and the ambition to create an agenda on the topic.
Important dates
- May 7, 2025: Extended abstracts due (AOE)
- May 21, 2025: Notifications
- July 17, 2025: Workshop at SIGIR 2025
- November 30, 2025: Submission of the Information Retrieval for Climate Impact Agenda for publication in SIGIR Forum
How to submit
Contributions can be submitted at https://openreview.net/group?id=ACM.org/SIGIR/2025/Workshop/MANILA#tab-your…. Please visit https://sites.google.com/view/ir-for-climate-impact/2025/manila25-workshop-… for details.
- - - - - - - -
--
Maarten de Rijke
Distinguished University Professor, University of Amsterdam
Scientific Director, Innovation Center for AI (ICAI)
http://staff.fnwi.uva.nl/m.derijke
Second Call for papers for ConsILR-2025,
the 20th edition of the International Conference on Linguistic Resources
and Tools for Natural Language Processing
(https://conferences.info.uaic.ro/consilr/2025
<https://conferences.info.uaic.ro/consilr/2025/index.html>)
Dates: 8-10 October 2025
Venue: Casa Academiei Române (House of the Romanian Academy), 13, Calea 13
Septembrie, Bucharest, Romania and ONLINE
We invite papers presenting original and unpublished research, as well as
descriptions of accomplished or in-progress work, in all areas of natural
language processing. We welcome contributions covering a range of topics,
including but not limited to:
- Natural Language Processing (NLP) Techniques and Applications
- Large Language Models (LLMs) and Applications
- Digital Humanities in Language Technology
- (Mono- or multimodal) Language Resources and Tools for text, speech, images and videos
- Computational Models and Algorithms in Language Processing
- Applied Linguistics and NLP Integration
- Morphosyntactic Structures in Language Processing
- Semantic and Pragmatic Analysis in NLP
- Multi-word Expressions and Idiomatic Language in NLP
- Cultural and Contextual Factors in Language Technology
- Romanian Language Processing and Contrastive Linguistics
Authors are encouraged to submit, in addition to the papers per se,
open-source linguistic resources, such as corpora (or corpus examples),
demo code, video and sound files.
Confirmed invited speakers:
Agata Savary <https://perso.limsi.fr/savary/>
Amalia Todirașcu <https://fr.linkedin.com/in/amalia-todirascu>
Marius Ursache <https://www.linkedin.com/in/mariusursache/>
Paula Gradu <https://www.linkedin.com/in/paula-gradu-7505591b0>
Organisers:
- “Mihai Drăgănescu” Research Institute for Artificial Intelligence of the Romanian Academy
- Institute of Computer Science of the Romanian Academy – Iași Branch
- Faculty of Computer Science of the “Alexandru Ioan Cuza” University of Iași
- “Alexandru Philippide” Institute of Philology of the Romanian Academy – Iași Branch
- Romanian Association of Computational Linguistics
- Academy of Technical Sciences of Romania
Important Dates:
August 23, 2025 – abstracts submission (max 300 words)
August 31, 2025 – paper submission
September 21, 2025 – authors’ notification
September 31, 2025 – final form submission
October 8 - 10, 2025 – ConsILR Conference
The abstracts (max. 300 words) and papers (an even number of pages, between
6 and 12, including references) must be written in British English.
Details about the paper format are available on the conference website.
The Proceedings of the Conference will be sent for indexing to Clarivate
Analytics.
Further information can be found on the conference web site:
https://conferences.info.uaic.ro/consilr/2025
<https://conferences.info.uaic.ro/consilr/2025/index.html>
KlarText Workshop on German Text Simplification & Readability Assessment
Co-located with KONVENS 2025 | Hildesheim, Germany | 10 September 2025
Website: https://klar-text.github.io/
============================================================
We are pleased to announce the Call for Papers for the KlarText Workshop on German Text Simplification & Readability Assessment.
The workshop aims to bring together researchers, practitioners, and industry experts to discuss state-of-the-art methods, share resources, and identify future research directions in German text simplification and readability assessment. We particularly aim to raise awareness of the diverse simplification goals and language forms in German and to attract researchers who are tackling the challenges of German text simplification.
Topics of interest include (but are not limited to):
- German Text Simplification
- Readability Assessment
- Resources & Approaches for Leichte Sprache
- The Role of Large Language Models (LLMs)
- Resources & Benchmarks
- Evaluation & Human-Centered Assessment
- Applications & Real-World Impact
- Cross-Linguistic & Multilingual Perspectives
Important Dates
- Submission deadline: June 30, 2025
- Notification of acceptance: August 1, 2025
- Camera-ready version due: August 15, 2025
- Workshop date: September 10, 2025
Submissions are managed via OpenReview (https://openreview.net/group?id=GSCL.org/KONVENS/2025/Workshop/KlarText).
Organizing Committee
- Salar Mohtaj, DFKI
- Stefan Hillmann, Technische Universität Berlin
- Sebastian Möller, Technische Universität Berlin
- Georg Groh, Technische Universität München
- Hadi Asghari, Technische Universität Berlin
- Miriam Anschütz, Technische Universität München
Contact
For questions or inquiries, please contact:
Salar Mohtaj – salar.mohtaj(a)dfki.de
Final Call for Research & Innovation Papers
SEMANTiCS 2025 EU
21st International Conference on Semantic Systems
Vienna, Austria
September 3 - 5, 2025
Follow us on *Twitter/X* <https://x.com/SemanticsConf>, *LinkedIn*
<https://www.linkedin.com/groups/7496190/?highlightedUpdateUrn=urn%3Ali%3Agr…>,
and *Bluesky* <https://bsky.app/profile/semantics-conf.bsky.social>.
Important Dates:
- *Abstract Submission Deadline: May 16, 2025*
- *Paper Submission Deadline: May 23, 2025*
- *Notification of Acceptance: June 27, 2025*
- *Camera-Ready Paper Deadline: July 15, 2025*
*All deadlines are set for 11:59 pm, Anywhere On Earth time (UTC-12)*
*Submissions will be through EasyChair; the submission link will be
provided soon.*
Proceedings of SEMANTiCS 2025 EU will be made available *open access*.
Research and Innovation Track
The SEMANTiCS 2025 conference is excited to invite submissions for the
Research and Innovation Track, welcoming groundbreaking research
contributions, innovative solutions, and experimental studies relevant to
the Semantic Web, Semantic Technologies, and AI-enabled semantics. We also
encourage submissions at the intersections of these fields with other
scientific and applied disciplines, fostering cross-disciplinary exchange
and advancement. Papers should present original work that has not been
published and is not under consideration elsewhere. All submissions must
adhere to the submission guidelines, including reference formatting and any
additional documentation as required. Each submission will undergo a
rigorous review process, with at least three independent reviews,
evaluating the novelty, technical quality, reproducibility, and practical
relevance of the work.
Topics of Interest
SEMANTiCS 2025 calls for submissions of high-quality research papers across
a broad spectrum of topics in Semantic Web, Semantic Technologies, and AI.
We are particularly interested in new and emerging trends, especially where
semantic technologies intersect with evolving fields such as large language
models, explainable AI, and trustworthy data infrastructures. Topics of
interest include, but are not limited to:
- Web Semantics & Linked (Open) Data
- Enterprise Knowledge Graphs, Graph Data Management
- Machine Learning Techniques for/using Knowledge Graphs (e.g.
reinforcement learning, deep learning, data mining and knowledge discovery)
- Generative AI and Knowledge Graphs (e.g., Retrieval-Augmented
Generation (RAG) with knowledge graph integration, generative model
grounding)
- Reasoning, Rules, and Policies on RAG
- Knowledge Engineering and Management (e.g., knowledge acquisition,
extraction, integration, and publication workflows)
- Terminology, Thesaurus & Ontology Management, Ontology engineering
- Web agents
- Natural Language Processing for/using Knowledge Graphs (e.g. entity
linking and resolution using target knowledge such as Wikidata and DBpedia,
foundation models)
- Crowdsourcing for/using Knowledge Graphs
- Data Quality Management and Assurance
- Mathematical and Logical Foundations of Knowledge-aware AI
- Multimodal Knowledge Graphs (e.g., text, image, audio fusion in graph
structures)
- Semantic-Enhanced Data Science Pipelines and Processes
- Semantics in Blockchain environments (e.g., traceability,
decentralized knowledge representation)
- Trust, Data Privacy, and Security with Semantic Technologies
- Internet of Things (IoT), Stream Processing, and Temporal Data
Management (e.g., real-time semantic processing and predictive analytics)
- Conversational AI and Dialogue Systems powered by Knowledge Graphs
- Provenance and Data Change Tracking (e.g., semantic versioning, data
updates in distributed settings)
- Semantic Interoperability (e.g., cross-domain standards, mapping
frameworks, ontology alignment)
- Linked Data storage, triple stores, graph databases
- Robust, Scalable, and Fault-Tolerant Semantic Data Systems (e.g.,
distributed querying, optimization)
- User Interfaces and Usability of Semantic Technologies (e.g.,
visualizations, intelligent user interaction)
- Explainable and Interoperable AI
- Decentralised and Federated Knowledge Graphs (e.g., federated
querying, link traversal)
Applied Semantic Technologies and AI in Real-World Scenarios, such as, but
not limited to:
- Biomedicine and Health (e.g., Knowledge Graphs for biomedical
applications, AI-driven diagnostics, personalized health)
- AI for Environmental and Climate Solutions (e.g., semantic modeling
for environmental impact, biodiversity knowledge graphs)
- Scientific Knowledge Graphs and Open Science (e.g., FAIR data
principles, enhanced scholarly communication)
- Semantic Technologies in GLAM (Galleries, Libraries, Archives, and
Museums)
- Knowledge Graphs and Hybrid AI for Industry 4.0/5.0 and Predictive
Maintenance
- Digital Humanities and Cultural Heritage Preservation
- Legal Technology, AI Ethics, and Regulatory Compliance (e.g., AI and
legal frameworks, semantic-enabled compliance with the EU AI Act)
- Economics and Governance of Data Ecosystems (e.g., data marketplaces,
semantic service interoperability, data policy)
Submission Guidelines
The Research and Innovation Track at SEMANTiCS 2025 invites both
*long* and *short paper submissions*.
- *Long papers* should be *12-15 pages* in length (excluding
references). These submissions are expected to present comprehensive,
mature research findings, including in-depth theoretical or practical
insights.
- *Short papers* should be a *maximum of 6 pages* (excluding
references). These submissions can include preliminary findings, innovative
ideas, or position papers that aim to spark discussion and exploration.
References are not included in the page count, so authors may add
additional pages for relevant citations if needed. This flexibility allows
authors to fully reference foundational and related work to strengthen the
context and impact of their research.
- Submissions should follow the guidelines of IOS Press. Details are
available at *https://www.iospress.com/book-article-instructions*.
<https://www.iospress.com/book-article-instructions>
- Authors need to use the *Word template*
<https://www.iospress.com/sites/default/files/media/files/2022-06/ECRC-Autho…>
or *LaTeX* <https://vtex-soft.github.io/texsupport.IOS-Book-Article/>
template provided by IOS Press. Overleaf users can copy the project *from
here* <https://www.overleaf.com/read/gkkspcvjgwxv#563836> (follow
instructions in the abstract).
- Abstract submission is mandatory for all papers. To aid the review and
bidding process, we highly encourage authors to submit structured
abstracts.
- All papers and abstracts have to be submitted electronically via
EasyChair.
- Submissions must be in English.
- Submissions must adhere to the fair use of Large Language Models.
Please refer to the SEMANTiCS *full policy*
<https://2025-eu.semantics.cc/page/llm-policy> for more details.
- Submissions must be anonymous; the reviewing process is double-blind,
but reviewers will be able to disclose their identities if they wish, by
signing their reviews.
- Accepted papers will be published in open access proceedings by IOS
Press, and the text of all the reviews (excluding the scores) of all the
accepted papers will be posted on the conference website and will be
archived on Zenodo as publicly available material.
- At least one author of each accepted paper must present it in person
and therefore register for the conference at the ONSITE rate.
- All authors are strongly encouraged to provide links to code,
materials, and datasets during the submission process. The EasyChair
submission form will have specific optional fields for them, and the review
process will take these into account when provided. To anonymise resources
for the reviewing process, authors can use services like *Anonymous
GitHub* <https://anonymous.4open.science/> or figshare/Zenodo as
described *here*
<https://github.com/dgraziotin/disclose-data-dbr-first-then-opendata?tab=rea…>.
- The Research and Innovation Track will not accept papers that, at the
time of submission, are under review or have already been published in or
accepted for publication in a journal or another conference.
- All authors will have the opportunity to provide an ORKG comparison in
the Open Research Knowledge Graph (*https://orkg.org* <https://orkg.org>)
during the submission process - we will have a specific optional field in
the EasyChair submission form.
Review and Evaluation Criteria
Each submission will be reviewed by at least three Programme Committee
members. The reviewing process is double-blind. However, reviewers can
disclose their identity by signing their reviews and/or adding one of their
persistent identifiers (e.g. their ORCID).
The text of all the reviews (excluding the scores) of all the accepted
papers will be posted on the conference website with the basic
bibliographic metadata of the reviewed submission (i.e. title and authors),
and it will be archived on Zenodo as publicly available material. All the
signed reviews of the accepted papers will be licensed under a Creative
Commons Attribution license (CC-BY; the copyright holder will be the
reviewer), whereas anonymous reviews will be released under CC0.
Papers submitted to this track will be evaluated according to the following
criteria:
- Appropriateness
- Originality, novelty, and innovativeness
- Impact of results
- Technical quality of the methods
- Soundness of the evaluation
- Proper comparison to related work
- Clarity and quality of writing
- Reproducibility of results and resources
*We look forward to receiving your contributions!*
Research and Innovation Track Chairs
Blerina Spahiu (University of Milano-Bicocca, IT)
Mehdi Ali (Lamarr Institute & Fraunhofer IAIS, Germany)
Kind Regards,
On behalf of the organising committee.
=========================
Dr. Kossi Amouzouvi
ScaDS.AI Dresden/Leipzig, TU Dresden
ACL 2025 Call for Student Volunteers
# Student Volunteer Program
A limited number of student volunteers are needed for the success of ACL
2025. Both online and in-person event volunteers are needed.
Tasks may include assisting at the registration desk, filling delegate
packs, managing poster board sessions and displays, serving as volunteer
coordinator for the day, and/or AV/technical support such as (but not
limited to) managing social media (X/Twitter) and providing assistance
for conference events including tutorials, the main conference, and
workshops (either online or in-person versions).
In exchange for a minimum of 10 hours of service, students receive free
registration to the main conference (including the ACL membership fee of
the current year and paper registration fee if applicable), workshops
and tutorials, and social events. The work will be divided, probably
into two half-day shifts, and the shifts will be scheduled to maximise
volunteer access to the conference events.
*We'd like to kindly inform you that the award does not include
provisions for travel and accommodation.* If travel support is essential
for you, we encourage you to explore the D&I funds as well.
## IMPORTANT DATES
All deadlines are 11:59 PM UTC-12:00 ("anywhere on Earth").
- Application Deadline: June 6, 2025
- Notification of Acceptance: June 16, 2025
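For applicants unsure what an "anywhere on Earth" deadline means in their own time zone, the conversion can be sketched as follows. This is an illustrative snippet only; the helper name `aoe_deadline_in_utc` is hypothetical, and the only facts taken from the call are the 11:59 PM UTC-12:00 cut-off and the June 6, 2025 date.

```python
from datetime import datetime, timedelta, timezone

# "Anywhere on Earth" (AoE) corresponds to the fixed offset UTC-12:00.
AOE = timezone(timedelta(hours=-12), name="AoE")

def aoe_deadline_in_utc(year: int, month: int, day: int) -> datetime:
    """Return 11:59 PM AoE on the given date, expressed in UTC."""
    local = datetime(year, month, day, 23, 59, tzinfo=AOE)
    return local.astimezone(timezone.utc)

# Application deadline from the call: June 6, 2025, 11:59 PM AoE.
deadline_utc = aoe_deadline_in_utc(2025, 6, 6)
print(deadline_utc.isoformat())  # 2025-06-07T11:59:00+00:00
```

In other words, the deadline falls at 11:59 UTC on the following calendar day; `deadline_utc.astimezone()` converts it to the local time zone of the machine running the snippet.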
## SELECTION CRITERIA
The selection process for the Student Volunteer Program will involve a
careful evaluation of the materials submitted by applicants. Priority
consideration will be given to individuals who meet any of the
following criteria:
- Students who will be presenting a paper at the main conference or an
associated workshop, whether in person or online. We encourage both
newcomers and those with prior volunteer experience to apply.
- Students who are enthusiastic about assisting with various aspects of
the conference, genuinely hoping to collaborate in making ACL 2025 a
success.
- Student presenters who demonstrate financial need (Applicants can
optionally include a letter from a faculty advisor to explain financial
need).
- For virtual volunteers, students who are familiar with GatherTown and
Whova.
**IMPORTANT: Applicants who are selected must commit to attending the
training sessions and fulfilling the assigned responsibilities. If you
are unsure whether you will attend the conference, please do not
apply; we expect every volunteer to show up and demonstrate enthusiasm
for helping out at the conference.**
## SUBMISSION PROCEDURE
Applicants for the Student Volunteer Program must be full-time students
and should submit the completed application form, in which we ask a few
questions and request a one-page CV (resume). Students should make travel
arrangements and accommodations independent of the results of the
application. Apply via the following form:
https://forms.gle/THCSnxw8dte34Wbu5 [1]
Please **DO NOT REGISTER** for the ACL 2025 Conference until someone has
reached out to you or you have received a Congratulatory email
confirming your Volunteer Service Acceptance. Once you have received
this Acceptance notification, you will receive a special link to
register for which your registration fees are waived. Additionally, a
separate email containing a Volunteer Registration form will be sent to
you, in which you can list what volunteer preference task you would like
based on your skillset (i.e. registration, volunteer coordinator, poster
session liaison etc.). This form must be completed FIRST in order to
receive the ACL Conference 2025 LINK to have registration fees waived.
If, for any reason, you are not accepted, the Registrar will work with
you to secure early registration fee rates.
If a student requires reimbursement after the conference
for LATE volunteer registration, the student must provide a receipt of
paid registration fees from a debit or bank account (i.e., a business
expense report) to the ACL Assistant Director of Events, Megs Haddad
(acl.megshaddad(a)gmail.com).
## SUMMARY OF IMPORTANT NOTES
- Student volunteers receive only free conference registration and free
ACL membership for the year, and must be responsible for all other
costs, such as travel and accommodation.
- Student volunteers may also apply for the Diversity & Inclusion (D&I)
grant, which may help offset additional costs. The application
procedures are separate.
- We plan to notify applicants of their student volunteer status on June
16, 2025. Please do not register for the conference before that.
- Student volunteers who do NOT show up to their training sessions and
their assigned duties will be charged the full cost after the
conference.
- Student volunteers who do not fulfil 10 hours of service may be
charged for a portion--or the entirety--of their conference benefits.
## STUDENT VOLUNTEER CHAIRS
Contact: acl2025-volunteer-chairs(a)googlegroups.com
- Pedro Henrique Luz de Araujo, University of Vienna (Austria)
- Eleonora Mancini, University of Bologna (Italy)
Links:
------
[1] https://forms.gle/THCSnxw8dte34Wbu5
Dear ACL 2025 Attendees:
ACL 2025 is providing D&I funds for registration, caregiving, bandwidth,
travel and VPN subsidies. We strongly encourage researchers from
developing countries and marginalized communities, students, and
researchers with financial hurdles to apply for both subsidies and
volunteering opportunities (https://2025.aclweb.org/calls/volunteers/
[1]) to maximize their chances of getting their registration fees
waived. Please note that we are offering both in-person and virtual
attendance subsidies, and review the calls below for details on
eligibility requirements.
Link to Call: https://2025.aclweb.org/calls/subsidies/ [2]
Link to Call (virtual non-presenters only):
https://2025.aclweb.org/calls/virtual_subsidies/ [3]
Deadline: June 6th, 2025 at 11:59pm (Anywhere on Earth)
Time estimate: This application will take you about 15-20 minutes to
complete. You are able to revise your responses to this form before the
deadline, so please be sure to keep your application up to date and
accurate.
If you have any questions or concerns, please contact us at
acl2025diversity(a)googlegroups.com.
Sincerely,
ACL 2025 Diversity and Inclusion Team
Links:
------
[1] https://2025.aclweb.org/calls/volunteers/
[2] https://2025.aclweb.org/calls/subsidies/
[3] https://2025.aclweb.org/calls/virtual_subsidies/
🔔 Evaluation Phase Now Open!
The evaluation phase for the Ahasis Shared Task has officially begun!
👉 If you're registered, access the test set and submission portal via CodaBench: https://www.codabench.org/competitions/5871
👉 Not registered yet? Visit our official website to register and get started: https://ahasis-42267.web.app/
😊 Sentiment Across Multi-Dialectal Arabic: A Benchmark for Sentiment Analysis in the Hospitality Domain
We invite researchers, practitioners, and NLP enthusiasts to participate in the Sentiment Across Multi-Dialectal Arabic shared task, a challenge aimed at advancing sentiment analysis for Arabic dialects in the hospitality sector.
🧠 About the Task
Arabic is one of the world’s most spoken languages, characterised by rich dialectal variation across different regions. These dialects differ significantly in syntax, vocabulary, and sentiment expression, making sentiment analysis a challenging NLP task. This task focuses on multi-dialectal sentiment detection in hotel reviews, where participants will classify sentiment as positive, neutral, or negative across multiple Arabic dialects, including Saudi and Moroccan.
This shared task provides a high-quality multi-dialect parallel dataset, enabling participants to explore:
1. Dialect-Specific Sentiment Detection – Understanding how sentiment varies across dialects.
2. Cross-Linguistic Sentiment Analysis – Investigating sentiment preservation across dialects.
3. Benchmarking on Multi-Dialect Data – Evaluating models on a standardised Arabic dialect dataset.
📦 Dataset Overview
- Hotel reviews across multiple Arabic dialects.
- Balanced sentiment distribution (positive, neutral, negative).
- Multi-Dialect Parallel Dataset – Each review is available in multiple dialects, allowing for cross-linguistic comparison.
📏 Evaluation Metrics
- Primary Metric: F1-Score.
- Additional Analysis: Comparison of sentiment accuracy across dialects.
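As a rough illustration of the primary metric, the sketch below computes a macro-averaged F1 over the three sentiment labels. The task description only says "F1-Score"; the macro-averaging, the label strings, the function names, and the toy predictions are all assumptions made for illustration, and this is not the organisers' scoring script.

```python
from typing import Sequence

# Label names are assumed for illustration; the official data may encode them differently.
LABELS = ("positive", "neutral", "negative")

def f1_per_label(gold: Sequence[str], pred: Sequence[str], label: str) -> float:
    """F1 for one label, computed from true/false positives and false negatives."""
    tp = sum(g == label and p == label for g, p in zip(gold, pred))
    fp = sum(g != label and p == label for g, p in zip(gold, pred))
    fn = sum(g == label and p != label for g, p in zip(gold, pred))
    if tp == 0:
        return 0.0  # no correct predictions for this label
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

def macro_f1(gold: Sequence[str], pred: Sequence[str], labels=LABELS) -> float:
    """Unweighted mean of the per-label F1 scores (assumed averaging scheme)."""
    return sum(f1_per_label(gold, pred, label) for label in labels) / len(labels)

# Toy example with made-up labels, not task data.
gold = ["positive", "neutral", "negative", "positive"]
pred = ["positive", "negative", "negative", "neutral"]
score = macro_f1(gold, pred)
print(f"macro-F1 = {score:.3f}")  # macro-F1 = 0.444
```

Running the same function separately on the reviews of each dialect would give the kind of per-dialect comparison mentioned under the additional analysis.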
🧪 Baseline System
- Pre-trained BERT-based model (AraBERT) fine-tuned on MSA and Arabic dialect data.
- Participants are encouraged to improve upon the baseline model with their own techniques and use LLMs.
🌟 Why Participate?
- Contribute to Arabic NLP Research – Help advance sentiment analysis for Arabic dialects.
- Gain Access to a High-Quality Dataset – A unique multi-dialect benchmark for future research.
- Collaborate with the NLP Community – Engage with leading researchers and practitioners.
- Showcase Your Work – High-performing models may be featured in a post-task publication.
🗓️ Timeline
- Training data ready – April 15, 2024
- Test evaluation starts – May 1, 2025
- Test evaluation ends – May 5, 2025
- Paper submission due – May 16, 2025
- Notification to authors – May 31, 2025
- Shared task presentation co-located with RANLP 2025 – September 11, 12, and 13, 2025
✅ How to Participate?
1. Register for the task via https://ahasis-42267.web.app/
2. Download the dataset and baseline system.
3. Develop and test your sentiment analysis model.
4. Submit your results for evaluation.
👥 Organising Team
- Maram Alharbi, Lancaster University, UK
- Salmane Chafik, Mohammed VI Polytechnic University, Morocco
- Professor Ruslan Mitkov, Lancaster University, UK
- Dr. Saad Ezzini, King Fahd University of Petroleum and Minerals, Saudi Arabia
- Dr. Tharindu Ranasinghe, Lancaster University, UK
- Dr. Hansi Hettiarachchi, Lancaster University, UK
📬 For inquiries, please contact us at ahasis.task(a)gmail.com
🎉 Don’t forget to enjoy the challenge, explore the beauty of Arabic dialects, and push the boundaries of what your models can do! 🚀
Dear Colleagues,
We are announcing that the Dev phase of the M-DAIGT shared task has started, and the registration deadline has been extended until 7 May 2025.
The Multi-Domain Detection of AI-Generated Text (M-DAIGT) shared task, hosted at RANLP 2025, is bringing together researchers to explore methods for detecting AI-generated text across multiple domains, with a focus on news articles and academic writing.
We invite participation in two subtasks:
1. News Article Detection (NAD): Classify news articles and snippets as human-written or AI-generated.
2. Academic Writing Detection (AWD): Identify AI-generated content within student coursework and academic research across various disciplines.
Participants will receive balanced datasets containing human-written and AI-generated texts from multiple language models. Evaluation will be conducted on the Codabench platform.
Evaluation Metrics:
* Primary: F1-score, Accuracy, Precision, Recall
* Secondary: Robustness across text lengths, domains, and generation sources
Important Dates:
* Training Data Release: March 31, 2025
* Development Phase Start: May 1, 2025
* Evaluation Data Release: May 7, 2025
* Evaluation Period: May 8-15, 2025
* Paper Submission Deadline: June 1, 2025
* Workshop Dates: September 11-12, 2025
More Information and Registration:
* Website: https://ezzini.github.io/M-DAIGT/
* GitHub Repository: https://github.com/ezzini/M-DAIGT
* Registration: Click here to register for solo or team participation <https://docs.google.com/forms/d/e/1FAIpQLSextZDY7qjGRJSLCBNISPcBNQZwusRWKvy…>
* Participation:
  * Shared Task 1 - News Article Detection (NAD): https://www.codabench.org/competitions/7391/
  * Shared Task 2 - Academic Writing Detection (AWD): https://www.codabench.org/competitions/7329/
We look forward to your participation and encourage you to share this with colleagues who may be interested. For any queries, feel free to reach out to the organizers.
Yours sincerely,
The M-DAIGT Organizers
Dear colleagues,
While the deadline for regular GEM^2
<https://gem-benchmark.com/workshop> submissions
has passed, until May 17th it is possible to submit papers that have
already been reviewed in a recent ARR cycle by simply filling in this short
form
<https://docs.google.com/forms/d/e/1FAIpQLSdDUoxvdwKgwv6mOsxL7aFJ3InkyHxkPug…>.
Note that our website will be updated soon with this information.
Updated important dates:
- May 17 (previously May 5): Pre-reviewed (ARR) commitment deadline.
- May 25 (previously May 19): Notification of acceptance.
- June 12 (previously June 6): Camera-ready paper deadline.
- July 7: Pre-recorded videos due.
- July 31 - August 1: Workshop at ACL in Vienna.
Regards,
simon
*ADAPT Research Centre / Ionaid Taighde ADAPT*
*School of Computing, Dublin City University, Glasnevin Campus
/ Scoil na Ríomhaireachta,
Campas Ghlas Naíon, Ollscoil Chathair Bhaile Átha Cliath*
ACL 2025 Call for Student Volunteers
# Student Volunteer Program
A limited number of student volunteers are needed for the success of ACL
2025. Both online and in-person event volunteers are needed.
Tasks may include assisting at the registration desk, filling delegate
packs, managing poster board sessions and displays, serving as volunteer
coordinator for the day, and/or AV/technical support such as (but not
limited to) managing social media (X/Twitter) and providing assistance for
conference events including tutorials, the main conference, and workshops
(either online or in-person versions).
In exchange for a minimum of 10 hours of service, students receive free
registration to the main conference (including the ACL membership fee of
the current year and paper registration fee if applicable), workshops and
tutorials, and social events. The work will be divided, probably into two
half-day shifts, and the shifts will be scheduled to maximise volunteer
access to the conference events.
*We'd like to kindly inform you that the award does not include provisions
for travel and accommodation.* If travel support is essential for you, we
encourage you to explore the D&I funds as well.
## Important Dates
All deadlines are 11:59 PM UTC-12:00 (“anywhere on Earth”).
- Application Deadline: June 6, 2025
- Notification of Acceptance: June 16, 2025
## Selection Criteria
The selection process for the Student Volunteer Program will involve a
careful evaluation of the materials submitted by applicants. Priority
consideration will be given to individuals who meet any of the following
criteria:
- Students who will be presenting a paper at the main conference or an
associated workshop, whether in person or online. We encourage both
newcomers and those with prior volunteer experience to apply.
- Students who are enthusiastic about assisting with various aspects of the
conference, genuinely hoping to collaborate in making ACL 2025 a success.
- Student presenters who demonstrate financial need (Applicants can
optionally include a letter from a faculty advisor to explain financial
need).
- For virtual volunteers, students who are familiar with GatherTown and
Whova.
**IMPORTANT: Applicants who are selected must commit to attending the
training sessions and fulfilling the assigned responsibilities. If you are
unsure whether you will attend the conference, please do not apply. We
expect every volunteer to show up and demonstrate enthusiasm for helping
out at the conference**.
## Submission Procedure
Applicants for the Student Volunteer Program must be full-time students and
should submit the completed application form, in which we ask a few
questions, together with a one-page CV (resume). Students should make travel
and accommodation arrangements independently of the results of the
application. Apply via the following form:
https://forms.gle/THCSnxw8dte34Wbu5
Please **DO NOT REGISTER** for the ACL 2025 Conference until someone has
reached out to you or you have received a congratulatory email confirming
your acceptance as a volunteer. Once you have received this acceptance
notification, you will receive a special registration link that waives your
registration fees. Additionally, a separate email containing a
Volunteer Registration form will be sent to you, in which you can list the
volunteer tasks you would prefer based on your skill set (e.g.,
registration, volunteer coordinator, poster session liaison). This
form must be completed FIRST in order to receive the ACL 2025 Conference
link that waives registration fees. If, for any reason, you are not
accepted, the Registrar will work with you to secure early registration fee
rates.
If a student requires reimbursement after the conference for
LATE volunteer registration, the student must provide a receipt of paid
registration fees from a debit or bank account (i.e., a business expense
report) to the ACL Assistant Director of Events, Megs Haddad
(acl.megshaddad(a)gmail.com).
## Summary of Important Notes
- Student volunteers receive only free conference registration and free ACL
membership for the year, and must be responsible for all other costs, such
as travel and accommodation.
- Student volunteers may also apply for the Diversity & Inclusion (D&I)
grant, which may help offset additional costs. The application procedures
are separate.
- We plan to send acceptance notifications for student volunteers on June 16,
2025. Please do not register for the conference before that.
- Student volunteers who do NOT show up to their training sessions and
their assigned duties will be charged the full cost after the conference.
- Student volunteers who do not fulfil 10 hours of service may be charged
for a portion—or the entirety—of their conference benefits.
## Student Volunteer Chairs
Contact: acl2025-volunteer-chairs(a)googlegroups.com
- Pedro Henrique Luz de Araujo, University of Vienna (Austria)
- Eleonora Mancini, University of Bologna (Italy)
--
Horacio Saggion
Full Professor / Chair in Computer Science and Artificial Intelligence
Head of the Natural Language Processing Group - TALN
Project Coordinator iDEM Project (HE)
Co-PI of the AI-BOOST project (HE)
Co-PI of the IDEAL project (HE)
Universitat Pompeu Fabra
https://twitter.com/h_saggion
https://www.linkedin.com/in/horacio-saggion-1749b916
* Workshop on Medical Language Processing in the era of Large Language Models (MLP-LLM 2025) *
Co-located with CORIA-TALN 2025 -- 30 June 2025, Marseille
Deadline: 30 April 2025 (UTC-12/Anywhere on Earth)
* Call for Papers *
https://atilf-umr7118.github.io/MLPLLM2025/
The advent of large language models (LLMs) has revolutionized natural language processing across various domains, including healthcare. However, the complexities of medical language pose unique challenges and opportunities: specialized terminologies, abbreviations and coding ontologies such as ICD, UMLS or SNOMED, and implicit contextual dependencies (for example, medication information may differ with context: temporality, action, certainty, etc.). The stakes in medical NLP are high, given the importance of finding the right diagnosis and treatment for each patient. Moreover, the field of health includes not only the human aspect represented by the practitioner-patient relationship, but also contact with the biological world (animals, plants, viruses, microbes). This workshop, MLP-LLM, aims to bring together researchers from NLP, medicine, bioNLP and linguistics to explore advancements, limitations, and ethical considerations of using LLMs in medical contexts. Topics of interest include, but are not limited to:
* Fine-tuning and adapting LLMs for medical applications and for different languages.
* Addressing biases in medical language understanding along with LLM hallucinations.
* Proposing evaluation methods to assess the quality of medical NLP tools.
* Ensuring transparency, interpretability, and uncertainty awareness in medical AI systems.
* Developing domain-specific benchmarks for evaluating LLMs in healthcare.
* Developing applications for LLMs in clinical decision support, medical transcription and communication between practitioners and patients.
Keynote speaker: Natalia Grabar, Université de Lille
We welcome articles that are:
* new contributions,
* state-of-the-art articles,
* work in progress,
* short or translated versions of papers accepted at a major conference.
Important Dates:
* Submission Due: 30 April 2025
* Author Notification: 12 May 2025
* Camera ready: 16 May 2025
* Workshop: 30 June 2025
Submissions are accepted in either English or French.
Contact:
Workshop Organizers (mlpllm2025(a)gmail.com)
Ioana Buhnila (ioana.buhnila(a)univ-lorraine.fr)
Aman Sinha (aman.sinha(a)univ-lorraine.fr)
PAN @ CLEF 2025: Shared Tasks on Authorship Analysis, Computational Ethics, and Originality
Call for Participation
We'd like to invite you to participate in the following shared tasks at PAN 2025 held in conjunction with the CLEF conference in Madrid, Spain.
1. Voight-Kampff Generative AI Detection.
Subtask 1: Given a (potentially obfuscated) text, decide whether it was written by a human or an AI.
Subtask 2: Given a document collaboratively authored by human and AI, classify the extent to which the model assisted.
https://pan.webis.de/clef25/pan25-web/generated-content-analysis.html
2. Multilingual Text Detoxification.
Given a toxic piece of text, rewrite it in a non-toxic way while preserving the main content as much as possible.
https://pan.webis.de/clef25/pan25-web/text-detoxification.html
3. Multi-Author Writing Style Analysis.
Given a document, determine at which positions the author changes.
https://pan.webis.de/clef25/pan25-web/style-change-detection.html
4. Generative Plagiarism Detection.
Given a pair of documents, your task is to identify all contiguous maximal-length passages of reused text between them.
https://pan.webis.de/clef25/pan25-web/generated-plagiarism-detection
Find out more at https://pan.webis.de/clef25/pan25-web
Important Dates
--------------------------
now Training Data Released
May 23, 2025 Software submission
May 30, 2025 Participant paper submission
June 27, 2025 Peer review notification
July 07, 2025 Camera-ready participant papers submission
Sep 09-12, 2025 Conference
Links
--------------------------
PAN: https://pan.webis.de
Contact: pan(a)webis.de
We are looking forward to your submission!
The PAN team
Dear Corpus Linguists,
This symposium may be of interest to those of you involved in educational corpus linguistics and disciplinary literacy. It is part of the University of Bath's British Academic Written English Secondary School (BAWESS) project.
Dear All,
We are thrilled to announce that registration for the University of Bath’s 2nd Disciplinary Literacy Symposium <https://urldefense.com/v3/__https://www.bath.ac.uk/events/second-disciplina…>, on the 26th and 27th June 2025 at the University of Bath and the Royal High School Bath, is now open.
The Symposium brings together leading education specialists, linguists and academics in the field, who will present and discuss their latest work on topics such as:
* The literacy skills pupils need to succeed and how they differ across disciplines
* The types of texts students write in different disciplines
* The structure and language of long answers written in exams
* How to teach writing explicitly in different disciplines
* Cross-disciplinary and Teacher/Researcher Collaboration
* Teacher Professional Development
* Language and literacy expectations at a tertiary level
The Symposium, hosted by the "Disciplinary literacy & corpus-based pedagogy: The BAWESS project" team <https://urldefense.com/v3/__https://www.bath.ac.uk/projects/disciplinary-li…>, is aimed at teachers and language education researchers.
The event promises excellent opportunities for discussion, knowledge exchange, professional development and networking. Join us to hear about current research and approaches from leading researchers and teachers working in the field: Lee McCallum, David Beauchamp, Hadrian Briggs, Natalie Cheers, Honglin Chen, Bev Derewianka, Yaegan Doran, Philip Durrant, Gail Forey, Sheena Gardner, Meg Gebhard, Helen Handford, Sally Humphrey, Reka Jablonkai, Pauline Jones, Cassi Liardet, Ana Llinares, Erika Matruglio, Christian Matthiessen, Tom Morton, Nashwa Nashaat-Sobhy, Hilary Nesi, Dana Therova, Paul Thompson, Leah Tompkins, Winfred Wenhui Xuan.
We’ve tried to make this as affordable as possible. The symposium will cost £15 for the Thursday, £25 for the Friday (incl. lunch and refreshments), or £35 for both. We will also live stream the event at a cost of £15 registration. See the attachment.
For more details and to register, visit the Symposium website <https://urldefense.com/v3/__https://www.bath.ac.uk/events/second-disciplina…>.
Registration closes 19 June 2025. Spaces are limited, so please book early to avoid disappointment.
Please share this with teachers and colleagues you think would be interested in joining a discussion on disciplinary literacy.
All the best,
Gail
Prof Gail Forey
Associate Dean (Education)
Faculty of Humanities & Social Sciences
Professor of Applied Linguistics
Department of Education
My pronouns are: she/her
University of Bath<https://urldefense.com/v3/__http://www.bath.ac.uk/__;!!K7l7YuZ3_aFnun0eduI!…>
Building 1 West North, 3.23b, Bath BA2 7AY, United Kingdom | Telephone: +44 (0)1225 386355 | Email: g.forey@bath.ac.uk
David Beauchamp
Post-graduate Researcher
The Centre for Arts, Memory and Communities (CAMC)
Coventry University
**** We apologize for the multiple copies of this email. In case you are
already registered to the next webinar, you do not need to register
again. ****
------------------------------------------------------------------------
Dear colleague,
We are happy to announce the next webinar in the Language Technology
webinar series organized by the HiTZ Chair of AI (https://hitz.eus).
You can view the videos of previous webinars and the schedule for
upcoming webinars here: http://www.hitz.eus/webinars
Next webinar:
*Speaker:* André F. T. Martins (Universidade de Lisboa)
*Title:* xCOMET, Tower, EuroLLM: Open & Multilingual LLMs for Europe
*Date:* Thursday, May 8, 2025 - 15:00 CET
*Summary:* Today, LLMs are Swiss Army knives and MT is one of their tools. Is
this the end of MT research? In this talk, I argue that the connection
between LLM and MT research is two-way. I present some of our recent
work advancing multilingual LLMs, tools to estimate their quality, and
how the two can be combined for test-time scaling. First, I present
xCOMET, an open-source learned metric which integrates sentence-level
evaluation and error span detection, exhibiting state-of-the-art
performance across all types of meta-evaluation (sentence-level,
system-level, and error span detection). Moreover, it does so while
highlighting and categorizing error spans, thus enriching the quality
assessment. Then, I present Tower, a suite of open multilingual LLMs for
translation-related tasks. Tower models are created through continued
pretraining on a carefully curated multilingual mixture of monolingual
and parallel data. The combination of Tower with COMET reranking
obtained the best results in 8 out of 11 language pairs in the WMT
General Translation shared task, according to human evaluation. Finally,
I describe EuroLLM, an ongoing EU-made project whose goal is to train an
open multilingual LLM from scratch using the European HPC infrastructure
(EuroHPC). The last release (EuroLLM-9B) supports 35 languages,
including all 24 official EU languages, and it achieves strong results
in various benchmarks, comparable to or better than the best existing
models of similar size.
xCOMET:
https://huggingface.co/collections/Unbabel/xcomet-659eca973b3be2ae4ac023bb
Tower:
https://huggingface.co/collections/Unbabel/tower-659eaedfe36e6dd29eb1805c
EuroLLM: https://huggingface.co/blog/eurollm-team/eurollm-9b
*Bio:* André F. T. Martins (PhD 2012, Carnegie Mellon University and
Instituto Superior Técnico; https://andre-martins.github.io/) is an
Associate Professor at Instituto Superior Técnico, University of Lisbon,
researcher at Instituto de Telecomunicações, and the VP of AI Research
at Unbabel. His research, funded by an ERC Starting Grant (DeepSPIN) and
Consolidator Grant (DECOLLAGE), among other grants, includes machine
translation, quality estimation, structure and interpretability in deep
learning systems for NLP. His work has received several paper awards at
ACL conferences. He co-founded and co-organizes the Lisbon Machine
Learning School (LxMLS), and he is a Fellow of the ELLIS society and
co-director of the ELLIS Program in Natural Language Processing. He is a
member of the R&I advisory group of EuroHPC, the European infrastructure
for supercomputing.
*Upcoming webinars:*
· Mirella Lapata (Thursday, June 5, 2025)
If you are interested in participating, please complete this
registration form: http://www.hitz.eus/webinar_izenematea
If you cannot attend this seminar, but you want to be informed of the
following HiTZ webinars, please complete this registration form instead:
http://www.hitz.eus/webinar_info
Best wishes,
HiTZ Zentroa
P.S.: HiTZ will not grant any type of certificate for attendance at these
webinars.