This is the second call for participation for the *2nd SIGIR 2025 Workshop on Simulations for Information Access (Sim4IA)*.
The workshop will be held with SIGIR 2025 in Padua, Italy. It will provide a unique platform for researchers and practitioners to explore and discuss advancements in simulations for information access systems.
## tl;dr
----------
- 17 July 2025, co-located with SIGIR 2025 in Padua, Italy
- Micro shared task data and framework available
- Tech and infrastructure talks/presentations welcome
- Keynote by Christine Bauer confirmed
- We are on the ACM Slack: https://acmsigir.slack.com/archives/C08STM45N90
- Website: https://sim4ia.org/sigir2025/
## Micro Shared Task Data and Framework Available
-----------------------------------------------------------------------
To drive a more focused discussion at the workshop, we designed a micro shared task that demonstrates how a shared task in user simulations might look. On 16 May 2025, we released the first training data set as well as a prebundled and dockerized version of SimIIR to give everyone a head start on the shared task.
Our shared task concept is based on the fundamental design principle of validating user simulations instead of measuring system effectiveness. We envision users interacting with a particular IA system, such as a traditional search engine (Task A) or a conversational system (Task B). We challenge participants to design and implement user simulators that can mimic the interactions of real users with these systems with a high degree of fidelity. The workshop features a stripped-down version of this concept, a micro shared task.
We will discuss the submissions and ideas for the next steps or evaluation measures at the workshop. Non-binding expression of interest to take part in the micro shared tasks: https://forms.gle/ftV8cwjywHWsBhCw9
More information on the shared task, data sets, and framework: https://sim4ia.org/sigir2025/#micro-shared-task
## Keynote by Christine Bauer
-----------------------------------------
We are happy to announce that Christine Bauer has agreed to give a keynote on “From toy models to tactics: What user simulation is good for”.
Christine Bauer is a Professor of Interactive Intelligent Systems at the Department of Artificial Intelligence and Human Interfaces (AIHI) at the University of Salzburg. She is involved in the EXDIGIT initiative, emphasizing interdisciplinary technologies in digital sciences. Her research lies at the intersection of human-centered computing, data science, and artificial intelligence, with a focus on context-aware recommender systems, particularly in the music and media domains. Her core interests include fairness and multi-method evaluation. Her multidisciplinary background drives her research activities.
More information on the keynote: https://sim4ia.org/sigir2025/#keynote
## Invitation of Tech/Infrastructure Talks
-----------------------------------------------------
We reserved a special time slot at the workshop for talks on recent technologies and/or infrastructures for (user) simulations, and invite you to submit your ideas for such talks at the workshop.
Send a short email with your idea in the form of a title and roughly half a page of abstract to sigir2025(a)sim4ia.org
Check out the tentative program, shared task data and description, the keynote announcement, and much more at https://sim4ia.org/sigir2025/
See you in Padua!
Sim4IA Organizers
Philipp Schaer, Christin Kreutz, Krisztian Balog, Timo Breuer, and Andreas Kruff
Dear all
You are warmly invited to submit an abstract to the Shifting Power in Language Learning and Applied Linguistics with GenAI conference, which will take place in Milton Keynes, UK and online on November 13-14, 2025.
This conference will explore how power is being shifted towards, away from, and between learners and educators by AI technologies, and the new dynamic and potential changes this is bringing about in applied linguistics, languages and cultures studies. Potential topics for the papers may include, but are not limited to:
* AI and its impact on the training and evolving roles of languages and applied linguistics educators and their relationships with learners;
* AI and its potential to support inclusive and personalised learning in languages and applied linguistics;
* AI integration into learning, teaching and assessment of languages, cultures and applied linguistics with a focus on ethical issues and sustainability challenges;
* Core concepts and theoretical frameworks guiding the integration of AI in applied linguistics;
* Core concepts and theoretical frameworks guiding the integration of AI in the learning and teaching of languages and cultures;
* Questions around the use of AI in carrying out research in languages, cultures and applied linguistics, and its impact on research processes and outputs.
Instructions for submission
We welcome submissions in the following formats:
* 20-minute presentations (online or in person)
* 40-minute facilitated discussions with up to 3 facilitators (online or in person)
Proposals should be submitted via email by May 31st, 2025, to ai-languages-conference(a)open.ac.uk
The following information will be requested during the submission process:
* Names, titles, contact info, institutional or organisational affiliation and short bio (max 100 words) for each presenter and facilitator
* Conference topic (selected from the list above)
* Session format (selected from the list above)
* Title of the abstract
* Abstract (max. 300 words)
Kind regards
Rachele
Dr Rachele De Felice (she/her) | Lecturer in Applied Linguistics
School of Languages and Applied Linguistics
The Faculty of Wellbeing, Education and Language Studies
The Open University
https://profiles.open.ac.uk/rachele-de-felice
Dear Colleagues,
The ACL 2025 Conference is pleased to announce that *registration is now
officially open*. We encourage you to register early to take advantage of
reduced rates.
Please note the following important deadlines for registration:
- *Early Registration:* Concludes on *Wednesday, July 2, 2025, AoE*.
- *Late Registration:* Will close for both In-Person and Virtual
attendees on *Friday, July 25, 2025, at 11:59 PM CET*.
- *Onsite Registration:* Will be available for both In-Person and
Virtual attendees from *Saturday, July 26, 2025, through August 1, 2025,
at 11:59 PM CET*.
Detailed information regarding the registration process can be found on the
official conference website: https://acl.swoogo.com/acl2025
We look forward to welcoming you to ACL 2025 in beautiful Vienna!
Sincerely,
The ACL Organization Team
--
Horacio Saggion
Full Professor / Chair in Computer Science and Artificial Intelligence
Head of the Natural Language Processing Group - TALN
Project Coordinator iDEM Project (HE)
Co-PI of the AI-BOOST project (HE)
Co-PI of the IDEAL project (HE)
Universitat Pompeu Fabra
https://twitter.com/h_saggion
https://www.linkedin.com/in/horacio-saggion-1749b916
We have three postdoc position openings at Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi. The project is based on a collaboration with a leading industry partner on the development of a conversational booking agent.
* Postdoctoral Research Scientist in Conversational AI & NLP
* Postdoctoral Research Scientist in Recommendation & Personalization
* Postdoctoral Research Scientist in Persuasive Language Generation
More information regarding responsibilities and requirements can be found on our webpage:
https://mbzuai-hiring.github.io/
- Start date: To be filled immediately, July 2025
- Duration: 1-year contract with possibility of extension
- Location: MBZUAI (https://mbzuai.ac.ae/), Abu Dhabi, UAE
- Apply via e-mail: NLP.IndustryProject(a)mbzuai.ac.ae
We look forward to receiving your application!
Regards,
Teresa
Teresa Lynn, PhD
Head of NLP Research Engagement
Natural Language Processing
P +971 2 811 3284 | W https://www.mbzuai.ac.ae/
This is the last call to participate in ADoBo 2025, the shared task on automatic detection of borrowings in Spanish.
To gain access to the data, make submissions, and check the leaderboard, please join the competition on Codabench. System submissions are due on May 26th.
https://www.codabench.org/competitions/7284/
TIMELINE
April 21: Dev set released.
May 12: Test set released.
May 26: System output submissions.
June 9: Working notes paper submission.
June 16: Notification of acceptance (peer review).
June 23: Camera ready paper submission.
September: ADoBo results to be presented at IberLEF 2025.
ORGANIZATION COMMITTEE
Elena Álvarez Mellado, Universidad Nacional de Educación a Distancia (UNED).
Julio Gonzalo, Universidad Nacional de Educación a Distancia (UNED).
Constantine Lignos, Brandeis University.
Jordi Porta Zamorano, Universidad Autónoma de Madrid (UAM).
TokShop: Tokenization Workshop (ICML 2025)
Submission to the Tokenization Workshop begins on April 14, 2025, via OpenReview. The deadline for submissions is May 30, 2025, at 11:59pm (anywhere on earth). Notifications of acceptance will be sent out on June 9, 2025, and camera-ready papers will be due shortly afterward at 11:59pm (anywhere on earth). The workshop will take place on July 18, 2025.
Workshop Description
The Tokenization Workshop (TokShop) at ICML aims to bring together researchers and practitioners from all corners of machine learning to explore tokenization in its broadest sense. We will discuss innovations, challenges, and future directions for tokenization across diverse data types and modalities.
Call for Papers
Topics of interest include:
- Subword Tokenization in NLP: Analysis of techniques such as BPE, WordPiece, and UnigramLM, as well as improvements for efficiency, interpretability, and adaptability.
- Multimodal Tokenization: Tokenization strategies for images, audio, video, and other modalities, including methods to align representations across different types of data.
- Multilingual Tokenization: Development of tokenizers that work robustly across languages and scripts, and investigation into failure modes tied to tokenization.
- Tokenizer Modification Post-Training: Methods for updating tokenizers after model training to boost performance and/or efficiency without retraining from scratch.
- Alternative Input Representations: Exploration of non-traditional tokenization approaches, such as byte-level, pixel-level, or patch-based representations.
- Statistical Perspectives on Tokenization: Empirical analysis of token distributions, compression properties, and correlations with model behavior.
By broadening the scope of tokenization research beyond language, this workshop seeks to foster cross-disciplinary dialogue and inspire new advances at the intersection of representation learning, data efficiency, and model design.
Submission guidelines
Our author guidelines follow the ICML requirements unless otherwise specified.
- Paper submission is hosted on OpenReview.
- Each submission should contain up to 9 pages, not including references or appendix (shorter submissions also welcome).
- Please use the provided LaTeX template (Style Files) for your submission. Please follow the paper formatting guidelines general to ICML as specified in the style files. Authors may not modify the style files or use templates designed for other conferences.
- The paper should be anonymized and uploaded to OpenReview as a single PDF.
- You may use as many pages of references and appendix as you wish, but reviewers are not required to read the appendix.
- Posting papers on preprint servers like arXiv is permitted.
- We encourage each submission to discuss the limitations as well as ethical and societal implications of their work, wherever applicable (but neither is required). These sections do not count towards the page limit.
- This workshop offers both archival and non-archival options for submissions. Archival papers will be indexed with proceedings, while non-archival submissions will not.
- The review process will be double-blind.
Read more: https://tokenization-workshop.github.io/
(apologies for multiple postings)
Dear colleagues,
We would like to inform you that registration for the eLex 2025 conference is now open (https://elex.link/elex2025/registration/). The deadline for the early-bird fee is 5 September 2025.
A call for Hornby bursary applications is also out (https://elex.link/elex2025/hornby-bursary/). The bursaries cover participants' registration fee, so if you intend to apply, please wait for results before paying the registration fees (you can still complete all the steps of the registration process and pay later).
Finally, special rates for rooms at the venue and partner hotels are available: https://elex.link/elex2025/venue/. Only a limited number of rooms are available, so early booking is advisable (there is a very friendly cancellation option).
Please monitor the conference website for further updates on the programme, proceedings and related news.
Looking forward to seeing you at the conference.
Best regards
Iztok Kosem
Head of the eLex 2025 organising committee
----------------------------
HealTAC 2025
June 16-18th, 2025, Glasgow (UK)
https://healtac2025.github.io/
----------------------------
Call for participation
----------------------------
The 8th Healthcare Text Analytics Conference (HealTAC 2025) invites everyone for three days of state-of-the-art discussions on healthcare text analytics. The programme features:
-- keynotes on "Addressing the Missing Context Problem in Foundation Models for Healthcare" (by Jason Fries, Stanford University) and "AI for Healthcare: Text as a Medium for Multimodal datasets" (by Alison O'Neil, Canon Medical Research Europe);
-- panels on "Opportunities and challenges in LLMs for health research: A multidisciplinary perspective on surfacing social inequalities, bias detection, and mitigation" and "Challenges in AI deployment within NHS";
-- 18 talks describing current PhD projects;
-- a workshop on "NLP in mental healthcare and research" (June 16th);
-- 4 demos and 9 lightning talks;
-- 24 posters presenting healthcare text analytics research.
The detailed programme is available at: https://healtac2025.github.io/programme/
----------------------------
Registration fees
----------------------------
Due to generous support from Health Data Research UK, CogStack, Frontiers, DataMind, University of Glasgow, Research Data Scotland and Healtex, the registration fee is only £100 (for students) and £200 (for everyone else), and includes the full 3-day programme, lunches, the conference dinner and even breakfast on day 1.
This early registration fee applies until May 29th. Registration details: https://healtac2025.github.io/registration/
----------------------------
Accommodation and travel
----------------------------
University accommodation is available to registered participants for only £43 per night. All details are available at: https://healtac2025.github.io/accommodation/
Follow the conference announcements on social media at #HEALTAC2025. We are looking forward to welcoming you to HealTAC 2025.
The 2nd LLMs4Subjects Shared Task: LLM-based Subject Tagging for the TIB Technical Library's Digital Catalog
Theme: The Development of Energy- and Compute-Efficient LLM Systems
Organized as part of the German Evaluation (GermEval 2025) Shared Task Series
September 10-12, 2025
Hildesheim, Germany
(co-located with KONVENS 2025 - Conference on Natural Language Processing)
2nd LLMs4Subjects Shared Task: https://sites.google.com/view/llms4subjects-germeval/
Join the Codabench Competition: https://www.codabench.org/competitions/8373/
KONVENS 2025: https://konvens-2025.hs-hannover.de/about/
Task Overview
LLMs4Subjects challenges the research community to develop cutting-edge LLM-based solutions for subject tagging of technical records from Leibniz University's Technical Library (TIBKAT). Participants are tasked with leveraging large language models (LLMs) to tag technical records using the GND taxonomy. The task involves bilingual language modeling, as systems must process technical documents in both German and English. Successful solutions may be integrated into the operational workflows of TIB, the Leibniz Information Centre for Science and Technology.
With the rapid advancements in LLMs, the focus is shifting toward making these models more energy- and compute-efficient while maintaining high performance. Recent innovations, such as the DeepSeek series, have demonstrated how techniques like mixture-of-experts (MoE) and model distillation can significantly reduce computational costs without sacrificing effectiveness.
The 2nd LLMs4Subjects shared task highlights the importance of efficiency in LLMs, encouraging participants to explore strategies that enhance model performance while optimizing for energy consumption and inference speed. We welcome approaches including, but not limited to, those that leverage model compression, quantization, efficient fine-tuning, and adaptive computation techniques to push the boundaries of sustainable AI development.
Subtasks
The 2nd LLMs4Subjects shared task organizes the following two subtasks:
Subtask 1 - Multi-Domain Classification of Library Records
Subtask 2 - Large-scale Multilabel Subject Indexing of Library Records
Important Dates
* Release of training data: March 8, 2025
* Release of testing data: May 30, 2025
* Deadline for system submissions: June 2, 2025
* Evaluation end: June 27, 2025
* Paper submission deadline: July 7, 2025
* Notification of acceptance: June 28, 2025
* Camera-ready paper due: August 15, 2025
* Workshop/KONVENS: September 10 - 12, 2025 (TBA)
Note: Submit your system outputs on our Codabench live leaderboard at https://www.codabench.org/competitions/8373/
2nd CfP: The 5th Workshop on Computational Linguistics for the
Political and Social Sciences (CPSS-2025)
https://cpss-sig.github.io/CPSS-2025
CPSS-2025 will be held in September 2025, co-located with KONVENS
<https://konvens-2025.hs-hannover.de> in Hildesheim, Germany.
The workshop will provide a forum for the presentation and discussion of
innovative research on all aspects of using CL/NLP techniques for the
political and social sciences, including:
* Modeling political communication with NLP (e.g. topic
classification, position measurement)
* Mining policy debates from heterogeneous textual sources
* Modeling complex social constructs (e.g. populism, polarization,
identity) with NLP methods
* Political and social bias in language models
* Methodological insights in interdisciplinary collaboration:
workflows, challenges, best practices
* NLP support to understand and support democratic decision making
* Resources and tools for Political/Social Science research
* and many more...
CPSS-2025 will be held in person.
Special Theme
The special theme of CPSS-2025 is
*Validation and best practices for using NLP in political and social
science research*.
In addition to CPSS's general topics, we specifically invite submissions
on this year's special theme, focussing on validation and best practices
for applying NLP techniques for research in the political and social
sciences. We are especially interested in papers addressing issues
related to:
* Data quality in human and synthetic data
* Data leakage and contamination, especially in LLMs
* New ways to collect data such as dataset donation
* Validation of results beyond the train-dev-test paradigm of NLP and
data science.
* Any other topics related to the special theme.
*Important Dates*
All submission deadlines are 11:59 p.m. UTC-12:00 “anywhere on Earth.”
Workshop papers due: June 13, 2025
Notification of acceptance: Aug 1, 2025
Camera-ready papers due: Aug 10, 2025
Workshop date: Sep 2025
*Submissions*
We solicit two types of submissions:
*archival papers* describing original and unpublished work (long papers:
max. 8 pages, references/appendix excluded; short papers: max 4 pages,
references/appendix excluded). Accepted papers will be published in the
ACL Anthology. For the submission format, refer to the KONVENS guidelines.
*non-archival papers* (1-page abstracts, references excluded) describing
ongoing work, PhD projects, or already published research.
For more details, please refer to the CPSS-2025 website:
https://cpss-sig.github.io/CPSS-2025
*CPSS 2025 organising committee*
Dennis Assenmacher (GESIS), Christopher Klamm (U-Mannheim), Gabriella
Lapesa (GESIS/U-Düsseldorf),
Simone Ponzetto (U-Mannheim), Ines Rehbein (U-Mannheim), Indira Sen
(U-Mannheim)
--
Ines Rehbein
Data and Web Science Group
University of Mannheim, Germany