**Apologies for cross-posting** The paper submission deadline for LDK
2025 has been extended until the 24th of March 2025!
CFP Below
==============
Call for Papers @ Fifth Conference on Language, Data and Knowledge (LDK
2025)
Dates: 9-12 September 2025
Location: Naples, Italy
Website: http://2025.ldk-conf.org
Twitter/X: https://x.com/LDKconference
Submission Deadline: 06/03/2025
Submission page: https://openreview.net/group?id=LDK/2025/Conference
==============
We invite submissions to the fifth biennial conference on Language, Data
and Knowledge (LDK 2025) to be held in Naples, Italy in September 2025.
This conference aims to bring together researchers from across different
disciplines concerned with the acquisition, treatment, curation and the
use of language data in the context of data science and knowledge-based
applications. This edition builds upon the success of the inaugural
event held in Galway, Ireland in 2017, the second LDK in Leipzig,
Germany in 2019, the third LDK in Zaragoza, Spain in 2021, and the
fourth LDK in Vienna, Austria in 2023.
Paper Submission
We welcome submissions of relevance to the topics listed below.
Submissions can be in the form of:
Long papers: 9–12 pages;
Short papers: 4–6 pages.
All submission lengths are given including references. Accepted
submissions will be published in an open-access conference proceedings
volume and indexed in ACL anthology and DBLP, free of charge for
authors. The ACL templates should therefore be used for all conference
submissions.
As the reviewing process is single-blind, submissions should not be
anonymised. Papers should be submitted via OpenReview at the following
address:
https://openreview.net/group?id=LDK/2025/Conference
All papers must represent original work. When submitted, the submission
must not have been previously published*, and the material in it must
not have been/be submitted for review at another journal or conference
while under review at LDK 2025.
*This excludes papers on preprint archives, such as arXiv, which we do
not consider to have been previously published.
The conference will be hybrid (face-to-face and remote). Note that at
least one author of each accepted paper must register to present the
paper at the conference (either remotely or on-site).
Topics
Relevant topics for the conference include, but are not limited to, the
following fields:
Language Data
Language data construction and acquisition
Language data annotation
FAIR data practices for language data
Language data portals and metadata about language data
Organisational and infrastructural management of language data
Multilingual, multimedia and multimodal language data
Evaluation, provenance and quality of language data
Visualisation of language data
Standards and interoperability of language data
Legal aspects of publishing language data
Under-resourced languages
e-Lexicography
Semantic processing
Knowledge Graphs
Linguistic Linked Data and the multilingual Semantic Web
Ontologies, terminologies, wordnets, framenets and related resources
Information and knowledge extraction (taxonomy extraction, ontology
learning)
Data, information and knowledge integration across languages
(Cross-lingual) ontology alignment
Entity linking and relatedness
Linked data profiling
Knowledge representation and reasoning
Knowledge graphs for corpora processing and analysis
Neuro Symbolic Artificial Intelligence
Methods and Applications for Language, Data and Knowledge
Question answering and semantic search
Text analytics on big data
NLP for language documentation and preservation
Speech recognition and synthesis
Spoken language processing
Semantic content management
Computer-aided language learning
Natural language interfaces to big data
Knowledge-based NLP
Deep learning and machine learning for and on LLOD
Language Models and Foundation Models (Language and Multimodal Models).
Generative Artificial Intelligence and Language, Data, Knowledge Graphs
Use Cases in Language, Data and Knowledge
Contributions are welcome where the topics above - and others within the
scope of Language, Data and Knowledge - are applied to domain-specific
use cases, including but not limited to: social sciences and humanities,
legal, life sciences, FinTech, cybersecurity.
Organising Committee
Conference Chairs:
Jorge Gracia, University of Zaragoza, Spain
Dagmar Gromann, University of Vienna, Austria
Program Chairs:
Mehwish Alam, Telecom Paris, Institut Polytechnique de Paris, France
Andon Tchechmedjiev, Institut Mines Telecom | EuroMov Digital Health in
Motion
Proceedings Chair:
Max Ionov, University of Zaragoza, Spain
Workshop and Tutorial Chairs:
Katerina Gkirtzou, ILSP/Athena Research Center, Greece
Slavko Zitnik, University of Ljubljana, Slovenia
Local Organisers:
Maria Pia Buono - University of Naples “L’Orientale”, Italy
Johanna Monti - University of Naples “L’Orientale”, Italy
Important Dates:
Paper submission deadline EXTENDED: 24th March, 2025
Acceptance/Rejection Notification: 8th May, 2025
Pre/Post Conference events: 9 to 12 September, 2025
Main conference: 10-11 September, 2025
All deadlines are 23:59 AoE (anywhere on Earth)
*** Third Call for Papers – DEADLINES UPDATED ***
We invite paper submissions to the 9th Workshop on Online Abuse and Harms (WOAH), which will take place on July 31/August 1 at ACL 2025.
Website: https://www.workshopononlineabuse.com/cfp.html
Important Dates
* Submission due: April 18, 2025
* ARR reviewed submission due: May 20, 2025
* Notification of acceptance: May 30, 2025
* Camera-ready papers due: June 13, 2025
* Workshop: July 31st - August 1st, 2025
Overview
Digital technologies have brought significant benefits to society, transforming how people connect, communicate, and interact. However, these same technologies have also enabled the widespread dissemination and amplification of abusive and harmful content, such as hate speech, harassment, and misinformation. Given the sheer volume of content shared online, addressing abuse and harm at scale requires the use of computational tools. Yet, detecting and moderating online abuse remains a complex task, fraught with technical, social, legal, and ethical challenges.
The 9th Workshop on Online Abuse and Harms (WOAH) invites paper submissions from a diverse range of fields, including but not limited to natural language processing, machine learning, computational social science, law, political science, psychology, sociology, and cultural studies. We explicitly encourage interdisciplinary research, technical and non-technical contributions, and submissions that focus on under-resourced languages. Non-archival papers and civil society reports are also welcome.
Topics covered by WOAH include, but are not limited to:
* New models or methods for detecting abusive and harmful online content, including misinformation;
* Biases and limitations in existing detection models or datasets for abusive and harmful content, especially those in commercial use;
* Development of new datasets and taxonomies for online abuse and harms;
* Novel evaluation metrics and procedures for detecting harmful content;
* Analyses of the dynamics of online abuse, its propagation, and its impact on different communities;
* Social, legal, and ethical considerations in detecting, monitoring, and moderating online abuse.
Special Theme: Harms Beyond Hate Speech
In its 9th edition, WOAH highlights the theme Harms Beyond Hate Speech. We aim to expand the conversation beyond conventional definitions of harmful content by exploring the nuanced ways online harms manifest—such as technologically mediated inauthentic behavior, the power of technologies to reshape perceptions and opinions, and their potential to incite discrimination, hostility, violence, or even genocide. Additionally, we emphasize the diverse targets affected by such harms and the unique considerations computational interventions demand.
To facilitate this exploration, we invite NLP researchers, social scientists, cultural scholars, and practitioners to engage with key issues, including child sexual abuse material, radicalization, misinformation, platform policies, security, and the politics of computational approaches. By fostering interdisciplinary collaboration, our goal is to deepen understanding of these complex phenomena and advance effective, ethical solutions
Submission
Submission is electronic, using the Softconf START conference management system.
Submission link: https://softconf.com/acl2025/woah2025/
The workshop will accept three types of papers:
1) Academic Papers (long and short): Long papers of up to 8 pages, excluding references, and short papers of up to 4 pages, excluding references. Unlimited pages for references and appendices. Accepted papers will be given an additional page of content to address reviewer comments. Previously published papers cannot be accepted.
2) Non-Archival Submissions: Up to 2 pages, excluding references, to summarise and showcase in-progress work and work published elsewhere.
3) Civil Society Reports: Non-archival submissions, with a minimum of 2 pages and no upper limit. Can include work published elsewhere.
All submissions must use the official ACL style files<https://github.com/acl-org/acl-style-files>. Submissions that do not conform to the required styles, including paper size, margin width, and font size restrictions, will be rejected without review. All submissions should adhere to the workshop policies https://www.workshopononlineabuse.com/policies.html.
WOAH Community
We are excited to share the WOAH community Slack channel — a workspace for researchers interested in or working on understanding and addressing online abuse and harms!
Join us here: https://join.slack.com/t/hatespeechdet-47d7560/shared_invite/zt-2a8d96j4z-g…
Contact Info
Please send any questions about the workshop to organizers(a)workshopononlineabuse.com<mailto:organizers@workshopononlineabuse.com>
Organisers
Agostina Calabrese, University of Edinburgh
Christine de Kock, University of Melbourne
Debora Nozza, Bocconi University
Flor Miriam Plaza-del-Arco, Bocconi University
Zeerak Talat, University of Edinburgh
Francielle Vargas, University of São Paulo
The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336. Is e buidheann carthannais a th’ ann an Oilthigh Dhùn Èideann, clàraichte an Alba, àireamh clàraidh SC005336.
The Research Training Group 2853 “Neuroexplicit Models of Language,
Vision, and Action” is looking for
*up to 8 PhD Students - Fall 2025*
Neuroexplicit models combine neural and human-interpretable (“explicit”)
models in order to overcome the limitations that each model class has
separately. They include neurosymbolic models, which combine neural and
symbolic models, but also e.g. combinations of neural and physics-based
models. In the RTG, we will improve the state of the art in natural
language processing (“Language”), computer vision (“Vision”), and
planning and reinforcement learning (“Action”). We also develop novel
machine learning techniques for neuroexplicit models (“Foundations”).
Our overarching aim is to contribute to a better understanding of the
cross-cutting design principles of effective neuroexplicit models
through interdisciplinary collaboration.
The RTG is scheduled to grow to a total of *24 PhD students* by 2025. An
excellent and international group of twelve PhD students and one postdoc
have already joined the RTG. Through the inclusion of ~20 associated PhD
students and postdocs funded from other sources, it will be one of the
largest research centers on neuroexplicit or neurosymbolic models in the
world.
The RTG brings together researchers at Saarland University, the Max
Planck Institute for Informatics, the Max Planck Institute for Software
Systems, the CISPA Helmholtz Center for Information Security, and the
German Research Center for Artificial Intelligence (DFKI). All of these
institutions are collocated on the same campus in Saarbrücken, Germany.
The positions will be *funded for four years* at the TV-L E13 100% pay
scale. They are intended to start in September 2025. You should have or
be about to complete an MSc degree in computer science or a related
field and have demonstrated expertise in one of the research areas of
the RTG, e.g. through an excellent Master’s thesis or relevant publications.
The RTG is part of the Saarland Informatics Campus, one of the *leading
centers for research* in computer science, artificial intelligence, and
natural language processing in Europe. The Saarland Informatics Campus
brings together 900 researchers and 2500 students from 81 countries. The
CISPA Helmholtz Center, located on the same campus, is home to an
additional 350 researchers and on track to grow to 800 by 2026.
Researchers at SIC and CISPA are part of the ELLIS network and have been
awarded more than 40 ERC grants.
Each PhD student in the RTG will be *jointly supervised by two PhD
advisors* from the list of Principal Investigators below. Each student
will freely define their own research topic; we encourage the choice of
topics that cross the traditional boundaries of research fields.
Students may be affiliated with Saarland University or with one of the
participating institutes.
Vera Demberg, Saarland University - Computational Linguistics
Jörg Hoffmann, Saarland University - AI Planning
Dietrich Klakow, Saarland University - Natural Language Processing
Alexander Koller, Saarland University - Computational Linguistics
Bernt Schiele, MPI for Informatics - Computer Vision, Machine Learning
Philipp Slusallek, DFKI and Saarland University - Computer Graphics,
Artificial Intelligence
Christian Theobalt, MPI for Informatics - Visual Computing, Machine Learning
Mariya Toneva, MPI for Software Systems - Computational Neuroscience,
Machine Learning
Isabel Valera, Saarland University - Machine Learning
Jilles Vreeken, CISPA - Machine Learning, Causality
Joachim Weickert, Saarland University - Mathematical Data Analysis
Verena Wolf, DFKI and Saarland University - Modeling and Simulation,
Reinforcement Learning
Ellie Pavlick, Brown University and Google AI, will join us regularly as
a Mercator Fellow.
Please send your application by *24th March 2024* to
apply(a)neuroexplicit.org and include the reference number W2616. We aim
to conduct job interviews in April-May 2025.
For more details on the position, including what materials to submit
with your application, please see our website:
<https://www.neuroexplicit.org/jobs/#phd-2023>https://www.neuroexplicit.org/jobs/
Dear colleagues,
registration is now open for our upcoming workshop on "Large Language
Models (LLMs) in the History, Philosophy, and Sociology of Science (HPSS)"!
The workshop will focus on exploring use cases and proposals for how, and
to what extent, LLMs might help overcome long-standing challenges in
studies of how science works. The event will take place from April 2–4,
2025, at Technische Universität Berlin, Germany. The event will open at
around 1:30 PM on April 2 and end at around 5 PM on April 4. Further
information is available at
https://www.tu.berlin/hps-mod-sci/workshop-llms-for-hpss
Register here:
https://events.tu-berlin.de/en/events/0194f579-606f-75e9-9be9-c2167c356171/…
Early registration is greatly appreciated as it will help us with further
planning. The deadline for registration is March 28, 2025 but we will also
try to accommodate “walk-ins”.
Please don’t hesitate to reach out in case you need more information.
Kind regards,
The organizers
**** We apologize for the multiple copies of this email. In case you are
already registered to the next webinar, you do not need to register
again. ****
------------------------------------------------------------------------
Dear colleague,
We are happy to announce the next webinar in the Language Technology
webinar series organized by the HiTZ Chair of AI< (https://hitz.eus).
You can view the videos of previous webinars and the schedule for
upcoming webinars here: http://www.hitz.eus/webinars
Next webinar:
*Speaker:* Christian Herff (Maastricht University)
*Title:* Speech neuroprostheses based on intracranial EEG
*Date: * Thursday, March 6, 2025 - 15:00 CET
*Summary:* Speech is our most natural way of communication and the loss
of the ability to speak is therefore devastating to patients. A speech
neuroprostheses that directly reconstructs speech processes from neural
activity could provide a new means of communications to these severely
affected patients. In this presentation, I will present some approaches
to reconstruct different representations of speech from intracranial
recordings and highlight how they can be used to build a speech
neuroprosthesis. The decoding of speech processes is particularly
challenging, as not only the neural, but also the target signal has
complex, nonlinear dynamics. I will stress the use of interpretable
machine learning models for this task to ensure that meaningful activity
is decoded and scientific insights might be generated as a side product.
*Bio:* Dr. Christian Herff is an assistant professor in the School for
Mental Health and Neuroscience at Maastricht University where he leads
the invasive BCI research line. His research interest lays in the
application of machine learning technology to neurophysiological data
for Brain-Computer Interfaces and neuroscience research. With a
particular focus on the decoding of speech processes from intracranial
data, he tries to improve the lives of severely paralyzed patients while
simultaneously improving our understanding of complex higher order
cognition. He emphasizes the ability to achieve interpretable results
based on computational models. In particular, visualization of complex
dynamic models, such as deep neural networks, is of interest to him.
*
Upcoming webinars:*
· Emanuele Bugliarello (Thursday, April 3, 2025)
· André F. T. Martins (Thursday, May 8, 2025)
· Mirella Lapata (Thursday, June 5, 2025)
If you are interested in participating, please complete this
registration form: http://www.hitz.eus/webinar_izenematea
If you cannot attend this seminar, but you want to be informed of the
following HiTZ webinars, please complete this registration form instead:
http://www.hitz.eus/webinar_info
Best wishes,
HiTZ Zentroa
P.S: HiTZ will not grant any type of certificate for attendance at these
webinars.
(apologies for cross-posting)
Dear colleague,
We invite you to participate in the 2025 edition of the CheckThat! Lab at
CLEF 2025. This year, we feature four tasks ---one follow-up and three
new--- that correspond to important components within and around the full
fact-checking pipeline in multiple languages:
Task 1 Subjectivity in news articles. to spot text that should be processed
with specific strategies; benefiting the fact-checking pipeline. Available
in Arabic, English, Bulgarian, German, Italian, and Multilingual.
Task 2 Claim Normalization. to simplify the primary claim made in the
social media post into a concise form. This task is offered in 20
languages: English, Arabic, Bengali, Czech, German, Greek, French, Hindi,
Korean, Marathi, Indonesian, Dutch, Punjabi, Polish, Portuguese, Romanian,
Spanish, Tamil, Telugu, Thai.
Task 3 Fact-Checking Numerical Claims. to verify claims with numerical
quantities and temporal expressions. Available in Arabic, English and
Spanish.
Task 4 Scientific Web Discourse Processing (SciWeb). to (a) classify
different forms of science-related online discourse and (b) retrieve the
scientific paper that serves as the source for the claim from a given pool
of candidate scientific papers. Available in English.
Register and participate:
https://clef2025-labs-registration.dei.unipd.it/registrationForm.php
Further information: https://checkthat.gitlab.io/
Datasets: https://gitlab.com/checkthat_lab/clef2025-checkthat-lab
Important Dates
---------------------
- November 2024: Lab registration opens
- December 2024: Release of the training materials
- 25 April 2025: Lab registration closes
- 30 April 2025: Beginning of the evaluation cycle (test sets release)
- 10 May 2025 (23:59 AOE): End of the evaluation cycle (run submission)
- 30 May 2025: Deadline for the submission of working notes [CEUR-WS]
- 30 May – 27 June 2025: Review process of participant papers
- 9 June 2025: Submission of Condensed Lab Overviews [LNCS]
- 16 June 2025: Notification of Acceptance for Condensed Lab Overviews
[LNCS]
- 23 June 2025: Camera Ready Copy of Condensed Lab Overviews [LNCS] due
- 27 June 2025: Notification of Acceptance for Participant Papers [CEUR-WS]
- 7 July 2025: Camera Ready Copy of Participant Papers and Extended Lab
Overviews [CEUR-WS] due
- 21-25 July 2025: CEUR-WS Working Notes Preview for Checking by Authors
and Lab Organizers
- 9-12 September 2025: CLEF 2025 Conference in Madrid, Spain
Best regards,
The CLEF-2025 CheckThat! Lab Shared Task Organizers
Dear Corpora-List,
Please see below for a great postdoc position at Cornell on behalf of my
colleague.
*Opportunity: Postdoctoral Associate in Digital Humanities, Cornell
University*
Cornell University’s Critical Inquiry into Values, Imagination, and Culture
(CIVIC) initiative seek a *Postdoctoral Associate in Digital Humanities, *to
be based in the Department of Literatures in English at Cornell University
in Ithaca, New York. We especially welcome applications from scholars
engaging data-driven or computational methods in the humanities. The
postdoctoral associate will collaborate closely with faculty and librarians
as part of Cornell’s Digital Humanities CIVIC research fellows group
<https://url.usb.m.mimecastprotect.com/s/ZbsyCWWA5rT68LlqES6fRToRKqK?domain=…>
.
Learn more about the position and apply at this link
<https://url.usb.m.mimecastprotect.com/s/Q1l5CXYBgvf4ZPpKQHVhJTWBp2Z?domain=…>
by March 21, 2025. The appointment period will begin on July 1, 2025. The
pay range for this position is $61,008 - $87,000. Actual salary offers in
the College of Arts and Sciences will be based on education, experience,
discipline, and relevant skills.
*Questions can be directed to Lindsay Thomas (lthomas(a)cornell.edu
<lthomas(a)cornell.edu>), Associate Professor of Literatures in English. *
*Iliana Burgos* (she/her)
Emerging Data Practices Librarian
Digital Scholarship Services
<https://url.usb.m.mimecastprotect.com/s/MH0MCYVDjwfDXWGl7I9iLTx8K0M?domain=…>
Olin Library | Cornell University
Ithaca, New York 14853
--
Dr Heather Froehlich
w // http://hfroehli.ch
t // @heatherfro
*this email address is for mailing list subscriptions only and is not
monitored regularly.*
We are excited to share that the ArchEHR-QA development set is now available via PhysioNet:
* https://doi.org/10.13026/zzax-sy62
Shared Task Website:
* https://archehr-qa.github.io/
Introduction to ArchEHR-QA 2025
Responding to patients’ medical inbox messages through patient portals is increasingly a contributor to clinician burden. To this end, automatically generating answers to questions from patients considering their medical records is important. The overarching goal of the ArchEHR-QA 2025 shared task is to develop automated responses to patients' questions by generating answers that are grounded in key clinical evidence from their electronic health records (EHRs).
ArchEHR-QA Dataset
The proposed dataset comprises hand-curated, realistic patient questions (reflective of patient portal messages), relevant focus areas identified within these questions (as determined by a clinician), corresponding clinician-rewritten versions (crafted to aid in formulating responses), and note excerpts providing essential clinical context.
For more information and examples, please visit the shared task website at https://archehr-qa.github.io/.
Important Dates
(Tentative)
* Release of the public and hidden test datasets: March 25 (Tuesday), 2025
* Submission of system responses: April 25 (Friday), 2025
* Submission of shared task papers (optional): May 2 (Friday), 2025
* Notification of acceptance: May 10 (Saturday), 2025
* BioNLP Workshop Date: July 31 (Thursday) OR August 1 (Friday), 2025
We are also looking for people to join the program committee, where the responsibilities will include reviewing papers. If you are interested, please send an email to sarvesh.soni(a)nih.gov<mailto:sarvesh.soni@nih.gov>.
Website: https://archehr-qa.github.io/
Google Group: https://groups.google.com/g/archehr-qa
Email: sarvesh.soni(a)nih.gov<mailto:sarvesh.soni@nih.gov>
Looking forward to your participation,
Sarvesh Soni, National Library of Medicine, US
Dina Demner-Fushman, National Library of Medicine, US
Dear Colleagues,
I would like to inform you that the submission deadline for the Advances in Retrieval-Augmented Generation (RAG) for Specialized Domains Workshop, taking place in Singapore, has been extended. The new final deadline is March 7, 2025.
This workshop is part of the International Conference on Computational Science (ICCS 2025), which will be held from July 7 to July 9, 2025, in Singapore.
Retrieval-Augmented Generation (RAG) has emerged as a critical and rapidly evolving theme in artificial intelligence (AI) and natural language processing (NLP). By integrating traditional and modern information retrieval (IR) techniques with the generative capabilities of large language models (LLMs), RAG systems address key limitations related to knowledge updates and domain specificity. This approach is particularly valuable in high-stakes fields such as law, biology, physics, and medicine, where accuracy and reliability are paramount.
This workshop aims to bring together researchers and practitioners working on RAG systems to discuss novel approaches, share insights, and explore emerging challenges in efficiency, scalability, domain adaptation, retrieval optimization, and evaluation. Topics of interest include, but are not limited to:
* Innovations in RAG architectures and retrieval models
* Techniques for improving efficiency, scalability, and storage
* Domain-specific applications in legal, biomedical, and scientific fields
* Fusion methods and retrieval optimization strategies
* Evaluation benchmarks and robustness techniques
* Ethical considerations, cross-lingual applications, and challenges in low-resource settings
We encourage you to share this information with students and other potentially interested individuals. More details about the workshop, including submission guidelines, can be found on the official conference website:
🔗 ICCS 2025 – RAG Workshop [https://www.iccs-meeting.org/iccs2025/]
We look forward to receiving your submissions and engaging in fruitful discussions at the workshop.
Please feel free to reach out if you have any questions.
Best regards,
Best regards,
Magda Król
The next meeting of the Edge Hill Corpus Research Group will take place online (MS Teams) on Friday 7 March 2025, 3:15-4:30 pm (GMT).
Topic: LLMs
Speaker: Yannis Korkontzelos<https://research.edgehill.ac.uk/en/persons/yannis-korkontzelos> (Edge Hill University, UK)
Title: Detecting Text generated by Large Language Models: A Novel Statistical Technique to address Paraphrasing
The abstract and registration link are here: https://sites.edgehill.ac.uk/crg/next
Attendance is free. Registration closes on Thursday 6 March.
If you have problems registering, or have any questions, please send an email to: gabrielc(a)edgehill.ac.uk<mailto:gabrielc@edgehill.ac.uk>
________________________________
Edge Hill University<http://ehu.ac.uk/home/emailfooter>
Modern University of the Year, The Times and Sunday Times Good University Guide 2022<http://ehu.ac.uk/tef/emailfooter>
University of the Year, Educate North 2021/21
________________________________
This message is private and confidential. If you have received this message in error, please notify the sender and remove it from your system. Any views or opinions presented are solely those of the author and do not necessarily represent those of Edge Hill or associated companies. Edge Hill University may monitor email traffic data and also the content of email for the purposes of security and business communications during staff absence.<http://ehu.ac.uk/itspolicies/emailfooter>