Multimodal Semantic Representations (MMSR II)
Co-located with IWCS 2025 (https://iwcs2025.github.io/)
22-24 September, Düsseldorf, Germany
(workshop on 24 September)
Workshop website: https://mmsr-workshop.github.io/
Description:
The demand for more sophisticated natural human-computer and human-robot interactions is rapidly increasing as users become more accustomed to conversation-like interactions with AI and NLP systems. Such interactions require not only the robust recognition and generation of expressions through multiple modalities (language, gesture, vision, action, etc.), but also the encoding of situated meaning.
When communications become multimodal, each modality in operation provides an orthogonal angle through which to probe the computational model of the other modalities, including the behaviors and communicative capabilities afforded by each. Multimodal interactions thus require a unified framework and control language through which systems interpret inputs and behaviors and generate informative outputs. This is vital for intelligent and often embodied systems to understand the situation and context that they inhabit, whether in the real world or in a mixed-reality environment shared with humans.
Furthermore, multimodal large language models appear to offer the possibility for more dynamic and contextually rich interactions across various modalities, including facial expressions, gestures, actions, and language. We invite discussion on how representations and pipelines can potentially integrate such state-of-the-art language models.
We solicit papers on multimodal semantic representation, including but not limited to the following topics:
- Semantic frameworks for individual linguistic co-modalities (e.g. gaze, facial expression);
- Formal representation of situated conversation and embodiment, including knowledge graphs, designed to represent epistemic state;
- Design, annotation, and corpora of multimodal interaction and meaning representation;
- Challenges (including cross-lingual and cross-cultural) in multimodal representation and/or processing;
- Criteria or frameworks for evaluation of multimodal semantics;
- Challenges in aligning co-modalities in formal representation and/or NLP tasks;
- Design and implementation of neurosymbolic or fusion models for multimodal processing (with a representational component);
- Methods for probing knowledge of multimodal (language and vision) models;
- Virtual and situated agents that embody multimodal representations of common ground.
Submission Information:
Two types of submissions are solicited: long papers and short papers. Long papers should describe original research and must not exceed 8 pages, excluding references. Short papers (typically system or project descriptions, or ongoing research) must not exceed 4 pages, excluding references. Accepted papers get an extra page in the camera-ready version.
We strongly encourage students to submit to the workshop.
Papers should be formatted using the IWCS style files, available at: https://iwcs2025.github.io/call_for_papers
Submission Link: https://openreview.net/group?id=IWCS/2025/Workshop/MMSR
Please do not hesitate to reach out with any questions.
Richard Brutti, Lucia Donatelli, Nikhil Krishnaswamy, Kenneth Lai, & James Pustejovsky (MMSR II organizers)
LAST CALL FOR PAPERS
The 12th Workshop on Argument Mining @ ACL 2025
July 31, 2025
https://argmining-org.github.io/2025/
The 12th Workshop on Argument Mining will be held on July 31, 2025, in Vienna, Austria, together with ACL 2025.
The Workshop on Argument Mining provides a regular forum for presenting and discussing cutting-edge research in argument mining (a.k.a argumentation mining) for academic and industry researchers. By continuing a series of eleven successful previous workshops, this edition will welcome the submission of long and short papers, as well as extended abstracts and PhD proposals. It will also feature a number of shared tasks and a keynote talk.
IMPORTANT DATES
*** Direct paper submission deadline (OpenReview): April 17, 2025 [due to constraints in the publication of the proceedings, we will not be able to extend the deadline!]
Paper commitment from ARR: May 21, 2025
Notification of acceptance: May 28, 2025
Camera-ready papers due: June 4, 2025
Workshop: July 31, 2025
INVITED SPEAKER
Andreas Vlachos, University of Cambridge
PANEL
Topic: "Broadening the Scope of Argument Mining".
Panelists: check the website!
SHARED TASKS
1, Critical Questions Generation https://hitz-zentroa.github.io/shared-task-critical-questions-generation/
2. Multimodal Argumentative Fallacy Detection and Classification on Political Debates https://nlp-unibo.github.io/mm-argfallacy/2025/
TOPICS OF INTEREST
- Identification, Assessment, and Analysis of Arguments
- Identification of argument components (e.g., premises and conclusions)
- Structure analysis of arguments within and across documents
- Relation Identification between arguments and counterarguments (e.g., support and attack)
- Creation and evaluation of argument annotation schemes, relationships to linguistic and discourse annotations, (semi-) automatic argument annotation methods and tools, and creation of argumentation corpora
- Assessment of arguments for various properties (e.g., stance, clarity)
- Generation of Arguments, Multi-modal and Multi-lingual Argument Mining
- Automatic generation of arguments and their components
- Consideration of discourse goals in argument generation
- Argument mining and generation from multi-modal/multi-lingual data
- Mining and Analysis of different Genres and Domains of Arguments
- Argument mining in specific genres and domains (e.g., education, law, scientific writing)
- Analysis of unique styles within genres (e.g., short informal text, highly structured writing)
- Modelling, assessing, and critically reflecting on the argumentative reasoning capabilities of Large Language Models
- Knowledge Integration, Information Retrieval, and Real-world Applications
- Integration of commonsense and domain knowledge into argumentation models
- Combination of information retrieval methods with argument mining
- Real-world applications, including argument web search, opinion analysis and summarization, and misinformation detection
- Interdisciplinary interfaces of Argument Mining
- Mining political discourse, by experts and laypeople
- Argument mining support for deliberation
- Persuasion and convincingess from a psychological perspective
- Subjectivity, disagreements and perspectivism in argumentation
- Ethical Considerations and Future Reflections
- Reflection on the ethical aspects and societal impact of argument-mining methods
- Reflection on the future of argument mining in light of the fast advancement of large language models (LLMs)
SUBMISSIONS
The organizing committee welcomes submitting long papers, short papers, extended abstracts and PhD proposals. Accepted papers will be presented via oral or poster presentations. Long and short papers will be included in the ACL proceedings as workshop papers. Extended abstracts and PhD proposals will be non-archival.
- Long paper submissions must describe substantial, original, completed, and unpublished work. Wherever appropriate, concrete evaluation and analysis should be included. Long papers must be at most eight pages, including title, text, figures, and tables. An unlimited number of pages is allowed for references. Two additional pages are allowed for appendices, and an extra page is allowed in the final version to address reviewers’ comments.
- Short paper submissions must describe original and unpublished work. Please note that a short paper is not a shortened long paper. Instead, short papers should have a point that can be made in a few pages, such as a small, focused contribution, a negative result, or an interesting application nugget. Short papers must be at most four pages, including title, text, figures, and tables. An unlimited number of pages is allowed for references. One additional page is allowed for the appendix, and an extra page is allowed in the final version to address reviewers’ comments.
- Extended abstracts must be at most two pages including references describing ongoing projects, interesting pieces of data or results, or already published work.
- PhD proposals must describe PhD projects being or to be developed within the broad field of natural language argumentation processing. PhD proposals must be at most four pages including the main research directions or challenges being investigated, the specific contributions made (on the research direction), and the directions for the remaining work. A dedicated poster session will be hosted, allowing students to get feedback and discuss their work with a broad and multidisciplinary community.
Multiple Submissions
ArgMining 2025 will not consider any paper under review in a journal or another conference or workshop at the time of submission, and submitted papers must not be submitted elsewhere during the review period.
ArgMining 2025 will also accept submissions of ARR-reviewed papers, provided that the ARR reviews and meta-reviews are available by the ARR commitment deadline (May 21). However, ArgMining 2025 will not accept direct submissions that are actively under review in ARR, or that overlap significantly (>25%) with such submissions.
Submission Format
All long, short, and non-archival submissions must follow the two-column ACL 2025 format. Authors are expected to use the LaTeX or Microsoft Word style template (https://github.com/acl-org/acl-style-files). Submissions must conform to the official ACL style guidelines contained in these templates. Submissions must be electronic and in PDF format.
Submission Link and Deadline For Direct Submissions
Authors have to fill in the submission form in the OpenReview system and upload a PDF of their paper before April 17, 2025, 11:59 pm UTC-12h (anywhere on earth).
https://openreview.net/group?id=aclweb.org/ACL/2025/Workshop/ArgMining
For the ARR commitment process, we will provide details in our second call for papers.
Double Blind Review
ArgMining 2025 will follow the ACL policies for preserving the integrity of double-blind review for long and short paper submissions. Papers must not include authors’ names and affiliations. Furthermore, self-references or links (such as GitHub) that reveal the author’s identity, e.g., “We previously showed (Smith, 1991) …” must be avoided. Instead, use citations such as “Smith previously showed (Smith, 1991) …” Papers that do not conform to these requirements will be rejected without review. Papers should not refer, for further detail, to documents that are not available to the reviewers. For example, do not omit or redact important citation information to preserve anonymity. Instead, use the third person or named reference to this work, as described above (“Smith showed” rather than “we showed”). Papers may be accompanied by a resource (software and/or data) described in the paper, but these resources should also be anonymized.
Unlike long and short papers, demo descriptions will not be anonymous. Demo descriptions should include the authors’ names and affiliations, and self-references are allowed.
ANONYMITY PERIOD (taken from the ACL call for papers in verbatim for the most part)
We follow the ACL Policies for Review and Citation. Submissions must be anonymized, but there is no anonymity period or limitation on posting or discussing non-anonymous preprints while the work is under peer review.
BEST PAPER AWARD
In order to recognize significant advancements in argument mining science and technology, ArgMining 2025 will include the Best Paper award. All papers at the workshop are eligible for the best paper award, and a selection committee consisting of prominent researchers in the fields of interest will select the award recipients.
SPONSORS
CITEC, IBM, GSCL
ArgMining 2025 ORGANIZING COMMITTEE
Elena Chistova, Laboratory for Analysis and Controllable Text Generation Technologies, RAS
Philipp Cimiano, Bielefeld University
Shohreh Haddadan, Machine learning department, Moffitt Cancer Center
Gabriella Lapesa, Leibniz Institute for the Social Sciences (GESIS), Cologne, and Heinrich-Heine University Düsseldorf
Ramon Ruiz-Dolz, Centre for Argument Technology (ARG-tech), University of Dundee
24th International Semantic Web Conference (ISWC 2025)
Nara, Japan
November 2-6, 2025
Follow us:
Twitter/X: @iswc_conf #iswc_conf ( https://twitter.com/iswc_conf )
LinkedIn: https://www.linkedin.com/company/iswc/
Mastodon Social: https://mastodon.social/@iswc_conf
Bsky: https://bsky.app/profile/iswc-conf.bsky.social
ISWC 2025 features multiple tracks - including Research, Resources, and
In-Use, among others, Hence, authors are kindly asked to check out the
calls of each track to choose the one that best fits their contribution.
=========================
Call for Research Track Papers
The research track of ISWC 2025 solicits novel and significant research
contributions addressing theoretical, analytical, and empirical aspects of
the Semantic Web. We welcome work describing original and replicable
research showing evidence of significant contribution to the Semantic Web.
More details: https://iswc2025.semanticweb.org/#/calls/research
Research Track Chairs:
Contact: iswc2025-research(a)easychair.org
Daniel Garijo, Universidad Politécnica de Madrid, Spain
Sabrina Kirrane, Vienna University of Economics and Business, Austria
========================
Call for Resource Track Papers
The ISWC 2025 Resources Track aims to promote the sharing of resources that
support, enable, or utilize semantic web research. We welcome descriptions
of resources that leverage knowledge representation based on Semantic Web
standards or other graph data models to improve the acquisition,
processing, and sharing of data on the web.
More details: https://iswc2025.semanticweb.org/#/calls/resource
Resource Track Chairs:
Contact: iswc2025-resource(a)easychair.org
Cogan Shimizu, Wright State University, US
Angelo Salatino, KMi,The Open University, UK
=======================
Call for In-use Track Papers
The In-Use track seeks submissions describing applied research as well as
software tools, systems, or architectures that benefit from the use of
Semantic Web and Knowledge Graph technologies (including, but not limited
to, technologies based on the Semantic Web standards). Importantly,
submitted papers should provide convincing evidence of the use of the
proposed application or tool by the target user group, preferably outside
the group that conducted the development and, more broadly, outside the
Semantic Web and Knowledge Graph research communities.
More details: https://iswc2025.semanticweb.org/#/calls/in-use
In-Use Track Chairs:
Contact: iswc2025-in-use(a)easychair.org
Maribel Acosta
Technical University of Munich, Germany
Andrea Giovanni Nuzzolese
CNR - Institute of Cognitive Sciences and Technologies, Italy
===========================================
Important Dates:
Abstract submission due May 6th, 2025
Full paper submission due May 13th, 2025
Rebuttal June 17th - 20th, 2025
Notifications July 17th, 2025
Camera ready papers due July 31st, 2025
Website: https://iswc2025.semanticweb.org/
Would you like to know about the host city, Nara? Check out this blog:
https://iswc2025.semanticweb.org/#/blogs/host
--
*Dr.-Ing. **Genet Asefa Gesese*
Head of Machine Learning Department (Abteilungsleitung Maschinelles Lernen)
FIZ Karlsruhe – Leibniz Institute for Information Infrastructure, Germany
( *https://www.fiz-karlsruhe.de/en/bereiche/lebenslauf-und-publikationen-dr-ing-genet-asefa-gesese
<https://www.fiz-karlsruhe.de/en/bereiche/lebenslauf-und-publikationen-dr-in…>*
)
AND
Karlsruhe Institute of Technology (KIT)
*( https://ise.aifb.kit.edu/21_89.php
<https://ise.aifb.kit.edu/21_89.php> )*
LLMs with Limited Resources for Slavic Languages @ WMT2025 @ EMNLP2025
Website: https://www2.statmt.org/wmt25/limited-resources-slavic-llm.html
Join our Google Group! https://groups.google.com/g/slavic-llms-mt2025
HuggingFace Collection:
https://huggingface.co/collections/tum-nlp/llms-for-slavic-languages-67f399…
This shared task explores how LLMs perform on MT and QA jointly, aiming to
investigate task synergy under limited data and compute resources.
Ukrainian (uk) is a mid-resource language (~40M L1 speakers), while Upper
Sorbian (hsb) and Lower Sorbian (dsb) are minority West Slavic languages
(30k and 7k L1 speakers, respectively) spoken in Germany.
Data Overview
Ukrainian
-
MT directions: en→uk, cs→uk
-
QA: Derived from high-school graduation exams (ZNO)
-
Training sets examples:
-
MT: WMT24++ <https://huggingface.co/datasets/google/wmt24pp>, SMOL
<https://huggingface.co/datasets/google/smol>
-
QA: UNLP2024 <https://huggingface.co/datasets/osyvokon/zno>, ZNO-EVAL
<https://github.com/NLPForUA/ZNO>, Cohere INCLUDE
<https://huggingface.co/datasets/CohereForAI/include-base-44>
Upper Sorbian & Lower Sorbian (two separate tracks)
-
MT directions: de→hsb, de→dsb
-
QA: Multiple-choice questions based on actual CEFR-based language
certification exams (A1–C1 levels)
-
We will prepare the following resources:
-
Parallel & monolingual corpora via Witaj-Sprachzentrum and Leipzig
Corpora Collection;
-
Previous WMT low-resource tracks (2020–2022);
-
QA task adapted from language certifications of different levels.
Submission Guidelines
-
Models must produce both MT & QA outputs for the chosen language(s);
-
Submissions are language-specific; submit to one or multiple language
tracks;
-
Participants can only use one of the following base models that are
restricted
to 3B parameters maximum:
-
Qwen2.5-3B-Instruct <https://huggingface.co/Qwen/Qwen2.5-3B-Instruct>
-
Qwen2.5-1.5B <https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct>
-
Qwen2.5-0.5B <https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct>
-
Quantized or Unsloth variants from HuggingFace collections
Key Dates (AoE)
-
Registration opens now!: Join our Google group
https://groups.google.com/g/slavic-llms-mt2025
-
Training/dev data release: Late April
-
Test data release: Late June
-
Submission deadline: Early July
-
System description deadline: Late July
-
Final workshop: 5-9th November @ EMNLP 2025 in Suzhou, China!
Organisers
TUM Heilbronn:
Daryna Dementieva
Marion di Marco
Lukas Edman
Alexander Fraser
Kathy Hämmerl
Shu Okabe
Witaj-Sprachzentrum:
Beate Brězan,
Anita Hendrichowa
Marko Měškank
Tomaš Šołta
Acknowledgements
We thank the UNLP 2024 Shared Task team (Roman Kyslyi, Mariana Romanyshyn,
Oleksiy Syvokon) for kindly sharing Ukrainian QA resources.
Best regards,
Daryna Dementieva
On behalf of TUM Heilbronn Workshop Organizers
Join the Computational Linguistics group at Uppsala University!
We are hiring an Assistant Professor (Tenure Track) in Computational Linguistics
Application deadline: April 29, 2025
Find out more and apply: https://uu.varbi.com/en/what:job/jobID:809532/
******************************
Description
The department is seeking an associate senior lecturer in computational linguistics. Computational linguistics, or language technology, is an interdisciplinary field dealing with computational models of natural language. Traditionally, research has been driven both by the theoretical goal of understanding human language and by practical applications such as machine translation, information retrieval, and human-computer dialogue. The area has in recent years increasingly been influenced by the emergence of large language models and generative AI. The computational linguistics group at Uppsala University conducts research within a broad field with two focus areas: digital philology and multilingual language technology. Education is primarily offered at the second-cycle level, within the international master’s program in language technology, and at the third-cycle level within the PhD program in computational linguistics.
Subject area
Computational linguistics
Duties
The aim of the position is to give the teacher the opportunity to develop independence as a researcher and enhance their academic as well as their pedagogical qualifications in order to meet the criteria for employment as senior lecturer. The duties comprise research within the relevant area of specialization (50% of full-time) as well as teaching and administration (no more than 50% of full-time). Teaching includes supervision, course responsibility, course administration, and course development.
Teaching primarily consists of courses in language technology, including master’s thesis supervision, within the international master’s program in language technology, which is taught in English. Teaching may include courses with the following specializations: general language technology, language technology applications such as information retrieval and machine translation, machine learning in language technology, as well as programming and mathematics. It may also include supervision of PhD students.
As an associate senior lecturer, the candidate is expected to actively apply for external research funding, to stay aware of developments within their field of specialization, and to take part in the faculty’s and department’s development of research and teaching.
Eligibility requirements
Applicants are eligible for employment as associate senior lecturer if they hold a doctoral degree or have the equivalent academic competence, have demonstrated pedagogical competence, and have the personal characteristics required in order to carry out the duties that the position involves well. Precedence will be given to applicants who completed their PhD projects or reached the equivalent competence no more than five years before the deadline for applications. In order to meet the requirement of pedagogical competence, the applicant should have completed at least five weeks of relevant training in tertiary-level teaching or acquired the equivalent competence in other ways. In special circumstances, the training can be carried out during the first two years of employment.
Ability to teach in English is required. The candidate is furthermore expected to be able to use Swedish as a working language within two years.
The candidate should have the personal characteristics required to carry out the duties of the position well. Such characteristics include flexibility, the capacity to work well with others, and a sense of responsibility.
It is a requirement that the applicant’s academic, pedagogical, and professional competence is relevant to the subject area and duties that the position involves.
När du har kontakt med oss på Uppsala universitet med e-post så innebär det att vi behandlar dina personuppgifter. För att läsa mer om hur vi gör det kan du läsa här: http://www.uu.se/om-uu/dataskydd-personuppgifter/
E-mailing Uppsala University means that we will process your personal data. For more information on how this is performed, please read here: http://www.uu.se/en/about-uu/data-protection-policy
Dear Colleagues,
Cornelia Sindermann (computational psychologist, University of
Stuttgart) and I (natural language processing person, University of
Bamberg) are in the phase of preparing a project proposal, for which we
could use your help. Our goal is to develop a novel annotation and data
acquisition platform that is flexible enough to be used across research
fields. In preparation for that, we would like to collect some
information on the use of data acquisition platforms (survey tools,
annotation tools) that are used in our various research communities.
You could help us with that by filling the form at
https://tinyurl.com/annotation2025fis (redirects to a qualtrics survey).
If you have questions about this, please do not hesitate to contact us
at roman.klinger(a)uni-bamberg.de and
cornelia.sindermann(a)iris.uni-stuttgart.de!
Thank you in advance and best regards,
Roman
The 2nd LLMs4Subjects Shared Task: LLM-based Subject Tagging for the TIB Technical Library's Digital Catalog
Theme: The Development of Energy- and Compute-Efficient LLM Systems
Organized as part of the German Evaluation (GermEval 2025) Shared Task Series
10. - 12. September, 2025
Hildesheim, Germany
(co-located with KONVENS 2025 - Conference on Natural Language Processing)
2nd LLMs4Subjects Shared Task: https://sites.google.com/view/llms4subjects-germeval/
KONVENS 2025: https://konvens-2025.hs-hannover.de/about/
Task Overview
LLMs4Subjects challenges the research community to develop cutting-edge LLM-based solutions for subject tagging of technical records from Leibniz University's Technical Library (TIBKAT). Participants are tasked with leveraging large language models (LLMs) to tag technical records using the GND taxonomy. The task involves bilingual language modeling, as systems must process technical documents in both German and English. Successful solutions may be integrated into the operational workflows of TIB, the Leibniz Information Centre for Science and Technology.
With the rapid advancements in LLMs, the focus is shifting toward making these models more energy- and compute-efficient while maintaining high performance. Recent innovations, such as the DeepSeek series, have demonstrated how techniques like mixture-of-experts (MoE) and model distillation can significantly reduce computational costs without sacrificing effectiveness.
The 2nd LLMs4Subjects shared task highlights the importance of efficiency in LLMs, encouraging participants to explore strategies that enhance model performance while optimizing for energy consumption and inference speed. We welcome approaches (but not limited to) that leverage model compression, quantization, efficient fine-tuning, and adaptive computation techniques to push the boundaries of sustainable AI development.
Subtasks
The 2nd LLMs4Subjects shared task organizes the following two subtasks:
Subtask 1 - Multi-Domain Classification of Library Records
Subtask 2 - Large-scale Multilabel Subject Indexing of Library Records
Important Dates
* Release of training data: March 8, 2025
* Release of testing data: May 23, 2025
* Deadline for system submissions: June 2, 2025
* Evaluation end: June 27, 2025
* Paper submission deadline: July 7, 2025
* Notification of acceptance: June 28, 2025
* Camera-ready paper due: August 15, 2025
* Workshop/KONVENS: September 10 - 12, 2025 (TBA)
A PhD position at the University of Groningen, the Netherlands:
- Fully-funded 4-year position
- Research focus: using computational models (including small probabilistic
models and neural network language models) to study the acquisition of
modal verbs
- Programming skills and background in linguistics / language acquisition
required
- Supervisors: Annemarie van Dooren, Yevgen Matusevych, Arianna Bisazza
- Application deadline: 24 April 2025
- More details and application:
https://www.rug.nl/about-ug/work-with-us/job-opportunities/?details=00347-0…
--
Yevgen Matusevych
Assistant Professor
Computational Linguistics, University of Groningen
https://yevgen.web.rug.nl
ClimateCheck: Shared Task on Scientific Fact-Checking of Social Media Claims on Climate Change
Hosted as part of the SDP 2025 Workshop at ACL 2025
31 July 2025
Vienna, Austria
ClimateCheck Shared Task: https://sdproc.org/2025/climatecheck.html
Competition on Codabench: https://www.codabench.org/competitions/6639/
Shared Task Overview
Social media facilitates discussions on critical issues such as climate change, but it also contributes to the rapid dissemination of misinformation, which complicates efforts to maintain an informed public and create evidence-based policies. In this shared task, we emphasise the need to link public discourse to peer-reviewed scholarly articles by gathering claims from social media about climate change (both real-life and automatically generated ones) as well as a corpus of about 400,000 abstracts of publications from climate science research. The participants will be asked to retrieve relevant abstracts for each claim (subtask I) and classify the relation between the claim and abstract as ‘supports’, ‘refutes’, or ‘not enough information’ (subtask II). Participants are allowed to take part either in subtask I only, or in both subtasks.
Subtask I: Abstracts Retrieval
Task: given a claim from social media about climate change and a corpus of abstracts, retrieve the top 10 most relevant abstracts.
Evaluation: Recall@K (K=2, 5, 10) and B-Pref based on annotated gold data; additional unjudged documents will be reviewed by annotators.
Subtask II: Claim Verification
Task: given the claim-abstract pairs received from the previous subtask, classify their relation as ‘supports’, ‘refutes’, or ‘not enough information’.
Evaluation: Precision, Recall, and F1 weighted scores based on annotated documents.
Important dates
Release of training data: April 1, 2025
Release of testing data: April 15, 2025
Deadline for system submissions: May 16, 2025
Paper submission deadline: May 23, 2025
Notification of acceptance: June 13, 2025
Camera-ready paper due: June 20, 2025
Workshop: July 31, 2025
We encourage and invite participation from junior researchers and students from diverse backgrounds. Participants are also encouraged to submit a paper describing their systems to the SDP 2025 workshop.
Organisers
Raia Abu Ahmad (DFKI, Berlin, Germany)
Aida Usmanova (Leuphana University, Lüneburg, Germany)
Georg Rehm (DFKI, Berlin, Germany)
SciVQA: Scientific Visual Question Answering Shared Task
Hosted as part of the SDP 2025 Workshop at ACL 2025
31 July 2025
Vienna, Austria
SciVQA Shared Task: https://sdproc.org/2025/scivqa.html
Competition on Codabench: https://www.codabench.org/competitions/5904
Shared Task Overview
Scholarly articles convey valuable information not only through unstructured text but also via (semi-)structured figures such as charts and diagrams. Automatically interpreting the semantics of knowledge encoded in these figures can be beneficial for downstream tasks such as question answering (QA). In SciVQA, challenge participants will develop multimodal QA systems using a dataset of 3000 images of scientific figures from ACL Anthology and arXiv papers. Each figure is annotated with seven QA pairs (21000 in total) and includes metadata such as caption, ID, type (e.g., compound, line graph, bar chart, scatter plot), and QA pair type. This shared task specifically focuses on closed-ended visual (i.e., addressing visual attributes such as colour, shape, size, height, etc.) and non-visual (not addressing figure visual attributes) questions.
Evaluation
Systems will be evaluated using BERTscore, ROUGE-L, and ROUGE-1 metrics. Automated evaluations of submitted systems will be performed through Codabench.
Important Dates
Release of training data: April 1, 2025
Release of testing data: April 15, 2025
Deadline for system submissions: May 16, 2025
Paper submission deadline: May 23, 2025
Notification of acceptance: June 13, 2025
Camera-ready paper due: June 20, 2025
Workshop: July 31, 2025
Participants are invited to submit papers on their systems. Successful submissions will be published in the proceedings of the SDP 2025 workshop.
Organizers
Ekaterina Borisova (DFKI, Berlin, Germany)
Georg Rehm (DFKI, Berlin, Germany)