- Corpora - ELRA lists

Call for Papers - NLP4Sustain Workshop 2025
by Jakob, Charlott Frederike 03 Apr '25

03 Apr '25

Call for Papers: NLP for Sustainability (NLP4Sustain) Workshop 2025 With this workshop, we want to provide an interdisciplinary forum for discussing research, progress, and challenges in the context of NLP and sustainability. We invite submissions about NLP-based analysis of sustainability-related texts, sustainable NLP models and evaluation practices in general. Authors and other participants will engage with each other in a poster session and there will be an interdisciplinary invited talk with an ensuing discussion. We invite technical, survey, and position papers, as long (8 page) or short (4 page) papers (plus references and appendices) written in English and formatted according to the ACL stylesheet. Relevant Topics • analyses and classifications of sustainability-related texts (such as company reports, advertisements, legal texts, …) • generation of explanations, critiques, summaries, … of sustainability-related texts • multimodal models related to sustainability, such as language-vision or climate-impact models • question answering in the sustainability/climate context • sustainable (e.g. small, efficient) NLP models for other applications/domains • sentiment analysis in the sustainability/climate context • media and social media analysis with NLP methods in the sustainability/climate context Important Dates * Tue, 10.06.2025: Paper submission deadline * Fri, 25.07.2025: Acceptance notification * Fri, 15.08.2025: Camera-ready due * Wed, 10.09.2025: Workshop @KONVENS in Hildesheim, Germany (at least one author must be present) For details, visit the website: https://nlp4sustain.github.io/ If you have any questions, please contact: jakob.prange(a)uni-a.de<mailto:jakob.prange@uni-a.de> and/or c.jakob(a)tu-berlin.de<mailto:c.jakob@tu-berlin.de> —————————————————— Charlott Jakob (she/her) Academic Researcher Technische Universität Berlin Quality and Usability Lab Institute of Software Engineering and Theoretical Computer Science Faculty IV Electrical Engineering and Computer Science Technische Universität Berlin Sekr. MAR 6-7, Marchstr. 23, 10587 Berlin, Germany

1 0

Post-doc position in computational linguistics at Univ. Lorraine (France)
by Mathieu Constant 02 Apr '25

02 Apr '25

The Research unit ATILF (Computer Processing and Analysis of the French Language) offers a postdoctoral position in computational linguistics. Topic: multiword expressions in large language models Location: ATILF, Nancy, France (Univ. Lorraine and CNRS) Starting date: September 2025 Duration: 12 months (possibility to extend the duration for one more year) Supervisors: Mathieu Constant (Univ. Lorraine, France) and Patrick Watrin (UC Louvain, Belgium) Salary: depends on experience and salary grids (from 3000 to 4200 euros before tax) Application deadline: April 22, 2025 Subject. The term « multiword expression » (MWE) refers to a combination of multiple lexical items that displays irregular composition possibly on different linguistic levels (morphology, syntax, semantics, …). They include a large variety of phenomena such as idioms (run around in circles), support verb constructions (take a walk), nominal compounds (dry run), complex function units (in spite of). They have been the subject of extensive research work in the NLP community over the last 50 years. The goal of this post-doc position is to investigate to what extent large language models encode multiword expressions and their various levels of idiomaticity and fixedness. In particular, the hired post-doc will develop methods to extract linguistic features about multiword expressions in context from large language models. The methods will be experimented on French and will be used to provide aids for French L2 learners when reading MWE occurrences in authentic texts. Context. The position is part of the STAR-FLE project (STrategic Adaptations for better Reading and Text Comprehension in FFL, https://www.starfle.fr/en, 2024-2027) funded by the French National Research Agency (ANR). The project aims to propose innovative digital solutions in the area of Natural Language Processing (NLP) that may improve text comprehension for French L2 learners and assist teachers in managing multiple levels of learners. In particular, it will propose context-based aids for understanding lexical issues as well as MWEs found in authentic texts. The hired researcher will be fully integrated in the project team. Requirements. Applicants should hold a PhD thesis n natural language processing, in computational linguistics, in computer science, or in applied mathematics, . The hired post-doc researcher should have the following skills: * expertise in deep learning for NLP and notably large language models * excellent programming skills * Good linguistic skills * good knowledge of French would be a plus * team spirit Application. The applicants should submit a coverage letter, a CV including their publications, a list of references for recommandation, on the following official web site: https://emploi.cnrs.fr/Offres/CDD/UMR7118-SABMAR-022/Default.aspx?lang=EN. The applications should be sent not later than April 22, 2025. For more information, contact Mathieu Constant (Mathieu.Constant(a)univ-lorraine.fr)

1 0

Call for Papers – The Ninth Workshop on Search-Oriented Conversational AI (SCAI’25)
by Svitlana Vakulenko 02 Apr '25

02 Apr '25

*************************************************************************************************** Call for Papers – The Ninth Workshop on Search-Oriented Conversational AI (SCAI’25) **************************************************************************************************** Collocated with IJCAI 2025 (Montreal, Canada) on August 16-18, 2025 https://scai.info Important dates • Submission deadline: May 9, 2025 (AoE) • Author notification: June 6, 2025 Description SCAI is an established venue that provides a discussion platform on Conversational AI for intelligent information access, bringing together researchers and practitioners across natural language processing, information retrieval, machine learning and human-computer interaction fields. The goal of Search-oriented Conversational AI is to design systems that allow for more convenient information access by means of a conversational user interface. Further development of Conversational Search systems requires closer integration and better information exchange between the diverse communities that are engaged in the areas of Dialogue Systems, Information Retrieval and Conversational User Interfaces. SCAI aims to bring together researchers interested in informing the design of a new generation of systems for conversational information access. SCAI 2025 offers an opportunity to present ongoing or recently completed research work in an interdisciplinary meeting specifically focused on search-based conversational AI. The workshop program will include presentation of invited papers, oral presentations, and a poster session. We encourage submission for presentation of original as well as already published papers that are relevant to the following topics: * Design: theoretical understanding and empirical analysis of information-seeking dialogues, properties of a mixed-initiative interaction, modeling conversational contexts, relation to concurrent research in dialogue systems and conversational user interfaces * Implementation: prototypes of conversational search systems, demonstrations and proof-of-concept implementations, as well as lessons learned from deployed systems. * Evaluation: evaluation of conversational search systems including user studies, question answering and summarization metrics, Wizard-of-Oz experiments, user simulation for dialogues, measuring learning outcomes of an information-seeking dialogue, dialogue analysis, faithfulness and provenance of the dialogue responses. * Applications: information-seeking dialogues for personalised education, healthcare, entertainment and knowledge-intensive work. Submission Instructions: * Papers should be up to 8 pages (long) or up to 4 pages (short), excluding references and supplementary materials. * Submissions must be anonymized for double-blind review. * The authors are encouraged (but not required to) use the IJCAI template for their submissions: https://www.ijcai.org/authors_kit. * Our venue is non-archival and the authors retain the right to submit the same work to another venue for official publication, or submit previously published work. * The accepted papers will be presented either orally or as posters. The decision about the presentation format will be based on reviewers’ recommendation. * Submit your work using the on-line form: https://chairingtool.com/conferences/ijcai25-w21/main-track?role=author

1 0

Call for Papers: LxGr2025 – Extended Deadline
by Costas Gabrielatos 02 Apr '25

02 Apr '25

10th Symposium on Corpus Approaches to Lexicogrammar (LxGr2025) CALL FOR PAPERS Extended deadline for abstract submission: 20 April 2025 The symposium will take place online on Friday 11 and Saturday 12 July 2025. LxGr primarily welcomes papers reporting on corpus-based research on any aspect of the interaction of lexis and grammar -- particularly studies that interrogate the system lexicogrammatically to get lexicogrammatical answers. However, position papers discussing theoretical or methodological issues, as well as descriptions or demonstrations of tools or resources are also welcome, as long as they are relevant to both lexicogrammar and corpus linguistics. The theme of LxGr2025 is: Conceptions of Lexicogrammar: How can corpus linguistics shed light on its nature? If you would like to present, send an abstract of 500 words (excluding references) to lxgr(a)edgehill.ac.uk<mailto:lxgr@edgehill.ac.uk>. • Abstracts for research papers should specify the research focus (research questions or hypotheses), the corpus, the methodology (techniques, metrics), the theoretical orientation, and the main findings. • Abstracts for position papers should specify the theoretical orientation and the potential contribution to both lexicogrammar and corpus linguistics. • Abstracts for tools or resources should provide a clear description of the main functions, and specify the potential contribution to both lexicogrammar and corpus linguistics. Full papers will be allocated 35 minutes (including 10 minutes for discussion). Work-in-progress reports will be allocated 20 minutes (including 5 minutes for discussion). There will be no parallel sessions. Participation is free. For details, visit the LxGr website: https://sites.edgehill.ac.uk/lxgr If you have any questions, please contact lxgr(a)edgehill.ac.uk<mailto:lxgr@edgehill.ac.uk>. ________________________________ Edge Hill University<http://ehu.ac.uk/home/emailfooter> Modern University of the Year, The Times and Sunday Times Good University Guide 2022<http://ehu.ac.uk/tef/emailfooter> University of the Year, Educate North 2021/21 ________________________________ This message is private and confidential. If you have received this message in error, please notify the sender and remove it from your system. Any views or opinions presented are solely those of the author and do not necessarily represent those of Edge Hill or associated companies. Edge Hill University may monitor email traffic data and also the content of email for the purposes of security and business communications during staff absence.<http://ehu.ac.uk/itspolicies/emailfooter>

1 0

[Cfp] 3nd Call for Research & Innovation Papers at SEMANTiCS 2025
by Kossi Amouzouvi 02 Apr '25

02 Apr '25

Third Call for Research & Innovation Papers SEMANTiCS 2025 EU 21st International Conference on Semantic Systems Vienna, Austria September 3 - 5, 2025 Important Dates: - *Abstract Submission Deadline: April 25 , 2025* - *Paper Submission Deadline: May 2, 2025* - *Notification of Acceptance: June 13, 2025* - *Camera-Ready Paper Deadline: July 04, 2025* *All deadlines are set for 11:59 pm, Anywhere On Earth time (UTC-12)* *Submissions will be through Easychair and the submission link will be provided soon.* Proceedings of SEMANTiCS 2025 EU will be made available *open access*. Research and Innovation Track The SEMANTiCS 2025 conference is excited to invite submissions for the Research and Innovation Track, welcoming groundbreaking research contributions, innovative solutions, and experimental studies relevant to the Semantic Web, Semantic Technologies, and AI-enabled semantics. We also encourage submissions at the intersections of these fields with other scientific and applied disciplines, fostering cross-disciplinary exchange and advancement. Papers should present original work that has not been published or is not under consideration elsewhere. All submissions must adhere to the submission guidelines, including reference formatting and any additional documentation as required. Each submission will undergo a rigorous review process, with at least three independent reviews, evaluating the novelty, technical quality, reproducibility, and practical relevance of the work. Topics of Interest SEMANTiCS 2025 calls for submissions of high-quality research papers across a broad spectrum of topics in Semantic Web, Semantic Technologies, and AI. We are particularly interested in new and emerging trends, especially where semantic technologies intersect with evolving fields such as large language models, explainable AI, and trustworthy data infrastructures. Topics of interest include, but are not limited to: - Web Semantics & Linked (Open) Data - Enterprise Knowledge Graphs, Graph Data Management - Machine Learning Techniques for/using Knowledge Graphs (e.g. reinforcement learning, deep learning, data mining and knowledge discovery) - Generative AI and Knowledge Graphs (e.g., Retrieval-Augmented Generation (RAG) with knowledge graph integration, generative model grounding) - Reasoning, Rules, and Policies on RAG - Knowledge Engineering and Management (e.g., knowledge acquisition, extraction, integration, and publication workflows) - Terminology, Thesaurus & Ontology Management, Ontology engineering - Web agents - Natural Language Processing for/using Knowledge Graphs (e.g. entity linking and resolution using target knowledge such as Wikidata and DBpedia, foundation models) - Crowdsourcing for/using Knowledge Graphs - Data Quality Management and Assurance - Mathematical and Logical Foundations of Knowledge-aware AI - Multimodal Knowledge Graphs (e.g., text, image, audio fusion in graph structures) - Semantic-Enhanced Data Science Pipelines and Processes - Semantics in Blockchain environments (e.g., traceability, decentralized knowledge representation) - Trust, Data Privacy, and Security with Semantic Technologies - Internet of Things (IoT), Stream Processing, and Temporal Data Management (e.g., real-time semantic processing and predictive analytics) - Conversational AI and Dialogue Systems powered by Knowledge Graphs - Provenance and Data Change Tracking (e.g., semantic versioning, data updates in distributed settings) - Semantic Interoperability (e.g., cross-domain standards, mapping frameworks, ontology alignment) - Linked Data storage, triple stores, graph databases - Robust, Scalable, and Fault-Tolerant Semantic Data Systems (e.g., distributed querying, optimization) - User Interfaces and Usability of Semantic Technologies (e.g., visualizations, intelligent user interaction) - Explainable and Interoperable AI - Decentralised and Federated Knowledge Graphs (e.g., federated querying, link traversal) Applied Semantic Technologies and AI in Real-World Scenarios, such as, but not limited to: - Biomedicine and Health (e.g., Knowledge Graphs for biomedical applications, AI-driven diagnostics, personalized health) - AI for Environmental and Climate Solutions (e.g., semantic modeling for environmental impact, biodiversity knowledge graphs) - Scientific Knowledge Graphs and Open Science (e.g., FAIR data principles, enhanced scholarly communication) - Semantic Technologies in GLAM (Galleries, Libraries, Archives, and Museums) - Knowledge Graphs and Hybrid AI for Industry 4.0/5.0 and Predictive Maintenance - Digital Humanities and Cultural Heritage Preservation - Legal Technology, AI Ethics, and Regulatory Compliance (e.g., AI and legal frameworks, semantic-enabled compliance with the EU AI Act) - Economics and Governance of Data Ecosystems (e.g., data marketplaces, semantic service interoperability, data policy) Submissions will be through Easychair. Stay tuned for the submission link. For *Submission Guidelines* and * Review and Evaluation Criteria* please head to the online call for papers: *https://2025-eu.semantics.cc/page/cfp_rev_rep* <https://2025-eu.semantics.cc/page/cfp_rev_rep>. We would highly appreciate it if you could disseminate this call within your network. *We look forward to receiving your contributions!* Research and Innovation Track Chairs Blerina Spahiu (University of Milano-Bicocca, IT) Mehdi Ali (Lamarr Institute & Fraunhofer IAIS, Germany) Kind Regards, On behalf of the organising committee. ========================= Dr. Kossi Amouzouvi ScaDS.AI Dresden/Leipzig, TU Dresden -- DISCLAIMER: The contents of this email and any attachments are confidential. They are intended for the named recipient(s) only. If you have received this email by mistake, please notify the sender immediately and you are herewith notified that the contents are legally privileged and that you do not have permission to disclose the contents to anyone, make copies thereof, retain or distribute or act upon it by any means, electronically, digitally or in print. The views expressed in this communication may be of a personal nature and not be representative of AIMS-NEI and/or any of its Centres or Initiatives.

1 0

*SEM2025 - First Call for Papers
by Kemal Kurniawan 01 Apr '25

01 Apr '25

(Apologies for cross-posting) *SEM2025: The 14th Joint Conference on Lexical and Computational Semantics, Suzhou, China. (Co-located with EMNLP) https://starsem2025.github.io/ First Call for Papers *SEM brings together researchers interested in the semantics of natural languages and its computational modelling. The conference embraces a wide range of approaches including data-driven, neural, probabilistic and symbolic; practical applications as well as theoretical contributions are welcome. The long-term goal of *SEM is to provide a forum for NLP researchers working on any aspect of natural language semantics. *SEM invites submissions related to the computational modelling of natural language semantics (understood broadly) and its application. Relevant areas include (but are not limited to) theoretical aspects of computational semantics, empirical and data-driven approaches, resources, evaluation and applications/tools. *SEM encourages authors to consider ethical aspects of their work, and to address and discuss ethical questions and implications relevant to their research. *SEM also values reproducibility and particularly welcomes submissions that adhere to the reproducibility guidelines as specified here<https://folk.idi.ntnu.no/odderik/reproducibility_guidelines.pdf>. Submission Instructions Submissions must describe unpublished work and be written in English. We solicit both long and short papers. Long papers describe original research and may consist of up to eight (8) pages of content, plus unlimited pages for references. Appendices are allowed after the references, but the paper should be self-contained and reviewers will not be required to check the appendices, if any. Final versions of long papers will be given one additional page of content (up to 9 pages) so that reviewers' comments can be taken into account. Short papers describe original focused research and may consist of up to four (4) pages, plus unlimited pages for references. Upon acceptance, short papers will be given five (5) content pages in the proceedings. Authors are encouraged to use this additional page to address reviewers comments in their final versions. Limitations and Ethics Statement sections are allowed and encouraged, but are not mandatory. These sections should be placed after the conclusion and will not count towards the overall page limit. Submissions should follow the ARR formatting requirements<https://github.com/acl-org/acl-style-files>. Submission routes and deadlines *SEM solicits both direct submissions and ACL Rolling Review (ARR) commitments. The deadline for direct submissions is May 30, 2025, and these submissions will be reviewed by the *SEM2025 program committee. ACL Rolling Review (ARR) submissions can be committed to *SEM up to August 22, 2025 (authors of ARR-reviewed papers need to include their OpenReview link with reviews in the submission form). Both types of submissions are made through OpenReview. Direct submission link: https://openreview.net/group?id=aclweb.org/StarSEM/2025/Conference<https://openreview.net/group?id=aclweb.org/StarSEM/2025/Conference> Multiple submission policy: *SEM does not prohibit the submission of work that is under consideration for another venue at the same time as the *SEM review period. However, authors of such papers will be asked to declare this at submission time. Important Dates (All deadlines are 11:59pm UTC-12h, AoE) Direct submission deadline (long & short papers): May 30, 2025 ARR-reviewed submission deadline (long & short papers): August 22, 2025 Notification of acceptance: September 5, 2025 Camera-ready deadline: September 26, 2025 Conference date: TBA (co-located with EMNLP 2025) Following ACL and ARR policies<https://www.aclweb.org/portal/content/report-acl-committee-anonymity-policy>, there is no anonymity period requirement. Kemal Kurniawan | Research Fellow | (he/him) PhD School of Computing and Information Systems | Faculty of Engineering and IT Level 4, Melbourne Connect, 700 Swanston St The University of Melbourne, Victoria 3010 Australia E: kurniawan.k(a)unimelb.edu.au<mailto:kurniawan.k@unimelb.edu.au>

1 0

Invitation to participate in ImageCLEF 2025: Multimedia Retrieval in CLEF Lab
by stanciu.cristi12＠gmail.com 01 Apr '25

01 Apr '25

[Apologies for multiple postings] ImageCLEF 2025 Multimedia Retrieval in CLEF http://www.imageclef.org/2025/ We warmly invite you to take part in this year’s ImageCLEF evaluation campaign! With seven exciting and challenging tasks—each featuring multiple sub-tasks and unique research opportunities—there’s something for everyone. You and your team can begin development immediately, as all the training data is already available. Don’t miss the chance to showcase your skills and secure a spot on our leaderboard! *** CALL FOR PARTICIPATION *** ImageCLEF 2025 is an evaluation campaign conducted as part of the CLEF (Conference and Labs of the Evaluation Forum) labs. It features multiple research tasks, inviting teams from around the world to participate. The campaign results are published in the working notes proceedings of CEUR Workshop Proceedings (CEUR-WS.org) and presented at the CLEF conference. Additionally, selected contributions from participants may be invited for publication in the following year’s Springer Lecture Notes in Computer Science (LNCS), alongside the annual lab overviews. ImageCLEF’s target communities include, but are not limited to, researchers in information retrieval (text, vision, audio, multimedia, social media, sensor data, etc.), machine learning, deep learning, data mining, natural language processing, image and video processing, and computer vision. The campaign places particular emphasis on challenges related to multi-modality, multi-linguality, and interactive search. *** 2025 TASKS *** ImageCLEFmedical Automatic Image Captioning ImageCLEFmedical Synthetic Medical Images Created via GANs ImageCLEFmedical Visual Question Answering ImageCLEFmedical Multimodal And Generative TelemedICine (MAGIC) Image Retrieval/Generation for Arguments ImageCLEFtoPicto ImageCLEF Multimodal Reasoning #ImageCLEFmedical Automatic Image Captioning (9th edition) - Training data released! https://www.imageclef.org/2025/medical/caption Interpreting and summarizing the insights gained from medical images such as radiology output is a time-consuming task that involves highly trained experts and often represents a bottleneck in clinical diagnosis pipelines.The Automatic Image Captioning task is split into 2 subtasks: Concept Detection Task, based on identifying the presence and location of relevant concepts in a large corpus of medical images and the Caption Prediction Task, where participating systems are tasked with composing coherent captions for the entirety of an image Organizers: Hendrik Damm, Johannes Rückert, Christoph M. Friedrich, Louise Bloch, Raphael Brüngel, Ahmad Idrissi-Yaghir, Benjamin Bracke (University of Applied Sciences and Arts Dortmund, Germany), Asma Ben Abacha (Microsoft, USA), Alba García Seco de Herrera (University of Essex, UK), Henning Müller (University of Applied Sciences Western Switzerland, Sierre, Switzerland), Henning Schäfer, Tabea M. G. Pakull (Institute for Transfusion Medicine, University Hospital Essen, Germany), Cynthia S. Schmidt, Obioma Pelka (Institute for Artificial Intelligence in Medicine, Germany) #ImageCLEFmedical Synthetic Medical Images Created via GANs (3rd edition) - Train & Test data released! https://www.imageclef.org/2025/medical/gan The task aims to further investigate the hypothesis that generative models generate synthetic medical images that retain "fingerprints" from the real images used during their training. These fingerprints raise important security and privacy concerns, particularly in the context of personal medical image data being used to create artificial images for various real-life applications. In the first subtask, participants will analyze synthetic biomedical images to determine whether specific real images were used in the training process of generative models. In the second subtask, participants will link each synthetic biomedical image to the specific subset of real data used during its generation. The goal is to identify the particular dataset of real images that contributed to the training of the generative model responsible for creating each synthetic image. Organizers: Alexandra Andrei, Liviu-Daniel Ștefan, Mihai Gabriel Constantin, Mihai Dogariu, Bogdan Ionescu (National University of Science and Technology POLITEHNICA Bucharest, Romania), Ahmedkhan Radzhabov, Yuri Prokopchuk (National Academy of Science of Belarus, Minsk, Belarus), Vassili Kovalev (Belarusian Academy of Sciences, Minsk, Belarus), Henning Müller (University of Applied Sciences Western Switzerland, Sierre, Switzerland) #ImageCLEFmedical Visual Question Answering (3rd edition) - Train & Test data released! https://www.imageclef.org/2025/medical/vqa This year, the challenge looks at the integration of Visual Question Answering (VQA) with synthetic gastrointestinal (GI) data, aiming to enhance diagnostic accuracy and learning algorithms. The challenge includes developing algorithms that can interpret and answer questions based on synthetic GI images, creating advanced synthetic images that mimic accurate diagnostic visuals in detail and variability, and evaluating the effectiveness of VQA techniques with both synthetic and real GI data. The 1st subtask asks participants to build algorithms that can accurately interpret and respond to questions pertaining to gastrointestinal (GI) images. This involves understanding the context and details within the images and providing precise answers that would assist in medical diagnostics, while the 2nd subtask focuses on the generation of synthetic GI images that are highly detailed and variable enough to closely resemble real medical images. Organizers: Steven A. Hicks, Sushant Gautam, Michael A. Riegler, Vajira Thambawita, Pål Halvorsen (SimulaMet, Norway) #ImageCLEFmedical Multimodal And Generative TelemedICine (MEDIQA-MAGIC) (3rd edition) - Train data is released! https://www.imageclef.org/2025/medical/mediqa The task extends on the previous year’s dataset and challenge based on multimodal dermatology response generation. Participants will be given a clinical narrative context along with accompanying images. The task is divided into two relevant sub-parts: (i) segmentation of dermatological problem regions, and (ii) providing answers to closed-ended questions (participants will be given a dermatological query, its accompanying images, as well as a closed-question with accompanying choices – the task is to select the correct answer to each question) Organizers: Asma Ben Abacha, Wen-wai Yim, Noel Codella (Microsoft), Roberto Andres Novoa (Stanford University), Josep Malvehy (Hospital Clinic of Barcelona) #Image Retrieval/Generation for Arguments (4th edition) - In collaboration with Touché! https://www.imageclef.org/2025/argument-images Given a set of arguments, the task is to return for each argument several images that help convey the argument. A suitable image could depict the argument or show a generalization or specialization. Participants can optionally add a short caption that explains the meaning of the image. Images can be either retrieved from the focused crawl or generated using an image generator. Organizers: Maximilian Heinrich, Johannes Kiesel, Benno Stein (Bauhaus-Universität Weimar), Moritz Wolter (Leipzig University), Martin Potthast (University of Kassel, hessian.AI, scads.AI) #ImageCLEFtoPicto (3rd edition) - Train & Test data released! https://www.imageclef.org/2025/topicto The goal of ToPicto is to bring together linguists, computer scientists, and translators to develop new translation methods to translate either speech or text into a corresponding sequence of pictograms. The task refers to the relationship between text and related pictograms and is composed of 2 subtasks: the Text-to-Picto task, which focuses on the automatic generation of a corresponding sequence of pictogram terms and the Speech-to-Picto task, which focuses on directly translating speech to pictogram terms. Organizers: Diandra Fabre, Cécile Macaire, Benjamin Lecouteux, Didier Schwab (Université Grenoble Alpes, LIG, France) #ImageCLEF Multimodal Reasoning (new) - Train data released! https://www.imageclef.org/2025/multimodalreasoning MultimodalReason is a new task focusing on Multilingual Visual Question Answering (VQA). The formulation of the task is the following: Given an image of a question with 3-5 possible answers, participants must identify the single correct answer.The task is split into many subtasks, each handling a different language (English, Bulgarian, Arabic, Serbian, Italian, Hungarian, Croatian, Urdu, Kazakh, Spanish, with a few more on the way). The task's goal is to assess modern LLMs' reasoning capabilities on complex inputs, presented in different languages, across various subjects. Organizers: Dimitar Dimitrov, Ivan Koychev (Sofia University "St. Kliment Ohridski", Bulgaria), Rocktim Jyoti Das, Zhuohan Xie, Preslav Nakov (Mohamed bin Zayed University of Artificial Intelligence (MBZUAI), Abu Dhabi, UAE) *** IMPORTANT DATES *** (may vary depending on the task) - Run submission deadline: May 10, 2025 - Working notes submission: May 30, 2025 - CLEF 2025 conference: September 9-12, 2025, Madrid, Spain *** REGISTRATION *** Follow the instructions here https://www.imageclef.org/2025 *** OVERALL COORDINATION *** Bogdan Ionescu, Politehnica University of Bucharest, Romania Henning Müller, HES-SO, Sierre, Switzerland Dan-Cristian Stanciu, Politehnica University of Bucharest, Romania On behalf of the organizers, Dan-Cristian Stanciu https://www.aimultimedialab.ro/

1 0

Fully-funded PhD position in language acquisition / computational linguistics
by Yevgen Matusevych 01 Apr '25

01 Apr '25

A PhD position at the University of Groningen, the Netherlands: - Fully-funded 4-year position - Research focus: using computational models (including small probabilistic models and neural network language models) to study the acquisition of modal verbs - Programming skills and background in linguistics / language acquisition required - Supervisors: Annemarie van Dooren, Yevgen Matusevych, Arianna Bisazza - Application deadline: 24 April 2025 - More details and application: https://www.rug.nl/about-ug/work-with-us/job-opportunities/?details=00347-0… -- Yevgen Matusevych Assistant Professor Computational Linguistics, University of Groningen https://yevgen.web.rug.nl

1 0

9th Workshop on Online Abuse and Harms (WOAH) @ACL - Reminder
by Agostina Calabrese 01 Apr '25

01 Apr '25

We invite paper submissions to the 9th Workshop on Online Abuse and Harms (WOAH), which will take place on August 1 at ACL 2025. Website: https://www.workshopononlineabuse.com/cfp.html Important Dates * Submission due: April 18, 2025 * ARR reviewed submission due: May 20, 2025 * Notification of acceptance: May 30, 2025 * Camera-ready papers due: June 13, 2025 * Workshop: August 1st, 2025 Overview Digital technologies have brought significant benefits to society, transforming how people connect, communicate, and interact. However, these same technologies have also enabled the widespread dissemination and amplification of abusive and harmful content, such as hate speech, harassment, and misinformation. Given the sheer volume of content shared online, addressing abuse and harm at scale requires the use of computational tools. Yet, detecting and moderating online abuse remains a complex task, fraught with technical, social, legal, and ethical challenges. The 9th Workshop on Online Abuse and Harms (WOAH) invites paper submissions from a diverse range of fields, including but not limited to natural language processing, machine learning, computational social science, law, political science, psychology, sociology, and cultural studies. We explicitly encourage interdisciplinary research, technical and non-technical contributions, and submissions that focus on under-resourced languages. Non-archival papers and civil society reports are also welcome. Topics covered by WOAH include, but are not limited to: * New models or methods for detecting abusive and harmful online content, including misinformation; * Biases and limitations in existing detection models or datasets for abusive and harmful content, especially those in commercial use; * Development of new datasets and taxonomies for online abuse and harms; * Novel evaluation metrics and procedures for detecting harmful content; * Analyses of the dynamics of online abuse, its propagation, and its impact on different communities; * Social, legal, and ethical considerations in detecting, monitoring, and moderating online abuse. Special Theme: Harms Beyond Hate Speech In its 9th edition, WOAH highlights the theme Harms Beyond Hate Speech. We aim to expand the conversation beyond conventional definitions of harmful content by exploring the nuanced ways online harms manifest—such as technologically mediated inauthentic behavior, the power of technologies to reshape perceptions and opinions, and their potential to incite discrimination, hostility, violence, or even genocide. Additionally, we emphasize the diverse targets affected by such harms and the unique considerations computational interventions demand. To facilitate this exploration, we invite NLP researchers, social scientists, cultural scholars, and practitioners to engage with key issues, including child sexual abuse material, radicalization, misinformation, platform policies, security, and the politics of computational approaches. By fostering interdisciplinary collaboration, our goal is to deepen understanding of these complex phenomena and advance effective, ethical solutions Submission Submission is electronic, using the Softconf START conference management system. Submission link: https://softconf.com/acl2025/woah2025/ The workshop will accept three types of papers: 1) Academic Papers (long and short): Long papers of up to 8 pages, excluding references, and short papers of up to 4 pages, excluding references. Unlimited pages for references and appendices. Accepted papers will be given an additional page of content to address reviewer comments. Previously published papers cannot be accepted. 2) Non-Archival Submissions: Up to 2 pages, excluding references, to summarise and showcase in-progress work and work published elsewhere. 3) Civil Society Reports: Non-archival submissions, with a minimum of 2 pages and no upper limit. Can include work published elsewhere. All submissions must use the official ACL style files<https://github.com/acl-org/acl-style-files>. Submissions that do not conform to the required styles, including paper size, margin width, and font size restrictions, will be rejected without review. All submissions should adhere to the workshop policies https://www.workshopononlineabuse.com/policies.html. WOAH Community We are excited to share the WOAH community Slack channel — a workspace for researchers interested in or working on understanding and addressing online abuse and harms! Join us here: https://join.slack.com/t/hatespeechdet-47d7560/shared_invite/zt-2a8d96j4z-g… Contact Info Please send any questions about the workshop to organizers(a)workshopononlineabuse.com<mailto:organizers@workshopononlineabuse.com> Organisers Agostina Calabrese, University of Edinburgh Christine de Kock, University of Melbourne Debora Nozza, Bocconi University Flor Miriam Plaza-del-Arco, Bocconi University Zeerak Talat, University of Edinburgh Francielle Vargas, University of São Paulo The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336. Is e buidheann carthannais a th’ ann an Oilthigh Dhùn Èideann, clàraichte an Alba, àireamh clàraidh SC005336.

1 0

1st CFP: EvalLAC 2025 workshop @ AIED 2025
by Shiva Taslimipoor 01 Apr '25

01 Apr '25

***** Second Workshop on Automated Evaluation of Learning and Assessment Content AIED 2025 workshop | Palermo (Italy) & Hybrid | 22-26 July 2025 https://sites.google.com/cam.ac.uk/eval-lac-2025 <https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fsites.goo…> ***** We are happy to announce the second edition of the Workshop on Automated Evaluation of Learning and Assessment Content will be held in Palermo (Italy) & online during the AIED 2025 conference. About the workshop Evaluation of learning and assessment content has always been a crucial task in the educational domain, but traditional approaches based on human feedback are not always usable in modern educational settings. Indeed, the advent of machine learning models, in particular Large Language Models (LLMs), enabled to quickly and automatically generate large quantities of data, making human evaluation unfeasible. Similarly, Massive Open Online Courses (MOOCs) have numbers of attending students so large that it is unsustainable to manually provide feedback to all of them. Thus, the need for accurate and automated techniques for evaluating educational content – e.g., questions, hints, and feedback – became pressing. Building on the success of the First Workshop on the Automatic Evaluation of Learning and Assessment Content, which was held at AIED 2024, this workshop aims to attract professionals from both academia and industry, and to to offer an opportunity to discuss common challenges, share best practices, and promising new research directions. Topics of interests include but are not limited to: - Question evaluation (e.g., in terms of the pedagogical criteria listed above: alignment to the learning objectives, factual accuracy, language level, cognitive validity, etc.). - Estimation of question statistics (e.g., difficulty, discrimination, response time, etc.). - Evaluation of distractors in Multiple Choice Questions. - Evaluation of reading passages in reading comprehension questions. - Evaluation of lectures and course material. - Evaluation of learning paths (e.g., in terms of prerequisites and topics taught before a specific exam). - Evaluation of educational recommendation systems (e.g., personalised curricula). - Evaluation of hints and scaffolding questions, as well as their adaptation to different students. - Evaluation of automatically generated feedback provided to students. - Evaluation of techniques for automated scoring. - Evaluation of pedagogical alignment of LLMs. - Evaluation of the ethical implications of using open-weight and commercial LLMs in education. - Evaluation of bias in educational content and LLM outputs. Human-in-the-loop approaches are welcome, provided that there is also an automated component in the evaluation and there is a focus on the scalability of the proposed approach. Papers on generation are also very welcome, as long as there is an extensive focus on the evaluation step. Important dates Submission deadline: May 25, 2025 Notification of acceptance: June 15, 2025 Camera ready: June 22, 2025 Workshop: July 22 or July 26, 2025 Submission guidelines There are two tracks, with different submission deadlines. Full and short papers: We are accepting short papers (5 pages, excluding references) and long papers (10 pages, excluding references), formatted according to the workshop style (using either the LaTeX template <https://www.overleaf.com/latex/templates/template-for-submissions-to-ceur-w…> or the DOCX template <https://ceur-ws.org/Vol-XXX/CEUR-Template-1col.docx> ). Extended abstracts: We also accept extended abstracts (max 2 pages), to showcase work in progress and preliminary results. Papers should be formatted according to the workshop style (using either the LaTeX template or the DOCX template). Submissions should contain mostly novel work, but there can be some overlap between the submission and work submitted elsewhere (e.g., summaries, focus on the evaluation phase of a broader work). Each of the submissions will be reviewed by the members of the Program Committee, and the proceedings volume will be submitted for publication to CEUR Workshop Proceedings. Due to CEUR-WS.org policies, only full and short papers will be submitted for publication, not the extended abstracts. Organisers Luca Benedetto (1), Andrew Caines (1), George Dueñas (2), Diana Galvan-Sosa (1), Gabrielle Gaudeau (1), Anastassia Loukina (3), Shiva Taslimipoor (1), Torsten Zesch (4) (1) ALTA Institute, Dept. of Computer Science and Technology, University of Cambridge (2) National Pedagogical University, Colombia (3) Grammarly, Inc. (4) FernUniversität in Hagen

1 0

2026

2025

2024

2023

2022

Corpora