- Corpora - ELRA lists

Stop notification
by Kayode V Amusan 12 Mar '25

12 Mar '25

stop

1 0

[Call for Papers] Developing Models for Linguistic Research: Training Data Usage in Low-Resource Scenarios
by Cristina Garcia Holgado 12 Mar '25

12 Mar '25

[Call for Papers] Developing Models for Linguistic Research: Training Data Usage in Low-Resource Scenarios 💬 Appel à communications en français : voir fichier joint 💬 Call for papers in english: [ https://llcd2025.sciencesconf.org/data/pages/Developing_models_for_linguist… | https://llcd2025.sciencesconf.org/data/pages/Developing_models_for_linguist… ] Workshop at LLcD 2025 This workshop focuses in approaches and limitations to adapting computational models (such as LLMs) for linguistic research in low-resource scenarios. These include under-resourced, minority, and endangered languages, ancient languages, learner corpora, and language disorder corpora. It seeks to foster interdisciplinary dialogue and collaboration, with the goal of developing best practices, identifying tool limitations, and establishing evaluation guidelines for low-resource linguistic research. 🏙️ Venue and Dates : 1–3 September 2025, Lille, France 📄 Submission Format : Abstract (500 words) 📅 Submission Deadline : 30 March 2025 ➡ Submission Information : [ https://lnkd.in/gKX47Esu | https://llcd2025.sciencesconf.org/resource/page/id/7 ] 🌐 Workshop Languages : English and French We invite contributions from researchers working on corpus construction, transcription, annotation, and analysis, as well as computational linguists and NLP specialists. ❓ For inquiries , please contact Natasha Romanova and Aleksandra Miletić (contact details available in the call).

1 0

1st CFP – Pedagogical Ability Assessment of AI-powered Tutors @ BEA 2025
by Ekaterina Kochmar 12 Mar '25

12 Mar '25

Conversational agents offer promising opportunities for education as they can fulfill various roles (e.g., intelligent tutors and service-oriented assistants) and pursue different objectives (e.g., improving student skills and increasing instructional efficiency), among which serving as an AI tutor is one of the most prevalent tasks. Recent advances in the development of Large Language Models (LLMs) provide our field with promising ways of building AI-based conversational tutors, which can generate human-sounding dialogues on the fly. The key question posed in previous research, however, remains: *How can we test whether state-of-the-art generative models are good AI teachers, capable of replying to a student in an educational dialogue?* In this shared task, we will focus on educational dialogues between a student and a tutor in the mathematical domain grounded in student mistakes or confusion, where the AI tutor aims to remediate such mistakes or confusions, with the goal of evaluating the quality of tutor responses along the key dimensions of tutor’s ability to (1) identify student’s mistake, (2) point to its location, (3) provide the student with relevant pedagogical guidance, that is also (4) actionable. Dialogues used in this shared task include the dialogue contexts from MathDial (Macina et al., 2023) and Bridge (Wang et al., 2024) datasets, including the last utterance from the student containing a mistake, and a set of responses to the last student’s utterance from a range of LLM-based tutors and, where available, human tutors, aimed at mistake remediation and annotated for their quality. **Tracks** This shared task will include five tracks. Participating teams are welcome to take part in any number of tracks. - Track 1 - Mistake Identification: Participants are invited to develop systems to detect whether tutors' responses recognize mistakes in students' solutions. - Track 2 - Mistake Location: Participants are invited to develop systems to assess whether tutors' responses accurately point to genuine mistakes and their locations in the students' responses. - Track 3 - Pedagogical Guidance: Participants are invited to develop systems to evaluate whether tutors' responses offer correct and relevant guidance, such as an explanation, elaboration, hint, examples, and so on. - Track 4 - Actionability: Participants are invited to develop systems to assess whether tutors' feedback is actionable, i.e., it makes it clear what the student should do next. - Track 5 - Guess the tutor identity: Participants are invited to develop systems to identify which tutors the anonymized responses in the test set originated from. **Participant registration** All participants should register using the following link: https://forms.gle/fKJcdvL2kCrPcu8X6 **Important dates** All deadlines are 11:59pm UTC-12 (anywhere on Earth). - March 12, 2025: Development data release - April 9, 2025: Test data release - April 23, 2025: System submissions from teams due - April 30, 2025: Evaluation of the results by the organizers - May 21, 2025: System papers due - May 28, 2025: Paper reviews returned - June 9, 2025: Final camera-ready submissions - July 31 and August 1, 2025: BEA 2025 workshop at ACL **Shared task website**: https://sig-edu.org/sharedtask/2025 **Organizers** - Ekaterina Kochmar (MBZUAI) - Kaushal Kumar Maurya (MBZUAI) - Kseniia Petukhova (MBZUAI) - KV Aditya Srivatsa (MBZUAI) - Justin Vasselli (Nara Institute of Science and Technology) - Anaïs Tack (KU Leuven) **Contact**: bea.sharedtask.2025(a)gmail.com

1 0

2nd Call for Participation: SustainEval 2025 - Understanding Sustainability Reports
by Jakob Prange 12 Mar '25

12 Mar '25

2nd Call for Participation: SustainEval 2025 - Understanding Sustainability Reports We invite system and paper submissions for SustainEval 2025, a GermEval Shared Task co-located with KONVENS 2025 in Hildesheim, Germany, in September 2025. With this shared task, we aim to fuel research on automatic analysis and detection of greenwashing by challenging participants to build systems that categorize excerpts from German-language sustainability reports for (A) content class and (B) statement verifiability rating. We have just published the development data, so you can go ahead and start building your systems! Data repo: https://github.com/SustainEval/sustaineval2025_data Important Dates All submission deadlines are 11:59 p.m. UTC-12:00 "anywhere on Earth."Development Data Release (270 Instances):12th March 2025Training Data Release (1000 Instances):14th April 2025Registration Deadline / Start Evaluation Phase:10th June 2025System/Results Submission Deadline:27th June 2025System Description Paper Submission Deadline:11th July 2025System papers will be reviewed within 10 days after the submission deadlineCamera-ready Deadline:15th Aug 2025Workshop & Final Presentation: 10th Sept 2025 For more information please refer to the shared task homepage: https://sustaineval.github.io/ If you have any questions, please feel free to contact us at sustaineval(a)gmail.com. Best regards and happy holidays, The Shared Task Organizers Jakob Prange, Universität Augsburg (contact for task-specific questions): jakob.prange(a)uni-a.de Charlott Jakob, TU Berlin (contact for organisational questions): c.jakob(a)tu-berlin.de Annemarie Friedrich, Universität Augsburg -- Dr. Jakob Prange (er/he) Akademischer Rat auf Zeit / Research Associate Chair for Natural Language Understanding & Digital Humanities (Prof. Friedrich) Faculty for Applied Informatics, University of Augsburg https://jakpra.github.io/

1 0

KONVENS 2025: Submission now open
by Christian Wartena 12 Mar '25

12 Mar '25

Dear all, We are pleased to inform you that submission for KONVENS 2025 is now open. Papers can be submitted here: https://openreview.net/group?id=GSCL.org/KONVENS/2025 Further Information can be found on the conference website: https://konvens-2025.hs-hannover.de/ Paper Submission Information We invite submissions of original and unpublished work on all aspects of computational linguistics and natural language processing from fundamental inquiries to the practical implementation of natural language resources, components, and systems. We particularly encourage submissions of NLP approaches dedicated to the German language, including survey papers that provide insights into the current state of the art in German language and speech processing. We welcome contributions from both academic and industry professionals. Papers can be submitted either to the main conference track or to the special track on application of Large Language Models focussing on the utilisation of LLMs and language-model approaches for the analysis and exploitation of textual resources in Linguistics, Computer Science, Information Science, the Humanities and the Social Sciences. We welcome the following types of paper submissions: * Long papers (8 pages plus references), describing original research with substantial new results. * Short papers (4 pages plus references), including small focused contributions, work in progress, as well as descriptions of projects, systems and resources. * Abstracts (1 page, non-archival). Abstracts can be presented at the poster session but will not be included in the conference proceedings. We especially invite submission on ongoing projects, student projects, past or ongoing bachelor and master theses, ongoing or recently completed PhD theses, and opinion pieces in this category to foster interaction and discussion in our community. Accepted papers will be presented orally or as posters as determined by the program chairs. However, a preference for a poster or oral presentation can be given. All presentations will take place in person (no online/hybrid format). The main conference language is English. Contributions written in both German or English will be accepted, however, English is recommended. The review process for long and short papers will be double-blind. Submissions of long and short papers must be anonymized accordingly. The submission of non-archival abstracts must not be anonymous (review of abstracts is single-blind). The conference proceedings will be published via the ACL Anthology (archival submissions only). Papers must be presented at the conference by at least one of the authors in order to be published as part of the proceedings. Papers must be formatted in accordance with the ACL style sheet: https://github.com/acl-org/acl-style-files. We strongly encourage authors to use LaTeX in preparing their document. Important Dates Paper Submission Deadline: May 3, 2025 Paper Notification: June 27, 2025 Camera Ready: August 15, 2025 Main Conference: September 10 - 12, 2025 Best whishes Christian Wartena Ulrich Heid Prof. Dr. Christian Wartena Hochschule Hannover Fakultät III - Medien, Information und Design Abt. Information und Kommunikation Lehrgebiet Sprach- und Wissensverarbeitung Expo Plaza 12 30539 Hannover e-mail: christian.wartena(a)hs-hannover.de<mailto:christian.wartena@hs-hannover.de> [DATA-H-Logo_RGB_Unterzeile_klein]

1 0

Call for Participation: 2nd LLMs4Subjects Shared Task
by D'Souza, Jennifer 12 Mar '25

12 Mar '25

The 2nd LLMs4Subjects Shared Task: LLM-based Subject Tagging for the TIB Technical Library's Open-Access Catalog Theme: The Development of Energy- and Compute-Efficient LLM Systems Organized as part of the German Evaluation (GermEval 2025) Shared Task Series 10. - 12. September, 2025 Hildesheim, Germany (co-located with KONVENS 2025 - Conference on Natural Language Processing) 2nd LLMs4Subjects Shared Task: https://sites.google.com/view/llms4subjects-germeval/ KONVENS 2025: https://konvens-2025.hs-hannover.de/about/ Task Overview LLMs4Subjects challenges the research community to develop cutting-edge LLM-based solutions for subject tagging of technical records from Leibniz University's Technical Library (TIBKAT). Participants are tasked with leveraging large language models (LLMs) to tag technical records using the GND taxonomy. The task involves bilingual language modeling, as systems must process technical documents in both German and English. Successful solutions may be integrated into the operational workflows of TIB, the Leibniz Information Centre for Science and Technology. With the rapid advancements in LLMs, the focus is shifting toward making these models more energy- and compute-efficient while maintaining high performance. Recent innovations, such as the DeepSeek series, have demonstrated how techniques like mixture-of-experts (MoE) and model distillation can significantly reduce computational costs without sacrificing effectiveness. The 2nd LLMs4Subjects shared task highlights the importance of efficiency in LLMs, encouraging participants to explore strategies that enhance model performance while optimizing for energy consumption and inference speed. We welcome approaches (but not limited to) that leverage model compression, quantization, efficient fine-tuning, and adaptive computation techniques to push the boundaries of sustainable AI development. Subtasks The 2nd LLMs4Subjects shared task organizes the following two subtasks: Subtask 1 - Multi-Domain Classification of Library Records Subtask 2 - Large-scale Multilabel Subject Indexing of Library Records Important Dates * Release of training data: March 8, 2025 * Release of testing data: May 23, 2025 * Deadline for system submissions: June 2, 2025 * Evaluation end: June 27, 2025 * Paper submission deadline: July 7, 2025 * Notification of acceptance: June 28, 2025 * Camera-ready paper due: August 15, 2025 * Workshop/KONVENS: September 10 - 12, 2025 (TBA)

1 0

[CFP] EcoDL 2025: The 1st Workshop on Digital Libraries and AI-based Information Systems for Ecological Research and Practice
by D'Souza, Jennifer 11 Mar '25

11 Mar '25

EcoDL 2025: The 1st Workshop on Digital Libraries and AI-based Information Systems for Ecological Research and Practice in conjunction with TPDL 2025 EcoDL 2025 aims to explore the integration of AI, digital libraries, and FAIR data principles in ecological research to improve knowledge synthesis and predictive modeling. Ecology's complexity and data heterogeneity present challenges in generalization, requiring advanced computational tools for structured knowledge representation, search, and decision support. We invite researchers from ecology, AI, and digital information systems to discuss AI-driven data synthesis, semantic search, causal inference, and machine learning applications in biodiversity and conservation. Through interdisciplinary contributions, EcoDL 2025 seeks to foster innovation in ecological informatics, supporting open science and advancing digital methods for ecological research and environmental sustainability. ************************************************************** Workshop website: https://sites.google.com/view/ecodl2025/ Paper Submission Deadline: 16th May 2025 (AoE) *************************************************************** Topics of interest ------------ The EcoDL 2025 workshop welcomes submissions on, but not limited to, the following topics: · Knowledge graphs and structured ecological data representation o Biodiversity knowledge graphs o Linked open data for integrating scattered ecological knowledge sources o Ontologies for data interoperability in ecology: Standardizing environmental terms and concepts o Semantic annotation and classification of ecological data o AI-driven taxonomy generation for ecological datasets · Advanced search and retrieval for ecological and environmental data o Neural search for literature and reports: Improving retrieval of species, habitats, and ecosystem information o Improving retrieval of study question, research hypothesis and applied method o LLMs for information extraction: Capturing species interactions, climate impacts, and conservation policies o Retrieval-Augmented Generation (RAG) for ecological research: Hybrid AI systems for answering complex scientific questions o Multimodal search for biodiversity and environmental studies: Combining text, image, and geospatial data retrieval o Automated knowledge discovery from climate and biodiversity repositories · FAIR data principles in ecological research o Data interoperability o Open science infrastructure for ecological and environmental data o Ontologies for data interoperability in ecology: Standardizing environmental terms and concepts o FAIR data and software o Data lifecycle management (Create, Store, Share, Reuse) o NanopublicationsMapping-based Knowledge Graph Construction · AI for assisting ecological research o AI-based literature review o AI-driven synthesis of ecological knowledge: taking complexity and context-dependence into account o Monitoring biases in study system, study regions and methods in ecological research o Tracking Misinformation in Climate Science Using NLP: Identifying and mitigating the spread of false environmental claims · Digital libraries and ecological informatics o Methods for digitizing and analyzing historical ecological archives o Indigenous knowledge and digital archives for sustainability o AI-powered environmental storytelling and digital heritage o Human-nature interactions in digital libraries o Digitization and NLP for analyzing historical climate data · Methods for integrating heterogeneous ecological datasets o Integrating remote sensing data with ecological repositories o Multimodal search for biodiversity studies · Applications of AI in ecosystem restoration, conservation planning and decision-making o AI-powered decision support systems for restoration and conservation o Lay summaries based on ecological evidence o Impact assessment of conservation policies via digital libraries · Reflections on knowledge synthesis in ecology and on the contributions of AI o Evaluating the role of AI in ecological research o Challenges and limitations of AI-driven ecological modeling o The impact of automated systems on scientific knowledge creation o Ethical considerations in AI-assisted ecological analysis o Future directions for AI in knowledge synthesis for ecology Submission guidelines ---------------- The EcoDL workshop solicits both long and short paper submissions: § Long Papers: Up to 15 LNCS style pages, including references. § Short Papers: Up to 10 LNCS style pages, including references. All accepted workshop papers will be published in the proceedings of the Springer series Communications in Computer and Information Science (CCIS). For detailed formatting instructions, please refer to the following link<https://www.springer.com/gp/computer-science/lncs/conference-proceedings-gu…>. Important dates ----------- Paper Submission: 16th May 2025 (AOE) Acceptance Notification: 20th June 2025 (AOE) Camera-ready Version: 10th July 2025 (AOE) Workshop: 23rd September 2025 in Tampere, Finland The EcoDL 2025 Workshop is collocated with the The 29th International Conference on Theory and Practice of Digital Libraries (TPDL 2025) https://tpdl2025.github.io/, 23rd to 26th September 2025. EcoDL 2025 Organising Committee ---------------- Jennifer D'Souza, TIB Leibniz Information Centre for Science and Technology, Hannover, Germany Birgitta König-Ries, University of Jena, Germany Tina Heger, Leibniz Institute of Freshwater Ecology and Inland Fisheries (IGB), Berlin, Germany Marie Kaiser, Bielefeld University, Germany The list of workshops and tutorials at TPDL this year can be found at https://tpdl2025.github.io/Program/workshops_tutorials.html

1 0

2 Postes MCF Section 71, Université de Lille
by Amel Fraisse 11 Mar '25

11 Mar '25

Chères et chers collègues, L'Université de Lille met au concours deux postes de Maître·sse de conférences en 71e section. Poste "Information, Savoir et Société" (réf. 71 MCF 251968) Poste "Information et Transition Numérique" (réf. 71 MCF 251970) Les deux recrutements relèvent du département Sciences de l’Information et du Document (SID) et du laboratoire GERIICO (ULR 4073). Les fiches complètes des postes sont disponibles ici : https://www.univ-lille.fr/enseignantsetenseignants-chercheurs N’hésitez pas à diffuser ces informations dans vos réseaux. Bien cordialement, <> Amel Fraisse Maitresse de Conférences Directrice Département Sciences de l’information et du Document Université de Lille - ICID - Laboratoire GERiiCO amel.fraisse(a)univ-lille.fr <mailto:prenom.nom@univ-lille.fr> / https://pro.univ-lille.fr/amel-fraisse/ <http://www.univ-lille.fr/> Domaine Universitaire de Pont de Bois - Villeneuve d'Ascq Bât. 2 - bureau B2.467 T. +33 (0)3 20 41 69 38

1 0

Re: 3-year postdoc position in NLP at the University of Oslo
by Lilja Øvrelid 11 Mar '25

11 Mar '25

(Apologies, this time with the correct link). A position as Postdoctoral Research Fellow in Natural Language Processing is available within MediaFutures:Research Centre for Responsible Media Technology & Innovation at the Language Technology Group (LTG) at the University of Oslo (UiO), Norway. The closing date is April 4th. For more information about the position and the research group, please see the full announcement here: https://www.jobbnorge.no/en/available-jobs/job/276909/postdoctoral-research… Please do not hesitate to contact me for any further information. Best regards, Lilja From: Lilja Øvrelid via Corpora <corpora(a)list.elra.info> Reply to: Lilja Øvrelid <liljao(a)ifi.uio.no> Date: Tuesday, 11 March 2025 at 09:31 To: "corpora(a)list.elra.info" <corpora(a)list.elra.info> Cc: "nodali(a)helsinki.fi" <nodali(a)helsinki.fi> Subject: [Corpora-List] 3-year postdoc position in NLP at the University of Oslo A position as Postdoctoral Research Fellow in Natural Language Processing is available within MediaFutures:Research Centre for Responsible Media Technology & Innovation at the Language Technology Group (LTG) at the University of Oslo (UiO), Norway. The closing date is April 4th. For more information about the position and the research group, please see the full announcement here: https://www.jobbnorge.no/en/available-jobs/job/237075/postdoctoral-research… Please do not hesitate to contact me for any further information. Best regards, Lilja

1 0

3-year postdoc position in NLP at the University of Oslo
by Lilja Øvrelid 11 Mar '25

11 Mar '25

A position as Postdoctoral Research Fellow in Natural Language Processing is available within MediaFutures:Research Centre for Responsible Media Technology & Innovation at the Language Technology Group (LTG) at the University of Oslo (UiO), Norway. The closing date is April 4th. For more information about the position and the research group, please see the full announcement here: https://www.jobbnorge.no/en/available-jobs/job/237075/postdoctoral-research… Please do not hesitate to contact me for any further information. Best regards, Lilja

1 0

2026

2025

2024

2023

2022

Corpora