[Call for Papers] Developing Models for Linguistic Research: Training Data Usage in Low-Resource Scenarios
💬 Appel à communications en français : voir fichier joint
💬 Call for papers in english: [ https://llcd2025.sciencesconf.org/data/pages/Developing_models_for_linguist… | https://llcd2025.sciencesconf.org/data/pages/Developing_models_for_linguist… ]
Workshop at LLcD 2025
This workshop focuses in approaches and limitations to adapting computational models (such as LLMs) for linguistic research in low-resource scenarios. These include under-resourced, minority, and endangered languages, ancient languages, learner corpora, and language disorder corpora. It seeks to foster interdisciplinary dialogue and collaboration, with the goal of developing best practices, identifying tool limitations, and establishing evaluation guidelines for low-resource linguistic research.
🏙️ Venue and Dates : 1–3 September 2025, Lille, France
📄 Submission Format : Abstract (500 words)
📅 Submission Deadline : 30 March 2025
➡ Submission Information : [ https://lnkd.in/gKX47Esu | https://llcd2025.sciencesconf.org/resource/page/id/7 ]
🌐 Workshop Languages : English and French
We invite contributions from researchers working on corpus construction, transcription, annotation, and analysis, as well as computational linguists and NLP specialists.
❓ For inquiries , please contact Natasha Romanova and Aleksandra Miletić (contact details available in the call).
Conversational agents offer promising opportunities for education as they
can fulfill various roles (e.g., intelligent tutors and service-oriented
assistants) and pursue different objectives (e.g., improving student skills
and increasing instructional efficiency), among which serving as an AI
tutor is one of the most prevalent tasks. Recent advances in the
development of Large Language Models (LLMs) provide our field with
promising ways of building AI-based conversational tutors, which can
generate human-sounding dialogues on the fly. The key question posed in
previous research, however, remains: *How can we test whether
state-of-the-art generative models are good AI teachers, capable of
replying to a student in an educational dialogue?*
In this shared task, we will focus on educational dialogues between a
student and a tutor in the mathematical domain grounded in student mistakes
or confusion, where the AI tutor aims to remediate such mistakes or
confusions, with the goal of evaluating the quality of tutor responses
along the key dimensions of tutor’s ability to (1) identify student’s
mistake, (2) point to its location, (3) provide the student with relevant
pedagogical guidance, that is also (4) actionable. Dialogues used in this
shared task include the dialogue contexts from MathDial (Macina et al.,
2023) and Bridge (Wang et al., 2024) datasets, including the last utterance
from the student containing a mistake, and a set of responses to the last
student’s utterance from a range of LLM-based tutors and, where available,
human tutors, aimed at mistake remediation and annotated for their quality.
**Tracks**
This shared task will include five tracks. Participating teams are welcome
to take part in any number of tracks.
- Track 1 - Mistake Identification: Participants are invited to develop
systems to detect whether tutors' responses recognize mistakes in students'
solutions.
- Track 2 - Mistake Location: Participants are invited to develop systems
to assess whether tutors' responses accurately point to genuine mistakes
and their locations in the students' responses.
- Track 3 - Pedagogical Guidance: Participants are invited to develop
systems to evaluate whether tutors' responses offer correct and relevant
guidance, such as an explanation, elaboration, hint, examples, and so on.
- Track 4 - Actionability: Participants are invited to develop systems to
assess whether tutors' feedback is actionable, i.e., it makes it clear what
the student should do next.
- Track 5 - Guess the tutor identity: Participants are invited to develop
systems to identify which tutors the anonymized responses in the test set
originated from.
**Participant registration**
All participants should register using the following link:
https://forms.gle/fKJcdvL2kCrPcu8X6
**Important dates**
All deadlines are 11:59pm UTC-12 (anywhere on Earth).
- March 12, 2025: Development data release
- April 9, 2025: Test data release
- April 23, 2025: System submissions from teams due
- April 30, 2025: Evaluation of the results by the organizers
- May 21, 2025: System papers due
- May 28, 2025: Paper reviews returned
- June 9, 2025: Final camera-ready submissions
- July 31 and August 1, 2025: BEA 2025 workshop at ACL
**Shared task website**: https://sig-edu.org/sharedtask/2025
**Organizers**
- Ekaterina Kochmar (MBZUAI)
- Kaushal Kumar Maurya (MBZUAI)
- Kseniia Petukhova (MBZUAI)
- KV Aditya Srivatsa (MBZUAI)
- Justin Vasselli (Nara Institute of Science and Technology)
- Anaïs Tack (KU Leuven)
**Contact**: bea.sharedtask.2025(a)gmail.com
2nd Call for Participation: SustainEval 2025 - Understanding Sustainability Reports
We invite system and paper submissions for SustainEval 2025, a GermEval Shared Task co-located with KONVENS 2025 in Hildesheim, Germany, in September 2025.
With this shared task, we aim to fuel research on automatic analysis and detection of greenwashing by challenging participants to build systems that categorize excerpts from German-language sustainability reports for (A) content class and (B) statement verifiability rating.
We have just published the development data, so you can go ahead and start building your systems!
Data repo: https://github.com/SustainEval/sustaineval2025_data
Important Dates
All submission deadlines are 11:59 p.m. UTC-12:00 "anywhere on Earth."Development Data Release (270 Instances):12th March 2025Training Data Release (1000 Instances):14th April 2025Registration Deadline / Start Evaluation Phase:10th June 2025System/Results Submission Deadline:27th June 2025System Description Paper Submission Deadline:11th July 2025System papers will be reviewed within 10 days after the submission deadlineCamera-ready Deadline:15th Aug 2025Workshop & Final Presentation: 10th Sept 2025
For more information please refer to the shared task homepage:
https://sustaineval.github.io/
If you have any questions, please feel free to contact us at sustaineval(a)gmail.com.
Best regards and happy holidays,
The Shared Task Organizers
Jakob Prange, Universität Augsburg (contact for task-specific questions): jakob.prange(a)uni-a.de
Charlott Jakob, TU Berlin (contact for organisational questions): c.jakob(a)tu-berlin.de
Annemarie Friedrich, Universität Augsburg
--
Dr. Jakob Prange (er/he)
Akademischer Rat auf Zeit / Research Associate
Chair for Natural Language Understanding & Digital Humanities (Prof. Friedrich)
Faculty for Applied Informatics, University of Augsburg
https://jakpra.github.io/
Dear all,
We are pleased to inform you that submission for KONVENS 2025 is now open. Papers can be submitted here: https://openreview.net/group?id=GSCL.org/KONVENS/2025
Further Information can be found on the conference website: https://konvens-2025.hs-hannover.de/
Paper Submission Information
We invite submissions of original and unpublished work on all aspects of computational linguistics and natural language processing from fundamental inquiries to the practical implementation of natural language resources, components, and systems. We particularly encourage submissions of NLP approaches dedicated to the German language, including survey papers that provide insights into the current state of the art in German language and speech processing. We welcome contributions from both academic and industry professionals.
Papers can be submitted either to the main conference track or to the special track on application of Large Language Models focussing on the utilisation of LLMs and language-model approaches for the analysis and exploitation of textual resources in Linguistics, Computer Science, Information Science, the Humanities and the Social Sciences.
We welcome the following types of paper submissions:
* Long papers (8 pages plus references), describing original research with substantial new results.
* Short papers (4 pages plus references), including small focused contributions, work in progress, as well as descriptions of projects, systems and resources.
* Abstracts (1 page, non-archival). Abstracts can be presented at the poster session but will not be included in the conference proceedings. We especially invite submission on ongoing projects, student projects, past or ongoing bachelor and master theses, ongoing or recently completed PhD theses, and opinion pieces in this category to foster interaction and discussion in our community.
Accepted papers will be presented orally or as posters as determined by the program chairs. However, a preference for a poster or oral presentation can be given. All presentations will take place in person (no online/hybrid format). The main conference language is English. Contributions written in both German or English will be accepted, however, English is recommended. The review process for long and short papers will be double-blind. Submissions of long and short papers must be anonymized accordingly. The submission of non-archival abstracts must not be anonymous (review of abstracts is single-blind). The conference proceedings will be published via the ACL Anthology (archival submissions only). Papers must be presented at the conference by at least one of the authors in order to be published as part of the proceedings.
Papers must be formatted in accordance with the ACL style sheet: https://github.com/acl-org/acl-style-files. We strongly encourage authors to use LaTeX in preparing their document.
Important Dates
Paper Submission Deadline: May 3, 2025
Paper Notification: June 27, 2025
Camera Ready: August 15, 2025
Main Conference: September 10 - 12, 2025
Best whishes
Christian Wartena
Ulrich Heid
Prof. Dr. Christian Wartena
Hochschule Hannover
Fakultät III - Medien, Information und Design
Abt. Information und Kommunikation
Lehrgebiet Sprach- und Wissensverarbeitung
Expo Plaza 12
30539 Hannover
e-mail: christian.wartena(a)hs-hannover.de<mailto:christian.wartena@hs-hannover.de>
[DATA-H-Logo_RGB_Unterzeile_klein]
The 2nd LLMs4Subjects Shared Task: LLM-based Subject Tagging for the TIB Technical Library's Open-Access Catalog
Theme: The Development of Energy- and Compute-Efficient LLM Systems
Organized as part of the German Evaluation (GermEval 2025) Shared Task Series
10. - 12. September, 2025
Hildesheim, Germany
(co-located with KONVENS 2025 - Conference on Natural Language Processing)
2nd LLMs4Subjects Shared Task: https://sites.google.com/view/llms4subjects-germeval/
KONVENS 2025: https://konvens-2025.hs-hannover.de/about/
Task Overview
LLMs4Subjects challenges the research community to develop cutting-edge LLM-based solutions for subject tagging of technical records from Leibniz University's Technical Library (TIBKAT). Participants are tasked with leveraging large language models (LLMs) to tag technical records using the GND taxonomy. The task involves bilingual language modeling, as systems must process technical documents in both German and English. Successful solutions may be integrated into the operational workflows of TIB, the Leibniz Information Centre for Science and Technology.
With the rapid advancements in LLMs, the focus is shifting toward making these models more energy- and compute-efficient while maintaining high performance. Recent innovations, such as the DeepSeek series, have demonstrated how techniques like mixture-of-experts (MoE) and model distillation can significantly reduce computational costs without sacrificing effectiveness.
The 2nd LLMs4Subjects shared task highlights the importance of efficiency in LLMs, encouraging participants to explore strategies that enhance model performance while optimizing for energy consumption and inference speed. We welcome approaches (but not limited to) that leverage model compression, quantization, efficient fine-tuning, and adaptive computation techniques to push the boundaries of sustainable AI development.
Subtasks
The 2nd LLMs4Subjects shared task organizes the following two subtasks:
Subtask 1 - Multi-Domain Classification of Library Records
Subtask 2 - Large-scale Multilabel Subject Indexing of Library Records
Important Dates
* Release of training data: March 8, 2025
* Release of testing data: May 23, 2025
* Deadline for system submissions: June 2, 2025
* Evaluation end: June 27, 2025
* Paper submission deadline: July 7, 2025
* Notification of acceptance: June 28, 2025
* Camera-ready paper due: August 15, 2025
* Workshop/KONVENS: September 10 - 12, 2025 (TBA)
EcoDL 2025: The 1st Workshop on Digital Libraries and AI-based Information Systems for Ecological Research and Practice in conjunction with TPDL 2025
EcoDL 2025 aims to explore the integration of AI, digital libraries, and FAIR data principles in ecological research to improve knowledge synthesis and predictive modeling. Ecology's complexity and data heterogeneity present challenges in generalization, requiring advanced computational tools for structured knowledge representation, search, and decision support. We invite researchers from ecology, AI, and digital information systems to discuss AI-driven data synthesis, semantic search, causal inference, and machine learning applications in biodiversity and conservation. Through interdisciplinary contributions, EcoDL 2025 seeks to foster innovation in ecological informatics, supporting open science and advancing digital methods for ecological research and environmental sustainability.
**************************************************************
Workshop website: https://sites.google.com/view/ecodl2025/
Paper Submission Deadline: 16th May 2025 (AoE)
***************************************************************
Topics of interest
------------
The EcoDL 2025 workshop welcomes submissions on, but not limited to, the following topics:
· Knowledge graphs and structured ecological data representation
o Biodiversity knowledge graphs
o Linked open data for integrating scattered ecological knowledge sources
o Ontologies for data interoperability in ecology: Standardizing environmental terms and concepts
o Semantic annotation and classification of ecological data
o AI-driven taxonomy generation for ecological datasets
· Advanced search and retrieval for ecological and environmental data
o Neural search for literature and reports: Improving retrieval of species, habitats, and ecosystem information
o Improving retrieval of study question, research hypothesis and applied method
o LLMs for information extraction: Capturing species interactions, climate impacts, and conservation policies
o Retrieval-Augmented Generation (RAG) for ecological research: Hybrid AI systems for answering complex scientific questions
o Multimodal search for biodiversity and environmental studies: Combining text, image, and geospatial data retrieval
o Automated knowledge discovery from climate and biodiversity repositories
· FAIR data principles in ecological research
o Data interoperability
o Open science infrastructure for ecological and environmental data
o Ontologies for data interoperability in ecology: Standardizing environmental terms and concepts
o FAIR data and software
o Data lifecycle management (Create, Store, Share, Reuse)
o NanopublicationsMapping-based Knowledge Graph Construction
· AI for assisting ecological research
o AI-based literature review
o AI-driven synthesis of ecological knowledge: taking complexity and context-dependence into account
o Monitoring biases in study system, study regions and methods in ecological research
o Tracking Misinformation in Climate Science Using NLP: Identifying and mitigating the spread of false environmental claims
· Digital libraries and ecological informatics
o Methods for digitizing and analyzing historical ecological archives
o Indigenous knowledge and digital archives for sustainability
o AI-powered environmental storytelling and digital heritage
o Human-nature interactions in digital libraries
o Digitization and NLP for analyzing historical climate data
· Methods for integrating heterogeneous ecological datasets
o Integrating remote sensing data with ecological repositories
o Multimodal search for biodiversity studies
· Applications of AI in ecosystem restoration, conservation planning and decision-making
o AI-powered decision support systems for restoration and conservation
o Lay summaries based on ecological evidence
o Impact assessment of conservation policies via digital libraries
· Reflections on knowledge synthesis in ecology and on the contributions of AI
o Evaluating the role of AI in ecological research
o Challenges and limitations of AI-driven ecological modeling
o The impact of automated systems on scientific knowledge creation
o Ethical considerations in AI-assisted ecological analysis
o Future directions for AI in knowledge synthesis for ecology
Submission guidelines
----------------
The EcoDL workshop solicits both long and short paper submissions:
§ Long Papers: Up to 15 LNCS style pages, including references.
§ Short Papers: Up to 10 LNCS style pages, including references.
All accepted workshop papers will be published in the proceedings of the Springer series Communications in Computer and Information Science (CCIS). For detailed formatting instructions, please refer to the following link<https://www.springer.com/gp/computer-science/lncs/conference-proceedings-gu…>.
Important dates
-----------
Paper Submission: 16th May 2025 (AOE)
Acceptance Notification: 20th June 2025 (AOE)
Camera-ready Version: 10th July 2025 (AOE)
Workshop: 23rd September 2025 in Tampere, Finland
The EcoDL 2025 Workshop is collocated with the The 29th International Conference on Theory and Practice of Digital Libraries (TPDL 2025) https://tpdl2025.github.io/, 23rd to 26th September 2025.
EcoDL 2025 Organising Committee
----------------
Jennifer D'Souza, TIB Leibniz Information Centre for Science and Technology, Hannover, Germany
Birgitta König-Ries, University of Jena, Germany
Tina Heger, Leibniz Institute of Freshwater Ecology and Inland Fisheries (IGB), Berlin, Germany
Marie Kaiser, Bielefeld University, Germany
The list of workshops and tutorials at TPDL this year can be found at https://tpdl2025.github.io/Program/workshops_tutorials.html
Chères et chers collègues,
L'Université de Lille met au concours deux postes de Maître·sse de conférences en 71e section.
Poste "Information, Savoir et Société" (réf. 71 MCF 251968)
Poste "Information et Transition Numérique" (réf. 71 MCF 251970)
Les deux recrutements relèvent du département Sciences de l’Information et du Document (SID) et du laboratoire GERIICO (ULR 4073).
Les fiches complètes des postes sont disponibles ici : https://www.univ-lille.fr/enseignantsetenseignants-chercheurs
N’hésitez pas à diffuser ces informations dans vos réseaux.
Bien cordialement,
<>
Amel Fraisse
Maitresse de Conférences
Directrice Département Sciences de l’information et du Document
Université de Lille - ICID - Laboratoire GERiiCO
amel.fraisse(a)univ-lille.fr <mailto:prenom.nom@univ-lille.fr> / https://pro.univ-lille.fr/amel-fraisse/ <http://www.univ-lille.fr/>
Domaine Universitaire de Pont de Bois - Villeneuve d'Ascq
Bât. 2 - bureau B2.467
T. +33 (0)3 20 41 69 38
(Apologies, this time with the correct link).
A position as Postdoctoral Research Fellow in Natural Language Processing is available within MediaFutures:Research Centre for Responsible Media Technology & Innovation at the Language Technology Group (LTG) at the University of Oslo (UiO), Norway.
The closing date is April 4th.
For more information about the position and the research group, please see the full announcement here:
https://www.jobbnorge.no/en/available-jobs/job/276909/postdoctoral-research…
Please do not hesitate to contact me for any further information.
Best regards,
Lilja
From: Lilja Øvrelid via Corpora <corpora(a)list.elra.info>
Reply to: Lilja Øvrelid <liljao(a)ifi.uio.no>
Date: Tuesday, 11 March 2025 at 09:31
To: "corpora(a)list.elra.info" <corpora(a)list.elra.info>
Cc: "nodali(a)helsinki.fi" <nodali(a)helsinki.fi>
Subject: [Corpora-List] 3-year postdoc position in NLP at the University of Oslo
A position as Postdoctoral Research Fellow in Natural Language Processing is available within MediaFutures:Research Centre for Responsible Media Technology & Innovation at the Language Technology Group (LTG) at the University of Oslo (UiO), Norway.
The closing date is April 4th.
For more information about the position and the research group, please see the full announcement here:
https://www.jobbnorge.no/en/available-jobs/job/237075/postdoctoral-research…
Please do not hesitate to contact me for any further information.
Best regards,
Lilja
A position as Postdoctoral Research Fellow in Natural Language Processing is available within MediaFutures:Research Centre for Responsible Media Technology & Innovation at the Language Technology Group (LTG) at the University of Oslo (UiO), Norway.
The closing date is April 4th.
For more information about the position and the research group, please see the full announcement here:
https://www.jobbnorge.no/en/available-jobs/job/237075/postdoctoral-research…
Please do not hesitate to contact me for any further information.
Best regards,
Lilja
*CALL FOR PAPERS*
*Special Issue of The Journal of Asia TEFL *(e-ISSN 2466-1511, ISSN
1738-3102, Indexed in SCOPUS, ESCI)
*Learner Corpus Research in the AI Era: Perspectives from Asia*
The emergence of generative AI has fundamentally transformed the landscape
of corpus linguistics, particularly in the domain of learner corpus
research. These powerful technologies are not merely new analytical tools
but represent a paradigm shift in how we conceptualise, collect, and
interpret learner language data. As large language models become
increasingly embedded in language learning environments, researchers must
critically examine both the opportunities and challenges they present.
In Asian contexts, where technological adoption in education proceeds at
remarkable pace, there is an urgent need to investigate how these
developments are reshaping our understanding of learner language. This
special issue aims to bring together cutting-edge research that explores
these transformations from theoretical, methodological and practical
perspectives.
*1. RESEARCH FOCUS*
This special issue invites original contributions that examine how
generative AI is reconfiguring learner corpus research. We are particularly
interested in empirical studies that demonstrate innovative approaches to
corpus compilation, annotation and analysis in the AI era. Successful
submissions will offer insights into how corpus linguistics methodologies
are adapting to accommodate AI-mediated language learning environments.
The integration of AI technologies raises fundamental questions about the
nature of learner language itself. How do we distinguish between authentic
learner production and AI-assisted output? What new analytical frameworks
are required to interpret learner corpora in contexts where AI tools are
ubiquitous? How might AI-enhanced analysis reveal patterns in learner
language previously undetectable through conventional methods?
*Potential topics include:*
- Novel approaches to learner corpus compilation and annotation
leveraging AI technologies
- Methodological innovations in error analysis and pattern
identification using AI
- Comparative investigations of AI-generated versus authentic learner
language
- Applications of AI-driven corpus analysis in developing targeted
pedagogical interventions
- Validity and reliability concerns in corpus research within
AI-integrated learning environments
- Corpus-informed evaluations of AI feedback systems in language
learning contexts
*2. SUBMISSION REQUIREMENTS*
We welcome empirical studies, methodological papers, and critical analyses
that substantively advance our understanding of learner corpus research in
the AI era. Submissions should demonstrate technical rigour while
addressing practical implications for language teaching and assessment in
Asian contexts. Papers should engage critically with existing corpus
linguistics methodologies while proposing adaptations necessary for the AI
era.
*3. IMPORTANT DATES*
- *Abstract Submission Deadline: April 7, 2025*
- Notification of Acceptance: April 30, 2025
- Full Paper Submission Deadline: November 28, 2025
- Reviews by Reviewers: December 2025
- First Revisions by Authors: January 2026
- Reviews by the Editor: March 2026
- Second Revisions by Authors: April 2026
- Editing for Publishing: April 28 – May 23, 2026
- Expected Publication Date: May 26, 2026
*4. ABSTRACT SUBMISSION GUIDELINES*
Abstracts should present a clear articulation of research questions,
methodological framework, and the significance of the study to learner
corpus research in the AI era. Effective abstracts will demonstrate
precision in language and conceptual clarity while highlighting the
innovative aspects of the research. Abstracts should not exceed 500 words
and must be submitted by April 7, 2025 through the following link:
*Abstract submission (Deadline: April 7, 2025)*:
https://forms.gle/njjoaBCuv4mGnzUz9
*5. FULL PAPER SUBMISSION GUIDELINES*
Authors of accepted abstracts should prepare their manuscripts following
The Journal of Asia TEFL guidelines. Full papers must be submitted through
the journal's online submission system and will undergo a rigorous
double-blind peer review process. Successful papers will present compelling
evidence and incisive analysis of how AI technologies are transforming
corpus linguistics methodologies and applications. Papers should
demonstrate meticulous attention to data collection procedures, analytical
frameworks, and the implications of findings for both theory and practice
in learner corpus research.
For inquiries regarding this special issue, please contact the guest
editor, CK Jung, at ckjung(a)inu.ac.kr.
We look forward to receiving your contributions to this timely exploration
of how generative AI is reshaping the field of learner corpus research.
*CK Jung BEng(Hons) Birmingham MSc Warwick EdD Warwick Cert Oxford*
Associate Professor | Department of English Language and Literature,
Incheon National University, South Korea
Director | Institute for Corpus Research, Incheon National University,
South Korea
Editor-in-Chief | Asia Pacific Journal of Corpus Research, South Korea
Editorial Board | Corpora, Edinburgh University Press, UK
Editorial Board | English Today, Cambridge University Press, UK