We invite submissions to the 2nd Workshop on Ecology, Environment, and Natural Language Processing. We are particularly interested in contributions that push the boundaries of linguistics and NLP research in the context of ecological and environmental crisis and that foster interdisciplinary collaboration.
The topics of interest include, but are not limited to:
- Sentiment, Argument, and Stance Analysis of Environmental Topics:
Evaluating public opinions, emotions, and stances on ecological issues across social media, news outlets, and other media, including environmental activism communication and AI–environment debates (e.g., Longo and Longo, 2025; Ibrohim et al., 2023; Barz et al., 2025; Grasso et al., 2024).
- Automated Linguistic and Discourse/Frame Analysis, and Topic Modeling:
Studying grammatical, lexical, and discourse patterns in ecological communication from an ecolinguistic perspective, including topic modeling and framing analyses of media, political discourse, corporate reports, and NGO communication (e.g., Widanti, 2022; Dehler-Holland et al., 2021; Bosco et al., 2025; Grasso et al., 2025b).
- Detection of Anthropocentric and Speciesist Biases:
Identifying and mitigating anthropocentric and speciesist biases in language data and NLP applications, including bias in large language models (e.g., Leach et al., 2021; Takeshita et al., 2022; Grasso et al., 2025a).
- Text Classification, Entity Recognition, and Environmental Monitoring:
Categorizing texts into environmental subdomains such as biodiversity, climate change, and conservation, and identifying or tracking mentions of species, habitats, pollutants, and ecological phenomena, including applications of LLMs to ecological and biodiversity corpora (e.g., Volkanovska, 2025; Schimanski et al., 2023; Abdelmageed et al., 2022; Grasso & Locci, 2024).
- Fact-checking and Greenwashing Detection:
Analyzing corporate sustainability reports and institutional communication to detect misleading claims, greenwashing practices, and inaccuracies in environmental discourse (e.g., Glazkova and Zakharova, 2025; Cojoianu et al., 2020; Moodaley & Telukdarie, 2023).
- Ecofeminism, Environmental Justice, and Language:
Exploring the intersections of gender, justice, power, and ecological narratives, and how NLP methods can support the analysis of environmental justice–oriented discourse.
Further topics include:
- Ecolinguistic applications of NLP.
-Large Language Models (LLMs) application in Climate Change and Environmental domain.
-Analysis of harmful environmental narratives and misinformation on social media.
-Corpora creation and annotation for ecological discourse
-Geo-tagging and Sentiment Mapping of Environmental Discussions
-Fairness, ethics, and accountability in environmental NLP.
-Environmental communication in low-resource languages.
-Multimodal analysis for ecological and environmental challenges.
-Lexical and semantic analysis of sustainability discourse.
-Linked Data and Knowledge Graphs on ecological topics.
-Language diversity and inclusion in environmental narratives.
-Cognitive models and ecological storytelling.
-NLP for understanding indigenous knowledge in environmental contexts.
-Machine learning techniques for analyzing environmental communication.
-NLP for environmental legislation and policy discourse.
-NLP for environmental education and awareness campaigns.
-Speech technologies to support ecological field research.
-Educational chatbots and conversational agents for raising environmental awareness.
We invite submissions in the following categories:
- Regular Papers (from a minimum of 4 up to 8 pages)
- Non-archival contributions (up to 4 pages).
Regular papers must report original, previously unpublished work and follow the LREC 2026 Author Kit. Accepted regular papers will be included in the workshop proceedings.
Non-archival contributions include research communications (i.e. work already published at other venues), work in progress, manifestos, and similar contributions. Non-archival contributions can be presented at the workshop but will not be included in the proceedings.
Please visit https://nlp4ecology2026.di.unito.it/ for more infos.
Contact e-mails: nlp4ecology.workshop(a)gmail.com ; fr.grasso(a)unito.it
****We apologize for the multiple copies of this e-mail****
Dear all,
We would like to share the following PhD opportunity, which may be of interest to prospective doctoral candidates or colleagues who could help disseminate the call.
A PhD position (4-year FPI contract) is available at the UNED NLP & IR group, within the framework of the ANNOTATE research project, carried out in collaboration with the University of Barcelona and the Universitat Politècnica de València.
The PhD research will focus on Human-Centric Artificial Intelligence, with particular emphasis on the analysis and detection of sexism in digital and multimedia environments, combining multimodal data and explainable, socially responsible AI approaches.
The doctoral work will contribute to the development of a Sexism Observatory, an interactive platform for large-scale detection, analysis, and visualization of sexist content.
The position is aimed at candidates with a background in Artificial Intelligence and Natural Language Processing, strong Python programming skills, and a good command of English.
The application deadline is February 8th, 2026. Full details about the position and the application process are available in the linked document: http://nlp.uned.es/~jcalbornoz/jobs/Full_time_Phd_UNED_FPI.pdf
Best regards,
Jorge Carrillo-de-Albornoz
P.S. Please feel free to share or redistribute this call with anyone who might be interested.
AVISO LEGAL. Este mensaje puede contener información reservada y confidencial. Si usted no es el destinatario no está autorizado a copiar, reproducir o distribuir este mensaje ni su contenido. Si ha recibido este mensaje por error, le rogamos que lo notifique al remitente.
Le informamos de que sus datos personales, que puedan constar en este mensaje, serán tratados en calidad de responsable de tratamiento por la UNIVERSIDAD NACIONAL DE EDUCACIÓN A DISTANCIA (UNED) c/ Bravo Murillo, 38, 28015-MADRID-, con la finalidad de mantener el contacto con usted. La base jurídica que legitima este tratamiento, será su consentimiento, el interés legítimo o la necesidad para gestionar una relación contractual o similar. En cualquier momento podrá ejercer sus derechos de acceso, rectificación, supresión, oposición, limitación al tratamiento o portabilidad de los datos, ante la UNED, Oficina de Protección de datos<https://www.uned.es/dpj>, o a través de la Sede electrónica<https://uned.sede.gob.es/> de la Universidad.
Para más información visite nuestra Política de Privacidad<https://descargas.uned.es/publico/pdf/Politica_privacidad_UNED.pdf>.
11th Symposium on Corpus Approaches to Lexicogrammar (LxGr2026)
CALL FOR PAPERS
Deadline for abstract submission: 1 March 2026
The symposium will take place online on Thursday 2 and Friday 3 July 2026
Invited Speakers
Stefan Gries<https://www.stgries.info/> (University of California, Santa Barbara, USA)
Martin Hilpert<http://members.unine.ch/martin.hilpert> (University of Neuchâtel, Switzerland)
LxGr primarily welcomes papers reporting on corpus-based research on any aspect of the interaction of lexis and grammar -- particularly studies that interrogate the system lexicogrammatically to get lexicogrammatical answers. However, position papers discussing theoretical or methodological issues, as well as descriptions or demonstrations of tools or resources are also welcome, as long as they are relevant to both lexicogrammar and corpus linguistics.
If you would like to present, send an abstract of 500 words (excluding references) to lxgr(a)edgehill.ac.uk<mailto:lxgr@edgehill.ac.uk>.
* Abstracts for research papers should specify the research focus (research questions or hypotheses), the corpus, the methodology (techniques, metrics), the theoretical orientation, and the main findings.
* Abstracts for position papers should specify the theoretical orientation and the potential contribution to both lexicogrammar and corpus linguistics.
* Abstracts for tools or resources should provide a clear description of the main functions, and specify the potential contribution to both lexicogrammar and corpus linguistics.
Full papers will be allocated 35 minutes (including 10 minutes for discussion).
Work-in-progress reports will be allocated 20 minutes (including 5 minutes for discussion).
There will be no parallel sessions.
Participation is free.
For details, visit the LxGr website: https://sites.edgehill.ac.uk/lxgr
If you have any questions, please contact lxgr(a)edgehill.ac.uk<mailto:lxgr@edgehill.ac.uk>.
________________________________
Edge Hill University<http://ehu.ac.uk/home/emailfooter>
Modern University of the Year, The Times and Sunday Times Good University Guide 2022<http://ehu.ac.uk/tef/emailfooter>
University of the Year, Educate North 2021/21
________________________________
This message is private and confidential. If you have received this message in error, please notify the sender and remove it from your system. Any views or opinions presented are solely those of the author and do not necessarily represent those of Edge Hill or associated companies. Edge Hill University may monitor email traffic data and also the content of email for the purposes of security and business communications during staff absence.<http://ehu.ac.uk/itspolicies/emailfooter>
Dear all,
The Open Advanced Methods in Corpus Linguistics research group, offered by CASS - the ESRC Centre for Corpus Approaches to Social Science at Lancaster University, will resume this term. The group is open to anyone with an interest in corpus linguistics or quantitative approaches to language, and sessions will be held in a hybrid format (in person and online).
This term, we will focus on one of the most fundamental concepts in corpus linguistics, frequency, a concept that underpins the field as a whole.
When: Wednesdays, fortnightly, 12:00-12:50 pm (UK time)
Starting: Wednesday 21 January
Running topic: Measuring and reporting frequency in corpus linguistics
Free registration: https://forms.office.com/e/YT5md2fjkac
All are very welcome.
Best wishes,
Vaclav
Readings:
Brezina, V., & Gablasova, D. (2024). A frequency dictionary of British English: core vocabulary and exercises for learners. Routledge.
Divjak, D. (2019). Frequency in language: Memory, attention and learning. Cambridge University Press.
Gries, S.T. (2024). Frequency, Dispersion, Association, and Keyness: Revising and tupleizing corpus-linguistic measures. Amsterdam: John Benjamins.
Platt, W. C. (2025). Review of Gries (2024): Frequency, Dispersion, Association, and Keyness: Revising and tupleizing corpus-linguistic measures.
Tissari, H. (2023). Divjak, Dagmar: Frequency in language: memory, attention and learning. Cambridge: Cambridge University Press, 2019.[Book review]. Linguist list, 34.
*** First Call for Project Showcases ***
International Conference on Software and Systems Reuse, Product Lines,
and Configuration (VARIABILITY 2026)
29 September - 2 October 2026, 5* St. Raphael Resort and Marina
Limassol, Cyprus
https://conf.researchr.org/home/variability-2026
The VARIABILITY conference series brings together the communities previously served by
ICSR, SPLC, and VaMoS, forming a unified venue for research on variability, configuration,
customization, and related disciplines in software and systems engineering. As part of this
mission, VARIABILITY 2026 invites submissions to its Project Showcase Track, a forum
dedicated to presenting ongoing or recently completed research projects.
The track offers a stage for research teams to share their vision, goals, early outcomes,
intermediate results, final achievements, and lessons learned from funded projects of all
scales, including collaborative research centers, EU projects, and nationally or regionally
funded initiatives. The goal is to encourage interaction, foster collaboration opportunities,
and help disseminate project insights to the broader community.
Objectives and Scope
We welcome submissions on research projects that address reuse, product lines, and
variable/configurable software systems. A list of research topics that are relevant for this
track is available from the call for the papers for the VARIABILITY 2026 Research Track, at:
https://conf.researchr.org/track/variability-2026/variability-2026-papers#C…
Submissions are expected to describe ongoing or recently completed research projects
within this scope. This track is not intended for publishing mature research results.
Instead, it focuses on project summaries and overviews, highlighting goals, structure,
challenges, insights, and project level impact.
Examples of suitable submissions include:
• Ongoing projects focusing on goals, challenges, methodology, or early findings
• Recently completed projects summarizing outcomes, evidence, and impact
• Large scale, collaborative, or multi partner efforts, where visibility and networking are
beneficial
• Smaller or emerging projects that would benefit from early feedback and exposure
PhD thesis projects are not in scope for this track. We warmly encourage PhD candidates to
submit their work to the VARIABILITY 2026 Doctoral Symposium.
Submission Format
• Length: 7 to 10 pages, excluding references
• Format: LNCS (Springer), single blind submissions
Each submission will receive feedback from three reviewers.
All submissions must adhere to the LNCS (Springer) format. Please refer to the official
LNCS template at
https://www.springer.com/gp/computer-science/lncs/conference-proceedings-gu… .
Submissions must be in PDF format and submitted via EasyChair:
https://easychair.org/conferences/?conf=variability2026 (Select “Projects Showcase
Track”).
Presentation and Publication
Accepted papers will appear in the VARIABILITY 2026 Companion Proceedings published
by Springer in the LNCS series. Accepted submissions will receive a presentation slot. At
least one author of each accepted paper must:
• Register for the full conference, and
• Present the contribution at the event
Evaluation Criteria
Submissions will be evaluated on:
• Relevance to the conference scope
• Clarity of project goals, context, and contributions
• Potential for impact, collaboration, reuse, or technology transfer
• Value for discussion and interaction at the conference
The focus is on clarity, relevance, and value to the community rather than scientific
novelty.
Important Dates (AoE)
• Submission of Papers: 1 June 2026
• Notification of Acceptance: 21 June 2026
• Camera-Ready Submission: 15 July 2026
• Author Registration: 15 July 2026
Organisation
General Chairs
• George A. Papadopoulos, University of Cyprus, Cyprus
• Gilles Perrouin, FNRS & University of Namur, Belgium
Research Track Chairs
• Thorsten Berger, Ruhr University Bochum, Germany
• Ina Schaefer, KIT, Germany
Industry Track Chairs
• Shaukat Ali, Simula Research Lab and Oslo Metropolitan University, Norway
• Martin Becker, Fraunhofer IESE, Germany
Journal First Track Chairs
• Mathieu Acher, University Rennes, Inria, CNRS, IRISA, France
• Xhevahire Tërnava, LTCI, Télécom Paris, Institut Polytechnique de Paris, France
Doctoral Symposium Track Chairs
• Rick Rabiser, LIT CPS, Johannes Kepler University Linz, Austria
• Iris Reinhartz-Berger, University of Haifa, Israel
Demos and Tools Track Chairs
• Sandra Greiner, University of Southern Denmark, Denmark
• Leopoldo Teixeira, Federal University of Pernambuco
Projects Showcase Chairs
• Daniel Struber, Chalmers, University of Gothenburg, Radbound University, Sweden
• Dalila Tamzalit, Nantes Université, France
Hall of Fame Chairs
• Martin Becker, Fraunhofer IESE, Germany
• Goetz Botterweck, Lero - The Irish Software Research Centre and University of Limerick, Ireland
• Natsuko Noda, Shibaura Institute of Technology, Japan
Workshops Chairs
• Lidia Fuentes, Universidad de Malaga, Spain
• Malte Lochau, University of Siegen, Germany
Tutorials Chairs
• Loek Cleophas, Eindhoven University of Technology and Stellenbosch University, The Netherlands
• Mahsa Varshosaz, IT University of Copenhagen, Denmark
Proceedings Chair
• Sophie Fortz, King's College London, UK
Publicity Chairs
• Wesley Assunção, North Carolina State University, USA
• Kentaro Yoshimura, Hitachi Ltd, Japan
Local Organiser and Finance Chair
• George A. Papadopoulos, University of Cyprus, Cyprus
The rise of climate discourse on social media offers new channels for public engagement but also amplifies mis- and disinformation. As online platforms increasingly shape public understanding of science, tools that ground claims in trustworthy, peer-reviewed evidence are necessary. The new 2026 iteration of ClimateCheck builds on the results and insights from the 2025 iteration (run at SDP 2025/ACL 2025), extending it by adding training data, a new task on classifying disinformation narratives in climate discourse, and a focus on sustainable solutions.
The ClimateCheck shared task is a part of the the 3rd International Workshop on Natural Scientific Language Processing (NSLP 2026), which will take place on May 12 2026 and is co-located with LREC 2026 in Palma de Mallorca (Spain).
***Available Tasks***
Task 1: Abstract retrieval and claim verification: given a claim and a corpus of publications, retrieve the top 5 most relevant abstracts and classify each claim-abstract pair as supports, refutes, or not enough information.
Evaluation: Recall@K (K=2, 5) and B-Pref (for retrieval) + Weighted F1 (for verification) based on gold data; additional unannotated documents will be evaluated automatically. In addition, we will ask participants to use CodeCarbon to assess emissions and energy consumption at test inference.
Task 2: Disinformation narrative classification: given a claim, predict which climate disinformation narrative exists according to a predefined taxonomy.
Evaluation: Macro-, micro-, and weighted-F1 scores based on annotated documents.
***Important Dates***
- Release of datasets: December 15, 2025 (task 1); December 19, 2025 (task 2) -) Both datasets are now available for training!
- Testing phase begins: January 15, 2026 (Codabench link TBA)
- Deadline for system submissions: February 16, 2026
- Deadline for paper submissions: February 20, 2026
- Notification of acceptance: March 13, 2026
- Camera-ready papers due: March 30, 2026
- Workshop: May 12, 2026
We encourage and invite participation from junior researchers and students from diverse backgrounds. Participants are also highly encouraged to submit a paper describing their systems to the NSLP 2026 workshop.
***Call for Participation***
The call for participation with more information can be found here: https://nfdi4ds.github.io/nslp2026/
*** apologies for cross-posting ***
CALL FOR PAPERS
Ninth Workshop on Universal Dependencies (UDW 2026)
16 May 2026, Palma de Mallorca, Spain (co-located with LREC 2026)
https://universaldependencies.org/udw26/
Overview
Universal Dependencies (UD) is a framework for
cross-linguistically consistent treebank annotation that has so
far been applied to over 180 languages. The framework aims to
capture similarities as well as idiosyncrasies among
typologically different languages (e.g., morphologically rich
languages, pro-drop languages, and languages featuring clitic
doubling). The goal in developing UD was not only to support
comparative evaluation and cross-lingual learning but also to
facilitate multilingual natural language processing, enable
comparative linguistic studies, and provide resources for
language model understanding and evaluation.
The Universal Dependencies Workshop series was started to create
a forum for discussion of the theory and practice of UD, its use
in research and development, and its future goals and challenges.
Some of the previous workshops have been co-located with COLING,
EMNLP, and SyntaxFest. We invite papers on all topics relevant to
UD, including but not limited to:
* Theoretical foundations and universal guidelines
* Linguistic analysis of specific languages and/or constructions
* Language typology and linguistic universals
* Treebank annotation, conversion, and validation
* Word segmentation, morphological tagging and syntactic parsing
* Use of UD data for evaluating or understanding language models
* Linguistic studies based on the UD data
Priority will be given to papers that adopt a cross-lingual perspective.
Invited Speakers
* Marie-Catherine de Marneffe, UC Louvain
* Stephen Mayhew, Duolingo
Important Dates
Paper submission deadline: February 16, 2026
Notification of acceptance: March 16, 2026
Camera-ready version due: March 30, 2026
Workshop date: May 16, 2026
Submission Formats
We invite submissions in two formats:
* Regular (long) papers up to 8 pages of content (excluding
references and appendices). Regular papers should present
substantial, original, and unpublished research, including
empirical evaluation results where appropriate.
* Short papers up to 4 pages of content (excluding references
and appendices). Short papers may offer smaller, focused
contributions, such as work in progress, negative results,
surveys, or opinion pieces.
We also welcome non-archival papers, defined as work that has
already been published or accepted for publication at another
computational linguistics venue. These papers may be presented at
the workshop but will not appear in the LREC 2026 Workshop
Proceedings.
Accepted papers will be given one additional page to address
reviewer comments.
Paper Submission, Review Process and Selection Criteria
Submissions will be handled via the START Conference Manager.
* Submission link: https://softconf.com/lrec2026/UDW2026/
Papers should describe original work; they should emphasise
completed work rather than intended work, and should indicate
clearly the state of completion of the reported results.
Submissions will be judged on correctness, originality, technical
strength, significance and relevance to the conference, and
interest to the attendees.
All submissions should follow the two-column LREC style
guidelines. We strongly recommend the use of the LaTeX style
files, OpenDocument, or Microsoft Word templates created for
LREC: https://lrec2026.info/authors-kit/. Unlike LREC main
conference submissions, UDW submissions are allowed to include
appendices, and the UDW makes a distinction between short (up to
four pages) and long papers (up to eight pages). All papers must
be anonymous, i.e., not reveal author(s) on the title page or
through self-references. So, e.g., “We previously showed (Smith,
2020) …”, should be avoided. Instead, use citations such as
“Smith (2020) previously showed …”.
All papers will undergo a double-blind peer review process, with
final acceptance decisions made by the workshop chairs.
Submissions that violate the requirements above will be rejected
without review.
LRE-Map and Sharing Language Resources
When submitting a paper from the START page, authors will be
asked to provide essential information about resources (in a
broad sense, i.e. also technologies, standards, evaluation kits,
etc.) that have been used for the work described in the paper or
are a new result of your research. Moreover, ELRA encourages all
LREC authors to share the described LRs (data, tools, services,
etc.) to enable their reuse and replicability of experiments
(including evaluation ones).
Presentation Format
Accepted papers will be presented as oral or poster
presentations. The mode of presentation will be determined by the
workshop chairs and does not reflect the quality of the
submission.
UDW 2026 will primarily be an in-person event, but online
participation will also be possible for the participants who
cannot travel to the conference.
Accepted papers will be published in the LREC 2026 Workshop Proceedings.
Website: https://universaldependencies.org/udw26/
Contact: udw26(a)googlegroups.com
Organizing Committee
* Çağrı Çöltekin, Tübingen University
* Kaja Dobrovoljc, University of Ljubljana & Jozef Stefan Institute
* Joakim Nivre, Uppsala University
Call for Participation: Shared Task in Parsing into UMR
Please consider participating in shared task in multilingual parsing
into Uniform Meaning Representation. Details and registration link here:
https://ufal.mff.cuni.cz/umr-parsing
The shared task is part of the DMR 2026 workshop, see the call for
papers below:
Call for Papers: DMR 2026
DMR 2026 invites the submissions of long and short papers about original
works on the design, processing, and use of meaning representations.
While deep learning methods have led to many breakthroughs in practical
natural language applications, there is still a sense among many NLP
researchers that we have a long way to go before we can develop systems
that can actually “understand” human language and explain the decisions
they make. Indeed, “understanding” natural language entails many
different human-like capabilities, and they include but are not limited
to the ability to track entities in a text, understand the relations
between these entities, track events and their participants described in
a text, understand how events unfold in time, and distinguish events
that have actually happened from events that are planned or intended,
are uncertain, or did not happen at all. We believe a critical step in
achieving natural language understanding is to design meaning
representations for text that have the necessary meaning “ingredients”
that help us achieve these capabilities. Such meaning representations
can also potentially be used to evaluate the compositional
generalization capacity of deep learning models.
There has been a growing body of research devoted to the design,
annotation, and parsing of meaning representations in recent years. In
particular, formal meaning representation frameworks such as Minimal
Recursion Semantics (MRS) and Discourse Representation Theory are
developed with the goal of supporting logical inference in
reasoning-based AI systems and are therefore easily translatable into
first-order logic, while other meaning representation frameworks such as
Abstract Meaning Representation (AMR), Uniform Meaning Representation
(UMR), Tecto-grammatical Representation (TR) in Prague Dependency
Treebanks and the Universal Conceptual Cognitive Annotation (UCCA), put
more emphasis on the representation of core predicate-argument
structure. The automatic parsing of natural language text into these
meaning representations and the generation of natural language text from
these meaning representations are also very active areas of research,
and a wide range of technical approaches and learning methods have been
applied to these problems.
DMR intends to bring together researchers who are producers and
consumers of meaning representations and, through their interaction,
gain a deeper understanding of the key elements of meaning
representations that are the most valuable to the NLP community. The
workshop will provide an opportunity for meaning representation
researchers to present new frameworks and to critically examine existing
frameworks with the goal of using their findings to inform the design of
next-generation meaning representations. One particular goal is to
understand the relationship between distributed meaning representations
trained on large data sets using network models and the symbolic meaning
representations that are carefully designed and annotated by NLP
researchers, with an aim of gaining a deeper understanding of areas
where each type of meaning representation is the most effective.
The workshop solicits papers that address one or more of the following
topics:
* Development and annotation of meaning representations;
* Challenges and techniques in leveraging meaning representations for
downstream applications, including neuro-symbolic approaches;
* The relationship between symbolic meaning representations and
distributed semantic representations;
* Issues in applying meaning representations to multilingual settings
and lower-resourced languages;
* Challenges and techniques in automatic parsing of meaning
representations;
* Challenges and techniques in automatically generating text from
meaning representations;
* Meaning representation evaluation metrics;
* Cross-framework comparison of meaning representations and their
formal properties;
* Any other topics that address the design, processing, and use of
meaning representations.
Contact
For any questions regarding the workshop, please contact us
atdmr.workshop.2026(a)gmail.com.
*
————————————————————————————
Call for Participation: CRAC 2026 Shared Task on Multilingual
Coreference Resolution
————————————————————————————
https://ufal.mff.cuni.cz/corefud/crac26
We are excited to announce the fifth edition of the Shared Task on
Multilingual Coreference Resolution! As in the previous edition, where
LLMs were not able to dethrone traditional approaches, the upcoming task
again primarily focuses on LLMs and challenges their ability to predict
coreference across a range of typologically diverse languages.
The results will be presented at the CODI-CRAC 2026 workshop, co-located
with the ACL 2026 conference in San Diego, USA. System description
papers will be published in the proceedings of CODI-CRAC, also available
in the ACL anthology.
—————————————
Task description
—————————————
The task is to identify mentions and cluster them according to their
reference to the same entity.
Participants may choose between two tracks:
1.
LLM track
2.
Unconstrained track
—————————————
Data
—————————————
The Shared Task is based on the upcoming CorefUD 1.4. collection, which
comprises datasets with coreference annotation for at least 17
languages. Participants may also choose to compete on only a subset of
these languages.
—————————————
Timeline (tentative)
—————————————
January 16, 2026 - start of the development phase
*
release of training and development data
*
evaluation tool available on GitHub
*
baseline system available at GitHub
*
development evaluation possible via CodaBench
March 4, 2026 - start of the evaluation phase
*
test evaluation via CodaBench
March 18, 2026 (AoE) - end of the evaluation phase
April 10, 2026 - submission of system description papers to the
CODI-CRAC 2026
workshop
July 3/4, 2026 – the CODI-CRAC workshop at ACL
—————————————
Participation
—————————————
Please let us know by filling in the registration form:
https://ufal.mff.cuni.cz/corefud/crac26#llm-registration
<https://ufal.mff.cuni.cz/corefud/crac26#llm-registration>
—————————————
Organizers
—————————————
Michal Novák, Charles University, Czechia
Martin Popel, Charles University, Czechia
Zdeněk Žabokrtský, Charles University, Czechia
Dan Zeman, Charles University, Czechia
Anna Nedoluzhko, Charles University, Czechia
Milan Straka, Charles University, Czechia
Ondřej Pražák, University of West Bohemia, Czechia
Miloslav Konopík, University of West Bohemia, Czechia
Jakub Sido, University of West Bohemia, Czechia
*
Dear all,
I'm collecting data on corpus interface usability to support a standardization proposal.
If you use corpora for research or teaching, I'd very much appreciate 5-10 minutes of your time:
https://www.1ka.si/a/65063543
The survey covers common frustrations (export limits, query interfaces, metadata gaps) and desired improvements.
It is fully anonymous, doesn't even ask how far you are in your academic career.
Thanks a lot!
Varya Magomedova
varvara.magomedova[(a)]ung.si
University of Nova Gorica