Call for Papers
**************************************************************
19th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA
Co-located with LREC 2026, Palma de Mallorca (in-person & online)
May 11, 2026
Paper submission deadline: February 28, 2026
Workshop website: https://comparable.lisn.upsaclay.fr/bucc2026/
Main conference website: https://lrec2026.info/
**************************************************************
MOTIVATION
In the language engineering and linguistics communities, research
in comparable corpora has been motivated by two main reasons. In
language engineering, on the one hand, it is chiefly motivated by
the need to use comparable corpora as training data for data-driven
NLP applications such as statistical and neural machine translation, or
cross-lingual retrieval. In linguistics, on the other hand, comparable
corpora are of interest because they enable cross-language discoveries
and comparisons. It is generally accepted in both communities that
comparable corpora consist of documents that are comparable in content
and form in various degrees and dimensions across several languages.
Parallel corpora are on the one end of this spectrum, and unrelated
corpora are on the other. Increasingly, these resources are not only
collected, but also augmented or even created synthetically, which
raises new questions about how to define and measure comparability.
In recent years, the use of comparable corpora for pre-training Large
Language Models (LLMs) has led to their impressive multilingual and
cross-lingual abilities, which are relevant to a range of applications,
including information retrieval, machine translation, cross-lingual text
classification, etc. The linguistic definitions and observations related
to comparable corpora are crucial to improve methods to mine such corpora,
to assess and document synthetic data, and to improve cross-lingual transfer
of LLMs. Therefore, it is of great interest to bring together builders and
users of such corpora.
PANEL DISCUSSION
The panel discusses the impact of synthetic data on comparable corpora
research. Fundamental questions about how LLMs transform our understanding
and use of multilingual data are addressed.
TOPICS
We solicit contributions on all topics related to comparable (and parallel)
corpora, including but not limited to the following:
Building Comparable Corpora
- Automatic and semi-automatic methods, including generating
comparable corpora using LLMs
- Methods to mine parallel and non-parallel corpora from the web
- Tools and criteria to evaluate the comparability of corpora
- Parallel vs non-parallel corpora, monolingual corpora
- Rare and minority languages, within and across language families
- Multi-media/multi-modal comparable corpora
Synthetic Data for Comparable Corpora
- LLM generation of comparable/parallel data
- Improving comparability of synthetic data
- Incidental bilingualism & pre-training use of comparable data
- Comparability & cross-lingual consistency
- Detection & attribution of synthetic vs. human text
- English-centric effects & fairness across languages/scripts
- Evaluation & reproducibility for downstream tasks
Applications of Comparable Corpora
- Human translation
- Language learning
- Cross-language information retrieval & document categorization
- Bilingual and multilingual projections
- (Unsupervised) machine translation
- Writing assistance
- Machine learning techniques using comparable corpora
Mining from Comparable Corpora
- Cross-language distributional semantics, word embeddings and
pre-trained multilingual transformer models
- Extraction of parallel segments or paraphrases from comparable corpora
- Methods to derive parallel from non-parallel corpora (e.g. to provide
for low-resource languages in neural machine translation)
- Extraction of bilingual and multilingual translations of single words,
multi-word expressions, proper names, named entities, sentences,
paraphrases etc. from comparable corpora.
- Induction of morphological, grammatical, and translation rules from
comparable corpora
- Induction of multilingual word classes from comparable corpora
Comparable Corpora in the Humanities
- Comparing linguistic phenomena across languages in contrastive linguistics
- Analyzing properties of translated language in translation studies
- Studying language change over time in diachronic linguistics
- Assigning texts to authors via authors' corpora in forensic linguistics
- Comparing rhetorical features in discourse analysis
- Studying cultural differences in sociolinguistics
- Analyzing language universals in typological research
IMPORTANT DATES
28 Feb 2026: Paper Submission deadline
22 Mar 2026: Notification of acceptance
29 Mar 2026: Camera-ready final papers
14 Apr 2026: Workshop Programme final version
11 May 2026: Workshop date
All deadlines are 11:59PM UTC-12:00 (“anywhere on earth”).
For updates of the schedule, please see the workshop website.
PRACTICAL INFORMATION
The workshop is a hybrid event, both in-person and online. Workshop
registration is via the main conference registration site, see
https://lrec2026.info/
The workshop proceedings will be published in the ACL Anthology
(https://aclanthology.org/).
SUBMISSION GUIDELINES
Please follow the style sheet and templates (for LaTeX, Overleaf and
MS-Word) provided for the main conference at
https://lrec2026.info/authors-kit/
Papers should be submitted as a PDF file using the START conference
manager at https://softconf.com/lrec2026/BUCC2026/
Submissions must describe original and unpublished work and range from 4
to 8 pages plus unlimited references. Reviewing will be double blind, so
the papers should not reveal the authors' identity. Accepted papers will
be published in the workshop proceedings.
Double submission policy: Parallel submission to other meetings or
publications is possible but must be notified to the workshop organizers
by e-mail immediately upon submission to another venue.
For further information and updates, please see the BUCC 2026 web page
at https://comparable.lisn.upsaclay.fr/bucc2026/.
WORKSHOP ORGANIZERS
- Reinhard Rapp (University of Mainz, Germany)
- Ayla Rigouts Terryn (Université de Montréal, Mila, Canada)
- Serge Sharoff (University of Leeds, United Kingdom)
- Pierre Zweigenbaum (Université Paris-Saclay, CNRS, France)
Contact: reinhardrapp (at) gmx (dot) de
PROGRAMME COMMITTEE
- Ebrahim Ansari (Institute for Advanced Studies in Basic Sciences, Iran)
- Eleftherios Avramidis (DFKI, Germany)
- Gabriel Bernier-Colborne (National Research Council, Canada)
- Kenneth Church (VecML.com, USA)
- Patrick Drouin (Université de Montréal, Canada)
- Alex Fraser (Technical University of Munich, Germany)
- Natalia Grabar (CNRS, University of Lille, France)
- Amal Haddad Haddad (Universidad de Granada, Spain)
- Kyo Kageura (University of Tokyo, Japan)
- Natalie Kübler (Université Paris Cité, France)
- Philippe Langlais (Université de Montréal, Canada)
- Yves Lepage (Waseda University, Japan)
- Shervin Malmasi (Amazon, USA)
- Michael Mohler (Language Computer Corporation, USA)
- Emmanuel Morin (Nantes Université, France)
- Dragos Stefan Munteanu (RWS, USA)
- Preslav Nakov (Mohamed bin Zayed University of AI, United Arab Emirates)
- Ted Pedersen (University of Minnesota, Duluth, USA)
- Reinhard Rapp (University of Mainz, Germany)
- Ayla Rigouts Terryn (Université de Montréal & Mila, Canada)
- Nasredine Semmar (CEA LIST, Paris, France)
- Serge Sharoff (University of Leeds, UK)
- Richard Sproat (Sakana.ai, Tokyo, Japan)
- Marko Tadić (University of Zagreb, Croatia)
- François Yvon (CNRS & Sorbonne Université, France)
- Pierre Zweigenbaum (Université Paris-Saclay, CNRS, France)
INFORMATION ABOUT THE LRE 2026 MAP AND THE "SHARE YOUR LRs!" INITIATIVE
When submitting a paper from the START page, authors will be asked to
provide essential information about resources (in a broad sense, i.e.
also technologies, standards, evaluation kits, etc.) that have been used
for the work described in the paper or are a new result of the research.
Moreover, ELRA encourages all LREC authors to share the described LRs
(data, tools, services, etc.) to enable their reuse and replicability of
experiments (including evaluation ones).
Call for Participation
We are pleased to announce the PestCLEF2026<https://www.imageclef.org/PestCLEF2026> shared task as part of the LifeCLEF evaluation lab<https://clef2026.clef-initiative.eu/labs/lifeclef/> at CLEF 2026<https://clef2026.clef-initiative.eu/>.
Task
-------------------------
PestCLEF2026 is a knowledge graph extraction task framed as a document-level relation extraction problem. Target relations reflect ecological interactions and events relevant to plant health monitoring, which involve entities such as Host, Pest, Disease, Vector, and Location.
Important Dates
-------------------------
- Now: Registration is open and training data is released
- 23 April 2026: Registration closes
- 7 May 2026: Competition Deadline
- 28 May 2026: Deadline for submission of working note papers at the CLEF Conference by participants (CEUR-WS proceedings)
- 30 June 2026: Notification of acceptance of working note papers
- 6 July 2026: Camera-ready deadline for working note papers
- 21–24 September 2026: CLEF 2026 Conference in Jena, Germany
Register and Participate
-------------------------
Registration: https://clef-labs-registration.dipintra.it/registrationForm.php
CLEF2026 Discord server (see #lifeclef): https://discord.gg/PEMh4a2YHV
PestCLEF2026 homepage (general info): https://www.imageclef.org/PestCLEF2026
PestCLEF2026 Kaggle page (to participate): https://www.kaggle.com/competitions/pest-clef-2026
Organizers
-------------------------
Robert Bossy (Paris-Saclay University, INRAE, France)
Claire Nédellec (Paris-Saclay University, INRAE, France)
Marine Courtin (Paris-Saclay University, INRAE, France)
Louise Deléger (Paris-Saclay University, INRAE, France)
Keep in touch via Kaggle or Discord. We are looking forward to your submission!
The PestCLEF2026 team
*** Last Call for Workshop Proposals ***
International Conference on Software and Systems Reuse, Product Lines,
and Configuration (VARIABILITY 2026)
29 September - 2 October 2026, 5* St. Raphael Resort and Marina
Limassol, Cyprus
https://conf.researchr.org/home/variability-2026
VARIABILITY is a new conference that has been merged of three prominent conferences
focussing on software and systems variability, configuration and reuse: SPLC (the
International Systems and Software Product Line Conference, 29 successful editions,
ranked as a top conference), VaMoS (the International Working Conference on Variability
Modelling of Software-Intensive Systems, 19 successful editions), and ICSR (the
International Conference on Systems and Software Reuse, 22 successful editions).
We invite you to submit proposals for half-day or full-day workshops in any area related
to the field of Software and Systems Reuse, Product Lines, and Configuration, all of which
fall under the broader area of Variability. In particular, workshops on challenging,
emerging areas related to the conference topics are especially sought. We particularly
encourage workshop proposals for highly interactive and collaborative workshops, rather
than mini-conferences, e.g., apart from the traditional short and long papers, consider
allowing position papers with only one page (not included in the proceedings) and focus
on a lively discussion after the presentation, to foster new ideas and gather feedback
(rather than just defending the presented work). The expected date of the workshops
will be the September 29th, 2026, before the main track of the conference.
Submissions / Publishing
VARIABILITY workshop papers will be published in a volume of the conference proceedings
published by Springer. Moreover, a one-page summary of each accepted workshop will be
published in the proceedings as well.
Workshop proposals should be authored by at least two organizers, preferably from
different institutions, and they should contain the following three sections and address
each corresponding point:
1. Organizers
• Name: organizers’ full names
• Contact information: affiliations, job titles, postal addresses, e-mail addresses, URLs,
and phone
• Brief biography: 100-200 words, focusing on the organizers’ expertise in the field and
experience as workshop organizers
2. Workshop Content
• Title: workshop title and acronym
• Abstract: max 150 words describing the workshop (suitable for the conference’s website)
• Tentative Website URL
• Topics and motivation:
• What are the topics, themes, and areas of interest of the workshop?
• How is the workshop relevant to VARIABILITY?
• How does the workshop connect VARIABILITY to other research communities?
• Goals and expected results:
• Explicitly state the goals of the workshop and how you intend to reach them
• What are the expected results of the workshop?
• How will these results be disseminated?
• Format:
• What is the planned workshop format (paper presentations, working sessions, invited
talks (please note here that such talks are not financially supported by the conference),
lightning talks, demonstrations, etc.)?
• To avoid duplicated topics and cancellations, did you coordinate with or (plan to)
merge workshops on the same/similar topics from previous years (if there are any)?
• What will be done to stimulate collaborative interaction?
• What are the planned pre- and post-workshop activities?
• Participants:
• What is the expected number of submissions and participants? Provide a plan for
attracting sufficient submissions and promoting attendance
• If applicable, please provide information from previous or related workshops. Have
there been previous workshops on the same or a closely related topic? When, where
and with how many participants?
• Special room equipment (please note that VARIABILITY conference and the workshops
are in-person events) like flip charts, microphone, etc.
• Do you plan for a half-day or full-day workshop?
• Program Committee: list of tentative program committee members, names and
affiliations
3. Preliminary Call for Papers
This will necessarily repeat some of the information from the previous sections but should
be targeted towards prospective participants. It should address the following items:
• Overview of the motivation, topics, and goals
• Workshop format
• Deadlines of the workshop (see dates in this call for proposals)
• Submission guidelines and review process
• References to previous workshops (websites)
• Dissemination campaign to distribute the CFP
4. References to previous workshops (websites)
Submission Instructions
Please send your workshop proposals using EasyChair:
https://easychair.org/conferences?conf=variability2026
A workshop proposal must be at most 8 pages long. Submissions must follow the
Springer guidelines:
https://www.springer.com/gp/computer-science/lncs/conference-proceedings-gu…
Relevant supporting material, such as proceedings from previous editions of the proposed
workshop or other workshops organized by the proposal authors, should be included if
available but are not required for submission.
Acceptance Criteria
Each workshop proposal will be evaluated according to the relevance of its topic, the
expertise and experience of the workshop organizers, and the workshop’s potential for
attracting participants and generating useful results. We underline the importance of
active and creative workshops that foster a collaborative environment of interest to both
practitioners and researchers, aiming, e.g., to evolve the field of Variability and to identify
elements of joint future work. To obtain a balanced and cohesive workshop program, the
Organizing Committee will collaborate closely with workshop organizers and reserves the
right to circulate proposals to other submitters in view of possible workshop mergers. The
organizers of accepted workshops will be required to create and maintain a website in a
timely manner to serve as a workshop information center and to provide a repository for
documenting pre- and post-workshop activities.
At least one author of each accepted proposal must register and attend VARIABILITY 2026
in order for the workshop to be accepted and the summary of the workshop published.
The submission and review platform for workshop papers will be the one for the main
conference (i.e., all workshops will be as different tracks under the same Easy Chair
installation).
Important Dates (AoE)
• Workshop Proposals: 2 March 2025
• Notification of Acceptance: 16 March 2026
• Workshop Papers Submission: 15 June 2026
• Workshop Papers Notification: 7 July, 2026
• Camera-Ready Version Submission: 14 July, 2026
• Workshop Summary: 14 July, 2026
• Author Registration: 14 July, 2026
Organisation
General Chairs
• George A. Papadopoulos, University of Cyprus, Cyprus
• Gilles Perrouin, FNRS & University of Namur, Belgium
Research Track Chairs
• Thorsten Berger, Ruhr University Bochum, Germany
• Ina Schaefer, KIT, Germany
Industry Track Chairs
• Shaukat Ali, Simula Research Lab and Oslo Metropolitan University, Norway
• Martin Becker, Fraunhofer IESE, Germany
Journal First Track Chairs
• Mathieu Acher, University Rennes, Inria, CNRS, IRISA, France
• Xhevahire Tërnava, LTCI, Télécom Paris, Institut Polytechnique de Paris, France
Doctoral Symposium Track Chairs
• Rick Rabiser, LIT CPS, Johannes Kepler University Linz, Austria
• Iris Reinhartz-Berger, University of Haifa, Israel
Demos and Tools Track Chairs
• Sandra Greiner, University of Southern Denmark, Denmark
• Leopoldo Teixeira, Federal University of Pernambuco
Projects Showcase Chairs
• Daniel Struber, Chalmers, University of Gothenburg, Radbound University, Sweden
• Dalila Tamzalit, Nantes Université, France
Hall of Fame Chairs
• Martin Becker, Fraunhofer IESE, Germany
• Goetz Botterweck, Lero - The Irish Software Research Centre and University of Limerick, Ireland
• Natsuko Noda, Shibaura Institute of Technology, Japan
Workshops Chairs
• Lidia Fuentes, Universidad de Malaga, Spain
• Malte Lochau, University of Siegen, Germany
Tutorials Chairs
• Loek Cleophas, Eindhoven University of Technology and Stellenbosch University, The Netherlands
• Mahsa Varshosaz, IT University of Copenhagen, Denmark
Proceedings Chair
• Sophie Fortz, King's College London, UK
Publicity Chairs
• Wesley Assunção, North Carolina State University, USA
• Kentaro Yoshimura, Hitachi Ltd, Japan
Local Organiser and Finance Chair
• George A. Papadopoulos, University of Cyprus, Cyprus
# CLEF 2026 - Call for Papers
CLEF 2026 Conference and Labs of the Evaluation Forum
Information Access Evaluation meets Multilinguality, Multimodality, and Visualization
21-24 September 2026, Jena, Germany
https://clef2026.clef-initiative.eu/calls/papers/
# Good to Know
The CLEF 2026 Conference welcomes papers in the Information Access domain that describe rigorous hypothesis testing regardless of whether the results are positive or negative. Each submission is reviewed in two stages, see details below.
# Aim and Scope
The CLEF Conference addresses all aspects of Information Access in any modality and language. CLEF consists of the presentation of research papers and a series of workshops presenting the results of lab-based comparative evaluation benchmarks.
CLEF 2026 is the 17th CLEF conference, continuing the popular CLEF campaigns that have run since 2000, contributing to the systematic evaluation of information access systems, primarily through experimentation on shared tasks.
The CLEF conference has a clear focus on experimental Information Access as carried out within evaluation forums (e.g., CLEF Labs, TREC, NTCIR, FIRE, MediaEval, RomIP, SemEval, and TAC) with special attention to the challenges of multimodality, multilinguality, and interactive search in different domains, also considering specific classes of users, such as children, students, or impaired users in different tasks (e.g., academic, professional, or everyday life).
We invite paper submissions on significant new insights demonstrated on information access test collections, on the analysis of test collections and evaluation measures, and on concrete proposals to push the boundaries of the Cranfield-style evaluation paradigm.
All submissions to the CLEF main conference will be reviewed on the basis of relevance, originality, importance, and clarity. CLEF welcomes papers that describe rigorous hypothesis testing regardless of whether the results are positive or negative. CLEF also welcomes past runs, results, data analyses, and new data collections. Methods are expected to be written so that they are reproducible by others, and the logic of the research design should be clearly described in the paper. Linking to additional resources, such as code or data repositories, is encouraged. The conference proceedings will be published in the Springer Lecture Notes in Computer Science (LNCS).
# Topics
Relevant topics for the CLEF 2026 Conference include, but are not limited to:
- Information access in any language or modality: information retrieval, question answering, recommender systems, image retrieval, search interfaces and design, infrastructures, etc.
- Interactive and conversational search evaluation: the interactive/conversational evaluation of retrieval-augmented generation systems, information retrieval systems using user-centered methods, evaluation of novel search interfaces, novel interactive/conversational evaluation methods, simulation of interaction/conversation, etc.
- Analytics for information access: theoretical and practical results in the analytics field specifically targeted at information access data analyses, data enrichment, etc.
- Reproducibility and replicability: analyses of past results/runs in depth.
- Fairness, Accountability, Transparency, Ethics, and Explainability (FATE) in information access.
- Language diversity in information access: work on low-resource languages.
- Models leveraging collaborative and social data, and their evaluation.
User studies, either based on lab studies or crowdsourcing.
- Evaluation initiatives: conclusions, lessons learned, impact, and projections of any evaluation initiative upon completing its cycle.
- Evaluation: methodologies, metrics, statistical and analytical tools, component-based evaluation, user groups and use cases, ground-truth creation, impact of multilingual/multicultural/multimodal differences, etc.
- Technology transfer: economic impact/sustainability of information access approaches, deployment and exploitation of systems, use cases, etc.
- Specific application domains: information access and its evaluation in application domains such as cultural heritage, digital libraries, social media, health information, legal documents, patents, news, books, and in the form of text, audio, and/or image data.
- New data collection: presentation of new data collections with potential high impact on future research, specific collections from companies or labs, and multilingual collections.
- Reflections on past achievements and future research directions, roadmaps, outlooks for future developments, and lessons learned.
# Format
Authors are invited to electronically submit original papers, which have not been published and are not under consideration elsewhere, using the LNCS proceedings format (http://www.springer.com/it/computer-science/lncs/conference-proceedings-gui…)
Three categories of papers will be accepted:
- Long research papers: 12 pages plus references for complete research work
- Short research papers: 6 pages plus references for position/discussion papers, new evaluation proposals, developments and applications, etc.
- Past, Present, Future: Up to 12 pages for future directions and research roadmaps, reflections on past achievements, etc.
Papers should be anonymous. Sharing code and data with reviewers should be done via anonymous repositories (such as https://anonymous.4open.science/).
# Review Process for Research Papers
Research papers will be peer-reviewed by three members of the programme committee in two stages using a results-blind reviewing process. At the first stage, the members will review the paper's originality, clarity, technical & theoretical soundness, and methodology. At the second stage, the complete manuscripts that passed the first stage will be reviewed. At this stage, reviewers will also look at the reproducibility of the work. The final decision will not be based on whether results are positive or beat a baseline. Therefore, negative results and failed experiments are explicitly welcome.
Authors of long and short papers are asked to submit TWO versions of their manuscript:
1. Methodology version (restricted): This version does NOT report anything related to the results of the study. At this stage, the manuscripts will be evaluated based on the importance of the problem addressed and the soundness of the methodology. Manuscripts can include an introduction, a description of the proposed methodology, and datasets used. However, there should be no section on results and discussion. The authors should also remove any mentions of results from the included sections (e.g., the abstract and introduction).
2. Experimental version (complete): The complete manuscript that contains all the sections of the paper, including the experiments and results.
The submission deadline for both versions is 15 May 2026.
Authors of Past, Present, Future papers are asked to submit to the “Past, Present, Future” track.
The submission deadline for the Past, Present, Future track is 15 May 2026.
# Paper Submission
Papers should be submitted in PDF format at https://easychair.org/conferences/?conf=clef2026
- Submit the methodology/restricted version to the "Conference - Methodology Part" track.
- Submit the experimental/complete version to the "Conference - Experimental Part" track.
- Submit the Past, Present, Future papers to “Conference - Past, Present, Future” track.
- Submit the best of CLEF 2025 Labs papers to the "Conference - Best of CLEF 2025 Labs" track.
# Best Paper Award
A Best Paper Award will be given to one outstanding conference paper accepted to the conference. This award, sponsored by Springer LNCS, includes a certificate and a 500 EUR prize.
# Organisation
Programme Chairs
- Philipp Schaer, Technische Hochschule Köln, Germany
- Eva Zangerle, University of Innsbruck, Austria
Lab Chairs
- Sean MacAvaney, University of Glasgow, United Kingdom
- Julia Maria Struß, Fachhochschule Potsdam University of Applied Sciences, Germany
General Chairs
- Matthias Hagen, Friedrich-Schiller-Universität Jena, Germany
- Martin Potthast, University of Kassel, hessian.AI, ScaDS.AI, Germany
- Benno Stein, Bauhaus-Universität Weimar, Germany
===apologies for cross-postings===
*Third Call for Papers*
*CHiPSAL 2026: Second Workshop on Challenges in Processing South Asian
Languages*
Extended submission date: *28 February 2026.*
*Submit here: *https://softconf.com/lrec2026/CHiPSAL2026/
We are pleased to announce the Second Workshop on Challenges in Processing
South Asian Languages (CHiPSAL 2026), to be held in hybrid mode on 16 May
2026, co-located with LREC 2026.
CHiPSAL 2026 invites substantial, original, and unpublished research on all
areas of natural language processing, language resources, and
evaluation—covering spoken, signed, and multimodal language—as well as
system demonstrations. We welcome long and short papers addressing
challenges, resources, tools, and innovations for South Asian languages.
Topics include, but are not limited to:
- Encoding and Unicode issues
- Orthographic complexities
- Morphology and generation
- Dialectal variation and standardisation
- Code-mixing and multilingualism
- Building linguistic resources
- Speech recognition and synthesis
- Technology for linguistic heritage preservation
- Benchmarking models
- Large language models for South Asian languages
*Important Dates (AoE)*
- Submission Deadline: 28 February 2026
- Notification of Acceptance: 20 March 2026
- Camera-ready Papers: 29 March 2026
- Workshop (Hybrid): 16 May 2026
*Submission Guidelines*
CHiPSAL 2026 accepts oral, poster, and poster+demo papers.
- Short papers: 4 pages
- Long papers: 8 pages
(Excluding ethics/limitations, references, acknowledgements, and data/code
availability statements)
*All submissions must:*
- Follow the LREC 2026 stylesheet: https://lrec2026.info/authors-kit/
- Be fully anonymised for double-blind review
- Include required ethics/limitations and data/code availability
statements
- Be self-contained (no appendices or supplementary files at
submission)
- Be relevant to South Asian language processing
Papers must report original, unpublished work. Concurrent submissions must
be declared. Accepted papers will appear in the workshop proceedings.
CHiPSAL will also accept submissions that were not selected by ACL Rolling
Review, the LREC main conference, and EACL 2026, provided they are
accompanied by their reviews and are related to South Asian languages. Such
submissions must be uploaded as a ZIP file in the submission system,
including the review decision email and all reviews in a text file. Each
submission will be reviewed again by the programme committee.
*More Information: https://sites.google.com/view/chipsal/
<https://sites.google.com/view/chipsal/>*
Do not miss the opportunity to submit your work, strengthen the South Asian
NLP community, and support the development of language technology in one of
the world’s most populous and linguistically diverse regions.
We look forward to your contributions.
Best regards,
The CHiPSAL 2026 Organising Committee
--
*Dr Kengatharaiyer Sarveswaran (Sarves)*
Senior Lecturer (Grade-I) in Computer Science
Department of Computer Science
Faculty of Science
University of Jaffna
Sri Lanka
sarves.github.io
Join Our Team at the Centre for Language Technology, University of Copenhagen!
We are seeking a talented and motivated postdoctoral researcher to join a pioneering Nordic project at the forefront of Natural Language Processing. This role focuses on the linguistic and cultural alignment of Large Language Models. As part of the CAALLM Consortium, you will collaborate with research groups in Denmark, Sweden, Norway, Latvia, and the Faroe Islands. You will also contribute to the Danish AI initiative, Danish Foundation Models, working closely with Alexandra Institutet and top Danish universities.
Application deadline: March 10th, 2026
Start date: 1st of July 2026 or as soon as possible thereafter
Duration: 30 months
Application link: https://employment.ku.dk/faculty/?show=156687
Best regards,
Ali Basirat<https://nors.ku.dk/english/staff/?pure=en/persons/776431>
Associate Professor of Natural Language Processing
University of Copenhagen
Center for Language Technology (CST)<https://cst.ku.dk/english/>
Department of Nordic Studies and Linguistics<https://nors.ku.dk/>
Emil Holms Kanal 2
2300 København S
Email:alib@hum.ku.dk
Room: 22.3.34
Dear colleagues,
The European Summer University in Digital Humanities (ESU) invites
proposals for workshops for the upcoming 16th edition in Université
Marie-et-Louis-Pasteur, Besançon, France from July 6 – July 18, 2026.
Workshop areas. We are particularly looking for proposals covering:
- Audiovisual analysis and cultural analytics;
- Computational & digital philology;
- Statistics for humanities;
- Design and construction of databases and corpora;
- Large Language Models and critical AI.
All digital humanities scholars and practitioners, GLAM professionals,
interdisciplinary teams at the intersection of humanities, social science
and computer science are welcome to submit.
Workshop formats
1. Two week course (15 + 15 teaching hours, + 1.5h (x2) open teaser session);
2. One week course (15 teaching hours + 1.5h teaser);
Two week workshops will be prioritized in evaluation as they are the core
teaching units of the ESU. They run in parallel across the schedule and are
designed as comprehensive, intensive hands-on training in key areas of
Digital Humanities. Each workshop’s optimal audience is 10-12 participants
for efficient learning; a workshop may be cancelled if it has less than 5
registered participants.
Submission & timeline. To submit a proposal, please prepare:
- Instructors' CV (max. 2 pages) and short bio (max. 250 words);
- Title and short abstract (max. 250 words);
- Extended description and syllabus (800–1200 words), including a day-by-day outline
- Please include intended outcomes, any prerequisites, description of datasets and materials, technical requirements
Proposals should be submitted by March 1 via the form here
<https://framaforms.org/european-summer-university-in-digital-humanities-wor…>.
The Steering Committee will review the applications and announce results on March 6.
Evaluation. Workshops will be evaluated according to their 1) scientific
and pedagogical quality; 2) fit to the school’s scope; 3) feasibility; 4)
diversity and inclusiveness; 5) sustainability and openness.
Practicalities. Accommodation and most meals (with some evenings and Sunday
possibly excluded) will be provided for all instructors during their stay
in Besançon. Travel costs within Europe will be fully covered;
reimbursement of travel costs from outside Europe will be partial and must
be agreed upon with the local organizers prior to the start of the ESU. A
small teaching honorarium can be expected. Further information will be
provided by the local organizers once the workshop’s financial arrangements
are secured.
We usually avoid significant thematic overlap in the workshop offer, so there
will be less priority given to proposals on distant reading, mapping and
GIS, stylometry, TEI-XML and digital archives.
Feel free to write to esudhsteering [@] gmail.com if you wish to inquire
about the fit of your workshop or have any other questions.
With best wishes,
Chairs of the ESU Steering Committee
Artjoms Šeļa & Jeremi Ochab
Dr. Artjoms Šeļa
Institute of Czech Literature, Сzech Academy of Sciences
Website<https://artjomsh.github.io/web/> | Scholar
<https://scholar.google.com/citations?user=4eYapmIAAAAJ&hl=en> | PoeTree
<https://versologie.cz/poetree/>
______________________________
Prof. Dr. phil. Elisabeth Burr
Französische, frankophone und italienische Sprachwissenschaft
Gründerin und Direktorin der European Summer University in Digital Humanities "Culture & Technology" (ESU DH C&T) 2009-2022 (https://esu.fdhl.info/)
Präsidentin der European Association for Digital Humanities (EADH) 2016-2023 (https://eadh.org/)
(Vize-)Präsidentin des Constituent Organisations Board der Alliance of Digital Humanities Organisations (ADHO) 2019-2021 (https://adho.org/)
Vorsitzende des Wissenschaftlichen Programmkomitees von EADH2020-2021, Krasnoyarsk (https://eadh2020-2021.org/)
Universität Leipzig
Beethovenstr. 15
D-04107 Leipzig
https://home.uni-leipzig.de/burr/
LREC Workshop: Leveraging Derived Text Formats to Unlock Copyrighted
Collections for Open Science
Palma de Mallorca
Final Call for Papers
The workshop Leveraging Derived Text Formats to Unlock Copyrighted
Collections for Open Science will be held at the Language Resources and
Evaluation Conference (LREC 2026).
Derived Text Formats (DTF), also known as extracted features, offer a
promising solution for enabling research on textual data that cannot be
shared in its original form due to copyright or privacy restrictions.
This workshop brings together researchers, legal experts, and
infrastructure providers to explore the creation, standardization, legal
framing, and scientific use of derived data in linguistics, digital
humanities, and language technology.
We invite contributions from the community that address practical
experiences, challenges, and solutions related to:
* The creation and processing of DTF
* Legal and ethical considerations in publishing derived data
* Use cases from digital humanities, linguistic research, corpus
linguistics, or NLP
* Infrastructure and tools supporting DTF flows
* Standardization efforts (e.g., TEI, SynAF, MAF, ISO standards)
The workshop will be held as a hybrid event. The exact workshop date
will be communicated in due time.
Submission Format
Submissions should be 4 to 8 pages in length (excluding references and
potential Ethics Statements). Submissions should follow the LREC
stylesheet, available on the conference website on theAuthor’s kit page
<https://lrec2026.info/authors-kit/> at
https://lrec2026.info/authors-kit/. Submissions will be reviewed by the
workshop organizers and the programme committee.
Important Dates
* Extended submission Deadline: 2 March 2026 (AoE)
* Reviewing period: – 10 March 2026
* Notification of Acceptance: 11 March 2026
* Camera Ready paper submission Deadline: 30 March 2026
* Workshop Date: 11, 12 or 16 May, 2026
Submission
Submissions will be handled via the submission system Softconf
<https://softconf.com/lrec2026/DTF> at https://softconf.com/lrec2026/DTF.
When submitting a paper from the START page, authors will be asked to
provide essential information about resources (in a broad sense, i.e.
also technologies, standards, evaluation kits, etc.) that have been used
for the work described in the paper or are a new result of your
research. Moreover, ELRA encourages all LREC authors to share the
described LRs (data, tools, services, etc.) to enable their reuse and
replicability of experiments (including evaluation ones)
Workshop Organisers
* Florian Barth, Göttingen State and University Library
* Keli Du, University of Trier
* José Calvo Tello, Göttingen State and University Library
* Philippe Genêt, German National Library
* Piroska Lendvai, Bavarian Academy of Sciences and Humanities
* Christof Schöch University of Trier
* Thorsten Trippel, University of Tübingen and Leibniz-Institut für
Deutsche Sprache
Contact
For questions, please contact:dtf-at-lrec2026@googlegroups.com
<mailto:dtf-at-lrec2026@googlegroups.com>
Updates
For updates, see
https://text-plus.org/en/aktuelles/veranstaltungen/2026-05-12-lrec-dtf/
The deadline for the CAWL workshop has been extended to February 23! Please
see the workshop website at https://sigwrit.org/workshops/cawl2026/ for
submission details. The call for papers follows.
The Third Workshop on Computation and Written Language (CAWL 2026) will be
held in conjunction with LREC 2026 as a half-day workshop on May 12th in
Palma, on the island of Mallorca, Spain. The workshop will feature an
invited talk, a tutorial on working with different writing systems, and
posters and presentations for submitted work. Annual CAWL workshops are
organized under the guidance of the ACL Special Interest Group on Writing
Systems and Written Language (SIGWrit).
We welcome submissions of scientific papers to be presented at the workshop
and archived in the ACL Anthology. Please see the submission guidelines
below and see the workshop webpage (https://sigwrit.org/workshops/cawl2026/)
for additional relevant information.
For the first time ever, CAWL will also feature a cash prize of $500 USD for
the best student submission.
Topics
Most work in NLP focuses on language in its canonical written form. This
has often led researchers to ignore the differences between written and
spoken language or, worse, to conflate the two. Furthermore, methods for
dealing with written language issues (e.g., various kinds of normalization
or conversion) or for recognizing text input (e.g. OCR & handwriting
recognition or text entry methods) are often regarded as precursors to NLP
rather than as fundamental parts of the enterprise, despite the fact that
most NLP methods rely centrally on representations derived from text rather
than (spoken) language. This general lack of consideration of writing has
led to much of the research on such topics to largely appear outside of ACL
venues, in conferences or journals of neighboring fields such as speech
technology (e.g., text normalization) or human-computer interaction (e.g.,
text entry).
This workshop will bring together researchers who are interested in the
relationship between written and spoken language, the properties of written
language, the ways in which writing systems encode language, and
applications specifically focused on characteristics of writing systems.
Topics of interest include but are not limited to:
-
Writing systems for less-resourced, Indigenous, and minoritized languages
-
Multi-writing system models
-
Text entry and tokenization
-
Processing abbreviations and homographs
-
Grapheme-to-phoneme conversion, transliteration, and diacritization
-
Text normalization for speech and for processing “informal'” genres of
text
-
Information-theoretic and machine-learning approaches to decipherment
-
Optical character (incl. handwriting) recognition and historical
document processing
-
Orthography for unwritten languages
-
Spelling error detection and correction
-
Script normalization and encoding
-
Writing system typology and its relevance to speech and language
processing
-
Properties of written language
-
Applications specifically focused on characteristics of writing systems
Important dates (all deadlines anywhere-on-earth time):
Paper submission deadline: February 23, 2026
Notification of acceptance: March 17, 2026
Camera-ready paper due: March 30, 2026
Workshop date: May 12, 2026
Submission Guidelines
Please submit short (4 page) or long (8 page) submissions in PDF format.
Both short and long paper submissions will be reviewed in the same process.
Authors should follow the formatting guidelines of LREC 2026, available in
the authors’ kit (https://lrec2026.info/authors-kit/). Note that, as with
the main conference, reviewing is double-anonymous, i.e., reviewers will
not know author identity and vice versa, hence no author information should
be included in the papers; self-reference that identifies the authors
should be avoided or anonymised. Accepted papers will appear in the
workshop proceedings in the ACL anthology.
Submissions will be accepted at https://softconf.com/lrec2026/CAWL/ between
now and February 23, 2026.
For questions about the submission guidelines, please contact workshop
organizers at cawl-2026-organizers(a)googlegroups.com.
[Sorry for cross-posting]
HISEMOTIONS 2026 focuses on emotion detection in Early Modern Spanish
epistolary texts (16th–17th centuries). Unlike most existing emotion
detection tasks that target modern languages or contemporary genres (e.g.,
social media, reviews), this challenge explores historical linguistic
variation and semantic shift, raising a new challenge for NLP systems.
Participants are tasked with developing systems that automatically identify
emotional content in fragments of historical Spanish letters, addressing
issues such as diachronic language variation, historical semantic shifting,
and domain adaptation. The shared task aims to foster methodological
progress at the intersection of Natural Language Processing (NLP) and Digital
Humanities.
🔗 *Task page / resources*:
https://github.com/albinasarymsakova/HISEMOTIONS_2026
<https://github.com/albinasarymsakova/HISEMOTIONS_2026?utm_source=chatgpt.com>
------------------------------
Why Participate?
-
Advance emotion detection methods in low-resource and historical domains.
-
Explore semantic change effects on automatic classification.
-
Benchmark systems on a novel historical dataset with expert human
annotation.
-
Cutting-edge research into the computational study of cultural heritage.
------------------------------
Shared Task Details
Task: Multi-label emotion detection
Each text fragment should be labelled with one or more of the following
emotions:
• joy • sadness • fear • anger • surprise • hope
Participants will predict binary labels for each emotion per fragment.
Systems will be evaluated against hidden gold labels using standard metrics
such as precision, recall, and macro-F1 score.
------------------------------
Important Dates
Task announcement
February 9, 2026
Development data release
February 12, 2026
Training data release
March 12, 2026
Test data release & evaluation start
March 27, 2026
Deadline for system submissions
April 30, 2026
Official results published
May 4, 2026
System description papers due
May 20, 2026
Review notification
June 19, 2026
Camera-ready papers due
July 1, 2026
IberLEF Workshop (presentations)
September 22, 2026
------------------------------
Contact & Submission
For updates, questions, or discussions about the task, please open issues
in the task GitHub repository or contact the organisers. Detailed
submission guidelines will be published alongside the data releases.
------------------------------
We look forward to your participation and contribution to this exciting
challenge!
— HISEMOTIONS 2026 Organising Committee
Dra. Albina Sarymsakova, CSIC-IEGPS-XuGa
Dra. Patricia Martín Rodilla, CSIC-IEGPS-XuGa
Dr. Eugenio Martínez Cámara, Universidad de Jaén
Dr. Alfonso Ureña López, Universidad de Jaén
--
Suelo trabajar a deshoras por lo que este correo puede haberte llegado
fuera de tu horario laboral, y al cual puedes responder en el momento que
mejor se ajuste a tus hábitos de trabajo. | I sometimes work at irregular
times and this email might arrive out of working hours so please be assured
that I respect your working pattern and look forward to your response when
it suits you.
[image: Universidad de Jaén] <https://www.ujaen.es/> Eugenio Martínez Cámara
Vicepresidente de la SEPLN <http://www.sepln.org/> | Vice President of the
SEPLN <http://www.sepln.org/en>.
Profesor Titular de Universidad | Associate Professor.
Investigador en Proc. del Lenguaje Natural | Postdoctoral Researcher in
Natural Language Proc.
Grupo de Investigación SINAI <http://sinai.ujaen.es/> | SINAI
<http://sinai.ujaen.es/> Research Group.
emcamara(a)ujaen.es
Código ORCID:0000-0002-5279-8355 <http://orcid.org/0000-0002-5279-8355>
Universidad de Jaén
Dpto. de Informática | Computer Science Department.
Edificio A3, despacho 145
| +34 953212883
<https://www.ujaen.es/servicios/sinformatica/sites/servicio_sinformatica/fil…>
[image: Universidad de Jaén] <https://www.ujaen.es/>
Este mensaje y los ficheros anexos son confidenciales dirigiéndose
exclusivamente al destinatario mencionado en el encabezamiento. Los mismos
contienen información reservada que no puede ser difundida. Si usted ha
recibido este correo por error, tenga la amabilidad de eliminarlo de su
sistema y avisar al remitente mediante reenvío a su dirección electrónica;
no deberá copiar el mensaje ni divulgar su contenido a ninguna persona.
Los datos personales facilitados por usted o por terceros serán tratados
por UNIVERSIDAD DE JAÉN, con la finalidad de gestionar y mantener los
contactos y relaciones que se produzcan como consecuencia de la relación
que mantiene con UJA. Normalmente, la base jurídica que legitima este
tratamiento, será su consentimiento, el interés legítimo o la necesidad
para gestionar una relación contractual o similar. El plazo de conservación
de sus datos vendrá determinado por la relación que mantiene con nosotros.
Para más información al respecto, o para ejercer sus derechos de acceso,
rectificación, cancelación/supresión, oposición, limitación o portabilidad,
dirija una comunicación por escrito a UNIVERSIDAD DE JAÉN, Campus Las
Lagunillas s/n. 23071 – Jaén, o a nuestro delegado de protección de datos [
dpo(a)ujaen.es]. En caso de considerar vulnerado su derecho a la protección
de datos personales, podrá interponer una reclamación ante el Consejo
Andaluz de Transparencia y Protección de Datos (www.ctpdandalucia.es).
Asimismo, es su responsabilidad comprobar que este mensaje o sus archivos
adjuntos no contengan virus informáticos, y en caso que los tuvieran
eliminarlos.