New Book Series: Corpus Linguistics and Technology-Mediated Language Education in the AI Era, Applied Linguistics Press
Corpus linguistics and technology-mediated language education in the AI era invites proposals for authored or edited volumes that advance
trustworthy, reproducible work at the intersection of corpora, digital learning environments and AI-supported language pedagogy. The series
encourages submissions that combine corpus design, annotation, analytics and AI-based learning ecosystems to improve educational
decision-making with traceable, verifiable data.
Topics of interest include corpus pedagogy and AI interfaces in education, multilingual and multimodal learning, accessibility, data-driven
learning (DDL), corpora and technology-mediated language education, Corpus-Based Language Pedagogy (CBLP), corpus literacy,
AI literacy, language data and AI, low-resource languages and corpus education, open datasets in corpus-based pedagogy, shareable code,
AI risk evaluation, learner metacognition in corpus education and teacher/learner agency in CBLP and DDL.
Corpus linguistics and technology-mediated language education in the AI era is open access and encourages global authorship. Proposals
submitted to the series will undergo initial evaluation by the ALP General Editor and the series Co-Editors and will then be sent out for external peer review.
Please send your proposals outlining aim(s), topics addressed, and how the volume fulfils the ALP mission to contribute to open science.
To submit a proposal for this book series, download the proposal template HERE<https://docs.google.com/document/d/1KK57vC0-hwqHfd7ilaJC7zngLigXTFGO/edit?r…> and return it when completed to the series Editors via email (pascualf(a)um.es and maqing(a)eduhk.hk).
We welcome proposals for monographs and edited volumes.
Series Editors: Pascual Pérez-Paredes (Universidad de Murcia) & Qing (Angel) Ma (The Education University of Hong Kong)
Applied Linguistics Press<https://www.appliedlinguisticspress.org/home> is a scholar-led digital publisher promoting open science, fair practice, and wider access, offering monographs and collections with multimedia features, founded in 2023 by Prof Luke Plonsky and run by volunteers.
Feel free to contact us if you’d like to discuss your idea.
Pascual Pérez-Paredes
https://webs.um.es/pascualf
Ninth Workshop on Universal Dependencies (UDW 2026)
May 2026, Palma de Mallorca, Spain (co-located with LREC 2026)
https://universaldependencies.org/udw26/
Universal Dependencies (UD, https://universaldependencies.org) is a
framework for cross-linguistically consistent treebank annotation that
has so far been applied to over 180 languages. The framework aims to
capture similarities as well as idiosyncrasies among typologically
different languages (e.g., morphologically rich languages, pro-drop
languages, and languages featuring clitic doubling). The goal in
developing UD was not only to support comparative evaluation and
cross-lingual learning but also to facilitate multilingual natural
language processing, enable comparative linguistic studies, and
provide resources for language model understanding and evaluation.
The Universal Dependencies Workshop series was started to create a
forum for discussion of the theory and practice of UD, its use in
research and development, and its future goals and challenges. Some of
the previous workshops have been co-located with COLING, EMNLP, and
SyntaxFest. We invite papers on all topics relevant to UD, including
but not limited to:
- Theoretical foundations and universal guidelines
- Linguistic analysis of specific languages and/or constructions
- Language typology and linguistic universals
- Treebank annotation, conversion, and validation
- Word segmentation, morphological tagging and syntactic parsing
- Use of UD data for evaluating or understanding language models
- Linguistic studies based on the UD data
Priority will be given to papers that adopt a cross-lingual perspective.
## Important Dates
- Paper submission deadline: February 16, 2026
- Notification of acceptance: March 16, 2026
- Camera-ready version due: March 30, 2026
- Conference dates: May 11-16, 2026
We invite submissions in two formats:
- Regular (long) papers up to 8 pages of content
(excluding references and appendices). Regular papers should present
substantial, original, and unpublished research, including empirical
evaluation results where appropriate.
- Short papers up to 4 pages of content (excluding references and
appendices). Short papers may offer smaller, focused contributions,
such as work in progress, negative results, surveys, or opinion
pieces.
We also welcome non-archival papers, defined as work that has already
been published or accepted for publication at another computational
linguistics venue. These papers may be presented at the workshop but
will not appear in the LREC 2026 Workshop Proceedings.
Accepted papers will be given one additional page to address reviewer
comments.
## Paper Submission, Review Process and Selection Criteria
Submissions will be handled via the START Conference Manager. The
submission link will be provided on the workshop website as soon as it
becomes available. Papers should describe original work; they should
emphasise completed work rather than intended work, and should
indicate clearly the state of completion of the reported results.
Submissions will be judged on correctness, originality, technical
strength, significance and relevance to the conference, and interest
to the attendees.
All submissions should follow the two-column LREC style guidelines. We
strongly recommend the use of the LaTeX style files, OpenDocument, or
Microsoft Word templates created for LREC:
<https://lrec2026.info/authors-kit/>. Unlike LREC main conference
submissions, UDW submissions are allowed to include appendices, and
the UDW makes a distinction between short (up to four pages) and long
papers (up to eight pages). All papers must be anonymous, i.e., not
reveal author(s) on the title page or through self-references. So,
e.g., “We previously showed (Smith, 2020) …”, should be avoided.
Instead, use citations such as “Smith (2020) previously showed …”.
All papers will undergo a double-blind peer review process, with final
acceptance decisions made by the workshop chairs. Submissions
that violate the requirements above will be rejected without review.
## LRE-Map and Sharing Language Resources
When submitting a paper from the START page, authors will be asked to
provide essential information about resources (in a broad sense, i.e.
also technologies, standards, evaluation kits, etc.) that have been
used for the work described in the paper or are a new result of your
research. Moreover, ELRA encourages all LREC authors to share the
described LRs (data, tools, services, etc.) to enable their reuse and
replicability of experiments (including evaluation ones).
## Presentation Format
Accepted papers will be presented as oral or poster presentations. The
mode of presentation will be determined by the workshop chairs and
does not reflect the quality of the submission.
Accepted papers will be published in the LREC 2026 Workshop Proceedings.
## Organizing committee
Çağrı Çöltekin, Tübingen University
Kaja Dobrovoljc, University of Ljubljana & Jozef Stefan Institute
Joakim Nivre, Uppsala University
Event: 12th Workshop on the Representation and Processing of Sign Languages (sign-lang@LREC 2026)
Submission deadline: 14 February 2026
Workshop date: 16 May 2026
Website: https://www.sign-lang.uni-hamburg.de/lrec2026/
Submission page: tbd
CALL FOR PAPERS
Submissions are invited for a full day workshop on sign language resources and technologies, to take place on 16 May 2026 as a satellite event of LREC 2026 in Palma de Mallorca, Spain.
During the past years, a number of large-scale sign language corpus projects have started. Some have already been completed, but many more projects are about to start. At the same time, sign language technologies are maturing and are promising to support the time-consuming basic annotation. The workshop aims at bringing together those researchers who already work with multimodal sign language corpora (and those who see the need for empirical underpinnings of their current research) with those who develop sign language technologies. It provides the platform to compare competing approaches.
As sign language resource technologies build to a large extent on methodologies and tools used in the language resource community in general, but add very specific perspectives (e.g. no established writing system, use of video as data source) and work with a different modality of human language, sign language research is able to feed back to the language resource community at large. At the same time, as the raw data are in the visual domain, the field naturally bridges into Computer Vision. Thus, researchers use Machine Learning methods on both visual and linguistic data.
We invite submissions of papers to be presented either on stage (20 minutes plus 10 minutes discussion) or as posters (with or without demonstrations) on the following topics:
2026 SPECIAL TOPIC: LANGUAGE IN MOTION
Motion is at the core of sign languages, both literally, through their existence in the visual-gestural modality, and figuratively, in how their communities drive language change. Equally, sign language research must stay in motion, adapting to new insights and technological possibilities, advancing how we create and use resources, evolving the capabilities of tools, and pushing the boundaries of what can be expected from the field, both technologically and ethically. We especially invite contributions relating to the representation and processing of sign languages that address these various facets of language in motion, but also welcome papers on other general issues relating to sign language resources and technologies.
GENERAL ISSUES ON SIGN LANGUAGE CORPORA AND TOOLS
• Evaluation of sign language resources
• Experiences in building sign language corpora
• Elicitation methodology appropriate for corpus collection
• Proposals for standards for linguistic annotation or for metadata descriptions
• Experiences from linguistic research using corpora
• Use of (parallel) corpora and lexicons in translation studies and machine translation
• Avatar technology as a tool in sign language corpora and corpus data feeding into advances in avatar technology
• Language documentation and long-term accessibility for sign language data
• Annotation and visualization tools
• Linking corpora and lexicons and integrated presentation of corpus and dictionary contents
• “Internet as a corpus” for sign languages
• Sign language corpus mining
• Crowd and community sourcing for corpus work
• Multi-lingual sign language resources and connecting sign language resources to language resources for spoken languages
• Language change and how it relates to resource creation, corpus-driven linguistic research, and language technologies
In the tradition of LREC, oral/signed presentations and poster presentations (with or without demonstrations) have equal status, and authors are encouraged to suggest the presentation format best suited to communicate their ideas. Papers (4–8 pages) of all accepted submissions to this workshop will be published as workshop proceedings on the conference website, independent of whether you have a poster or an oral/signed presentation. The workshop does not differentiate between long, short, or position papers.
Please submit your paper through the LREC START system (link tbd) no later than 14 February 2026, indicating whether you prefer an oral/signed presentation, a poster presentation or a poster presentation with demo. Unlike the main conference, the workshop will be reviewed single-blind, so submissions SHOULD NOT BE ANONYMOUS. In all other respects, submissions should follow the LREC 2026 style guide (https://lrec2026.info/authors-kit/).
ATTENTION Please note that you are expected to submit the full paper, not an extended abstract as in previous years!
IMPORTANT DATES
• Deadline for submissions: 14 February 2026 (11:59PM UTC-12:00 “anywhere on Earth”)
• Notification of acceptance: 16 March 2026
• Early bird registration ends: tbd
• Camera ready version of the paper (for both oral/signed presentations and posters): 27 March 2026
• Submission of slides for interpreters' preparation (oral/signed presentations only): 6 May 2026
• This workshop: 16 May 2026
• LREC main conference: 13–15 May 2026
• LREC workshops: 11, 12 & 16 May 2026
Workshop on Learning Non-Literal Expressions with Small Data
To be held in conjunction with LREC 2026, Palma de Mallorca, Spain on 11
May 2026.
Overview
Non-Literal Expressions (NLEs) in natural language are a reflection of
fundamental cognitive processes such as analogical reasoning and
categorisation, and are deeply rooted in everyday communication. NLE
understanding is therefore an essential task for language modeling. This
task is especially challenging because it cannot be tackled by falling
back on individual word meanings, but requires taking into account
larger chunks of surrounding text or even contextual information. At the
same time, it is important because the reliable processing of NLEs is
relevant for optimizing downstream tasks like translation and
summarization.
This workshop focuses on the understanding of Non-Literal Expressions. While
most of the earlier work on NLEs was devoted to metaphor and
metonymy, recent activities target other forms of NLEs as well, e.g.,
hyperbole (deliberate exaggeration), litotes (understatement),
rhetorical questions, and irony. Human-annotated corpora for NLEs have
very recently started becoming available to the research community and
may serve as the basis for data-driven approaches to NLE processing,
with the interrelated goals of first identifying and then interpreting
such expressions. Such data is mostly of high linguistic quality, but
still very limited in size. Thus, the workshop’s focus is on the adaptation
of Language Models (LMs) and Deep Learning (DL) for processing
Non-Literal Expressions with limited high-quality data, since such
constructs still pose major identification and processing challenges in
natural language analysis tasks.
Topics of Interest
We are interested in contributions that focus on techniques
like self-training for leveraging unlabelled data, on the
incorporation of external linguistic resources and
knowledge injection to enrich features, and on
multitask learning that benefits from related tasks.
The workshop also aims to discuss alternative approaches that
use pre-trained Language Models (LMs) as a
foundation and apply techniques like contrastive learning
and clustering to identify challenging examples within the data. The
ultimate aim of the workshop is to highlight the necessity of
high-quality data, as well as cross-lingual datasets.
Invited Speakers
- Prof. Barbara Plank, LMU Munich (https://bplank.github.io/)
- Dr. Debanjan Ghosh, Princeton, USA
Details will be announced on the workshop website (tba).
Submission Guidelines
Papers must be submitted electronically through Softconf: [link to
come]. Submissions should:
• Be 4–8 pages, excluding references and optional Ethics Statements
• Follow the LREC 2026 style guidelines, available on the conference
website: https://lrec2026.info/authors-kit/
• Use templates provided here:
https://lrec2026.info/calls/second-call-for-papers/
Authors will be asked to supply information on any language resources
(broadly defined — data, tools, standards, evaluation sets, etc.) used
in or resulting from their work. ELRA strongly encourages sharing such
resources to support reproducibility and reuse.
Accepted papers will appear in the workshop proceedings. Presentation
format (oral/poster) will be based solely on how best to communicate the
work.
Important Dates
• 20 February 2026 — Submission Deadline
• 11 March 2026 — Notification of Acceptance
• 28 March 2026 — Camera-ready Papers Due
Endorsements
The workshop is endorsed by: Collaborative Research Centre 1412
"REGISTER" funded by the DFG Deutsche Forschungsgemeinschaft (German
Research Foundation)
Organizers
• Markus Egg — Humboldt-Universität zu Berlin, Germany
• Valia Kordoni - Humboldt-Universität zu Berlin, Germany
Contact: kordonie at rz.hu-berlin.de
First Call for Papers: Joint Workshop on Legal and Ethical Issues in Human Language Technologies (LEGAL2026) and Computational Approaches to Language Data Pseudonymization, Anonymization, De-identification, and Data Privacy (CALD-pseudo 2026)
Website: https://legal2026.mobileds.de/
We invite submissions to the Joint Workshop on Legal and Ethical Issues in Human Language Technologies (LEGAL2026) and Computational Approaches to Language Data Pseudonymization, Anonymization, De-identification, and Data Privacy (CALD-pseudo 2026), to be held at LREC 2026 on the 12th of May 2026.
Important Dates
* 20th of February 2026: paper submission deadline
* 30th March 2026: camera ready deadline (strict)
* 12th May 2026: workshop date
Introduction
Access to text and speech data is essential for research, yet personal and sensitive information often prevents open sharing. Techniques such as pseudonymization and anonymization offer potential solutions, but their effectiveness, limitations, and impact on data utility require deeper investigation. Balancing privacy protection with meaningful scientific use remains a key challenge.
At the same time, legal and ethical requirements increasingly shape how language resources can be created, processed, and distributed. Regulatory frameworks, such as the GDPR, the Data Act, and the Artificial Intelligence Act, affect access, reuse, and documentation duties for both text and speech data, creating a complex environment that demands interdisciplinary insight.
The workshop brings these two perspectives together by addressing both the technical and practical aspects of de-identification as well as the legal and ethical obligations governing data handling. Topics include anonymization and pseudonymization methods, compliance in practical workflows, provenance and rights tracking, and emerging approaches to legal metadata. The goal is to foster responsible, legally sound, and technically robust innovation in human language technologies.
Topics of Interest
We invite contributions from all disciplines involved in the creation, processing, governance, and de-identification of text and speech data. Submissions may address theoretical, empirical, methodological, legal, or technical questions, including cross-disciplinary work. We particularly encourage research on less-represented languages and on data from under-represented communities.
1. Legal Aspects of Language Data (LEGAL2026)
* Regulatory frameworks and global governance
* Intellectual property, data protection, and LLM governance
* Ethics, fairness, trust, and transparency
* Compliance in practice
* Ethics, fairness, and trust
* Operationalizing compliance
* Emerging and grey areas
* Interdisciplinary and cross-border coordination
2. Pseudonymization, Anonymization, and De-identification: Theoretical, Methodological, and Technical Aspects (CALD-pseudo 2026)
* Detection and classification of personal information (PI)
* Replacement and transformation of PI
* Utility and bias after de-identification
* Approaches to evaluation and adversarial testing
* Dataset creation for de-identification research
* Low-resource scenarios
* Speech-specific challenges
* Cross-disciplinary applications and challenges
We invite submissions from fields where de-identification of data plays an important role, including but not limited to Computational Linguistics, Applied Linguistics, Corpus Linguistics, Digital Humanities, Social Sciences, Political Sciences, Medical Science etc., from the perspectives of researchers, public organizations, and industry.
Submission Guidelines
Authors are invited to submit original and unpublished research papers in the following categories:
* Long papers (up to 8 pages) for substantial contributions
* Short papers (up to 4 pages) for:
  * Small, focused contributions or ongoing or preliminary work
  * Extended abstracts for non-technical submissions only, such as conceptual, theoretical, legal, ethical, policy-oriented, or position papers. Extended abstract submissions are expected to be developed into regular papers by the camera-ready submission deadline.
The full papers will be published as workshop proceedings along with the LREC main conference. They should follow the LREC stylesheet, which is available on the conference website on the Author’s kit<https://lrec2026.info/authors-kit/> page.
Submission deadline: 20th of February 2026
The submission link will be provided in due time on the workshop website.
When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research.
Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.) to enable their reuse and replicability of experiments (including evaluation ones).
Keynote Talks
We are delighted to announce the workshop will host keynote talks from two speakers:
* Paweł Kamocki, Leibniz-Institut für Deutsche Sprache, Germany
* Ivan Habernal, Ruhr University Bochum, Germany
Workshop Organizers
LEGAL 2026:
* Ingo Siegert, Otto-von-Guericke Universität Magdeburg, Germany
* Paweł Kamocki, Leibniz-Institut für Deutsche Sprache, Germany
* Kossay Talmoudi, ELDA, France
* Khalid Choukri, ELDA, France
CALD-pseudo 2026
* Maria Irena Szawerna, University of Gothenburg, Sweden
* Simon Dobnik, University of Gothenburg, Sweden
* Therese Lindström Tiedemann, University of Helsinki, Finland
* Pierre Lison, Norwegian Computing Center & University of Oslo, Norway
* Ildikó Pilán, Norwegian Computing Center, Norway
* Ricardo Muñoz Sánchez, University of Gothenburg, Sweden
* Lisa Södergård, University of Helsinki, Finland
* Elena Volodina, University of Gothenburg, Sweden
* Xuan-Son Vu, Lund University & DeepTensor AB, Sweden
Program Committee
A list of program committee members is available on the workshop webpage.
Contact
For general inquiries, please contact mail(a)legal2026.mobileds.de
Best regards,
Maria Irena Szawerna
____________________
PhD student
Språkbanken Text<https://spraakbanken.gu.se/>
Institutionen för svenska, flerspråkighet och språkteknologi<https://www.gu.se/svenska-spraket>
UNIVERSITY OF GOTHENBURG<https://www.gu.se/>
https://spraakbanken.gu.se/om/personal/maria-szawerna
Dear colleagues,
We are delighted to announce SemEval-2026 Task 3 Track B: Dimensional
Stance Analysis
*Aspect-Based Sentiment Analysis (ABSA)* is a widely used technique for
analyzing people’s opinions and sentiments at the aspect level. However,
current ABSA research predominantly adopts a coarse-grained, categorical
sentiment representation (e.g., positive, negative, or neutral). This
approach stands in contrast to long-established theories in psychology and
affective science, where sentiment is represented along fine-grained,
real-valued dimensions of valence (ranging from negative to positive) and
arousal (from sluggish to excited). This valence-arousal (VA)
representation has inspired the rise of dimensional sentiment analysis as
an emerging research paradigm, enabling more nuanced distinctions in
emotional expression and supporting a broader range of applications.
Given an utterance or post and a target entity, stance detection involves
determining whether the speaker is in favor of or against the target. *This
track reformulates stance detection as a Stance-as-DimABSA task with the
following transformations:*
*1. The stance target is treated as an aspect.*
*2. Discrete stance labels are replaced with continuous VA scores.*
Building on this, we introduce *Dimensional Stance Analysis (DimStance)*, a
Stance-as-DimABSA task that reformulates stance detection under the ABSA
schema in the VA space. This new formulation extends ABSA beyond consumer
reviews to public-issue discourse (i.e., politics and environmental
protection) and also generalizes stance analysis from categorical labels to
continuous VA scores. Given a text and one or more aspects (targets),
predict a real-valued valence-arousal (VA) score for each aspect,
reflecting the stance expressed by the speaker toward it.
———————
*Languages*
———————
*We provide data in 5 languages*, including: German (deu), English (eng),
Hausa (hau), Swahili (swa), and Chinese (zho)
———————
*Evaluation*
———————
Root mean squared error (RMSE) is used as the evaluation metric.
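
For orientation, a minimal Python sketch of how an RMSE over valence-arousal predictions could be computed. The function name va_rmse, the toy scores, and the choice to pool both dimensions into a single average are assumptions made for illustration only; the official scorer may normalise differently, so please rely on the task website for the authoritative definition.

```python
import math


def va_rmse(gold, pred):
    """RMSE over paired (valence, arousal) scores.

    Assumption: squared errors of both dimensions are pooled and
    averaged over 2 * N values. The official task scorer may use a
    different normalisation.
    """
    if len(gold) != len(pred) or not gold:
        raise ValueError("gold and pred must be non-empty and equally long")
    squared_error = 0.0
    for (gold_v, gold_a), (pred_v, pred_a) in zip(gold, pred):
        squared_error += (gold_v - pred_v) ** 2 + (gold_a - pred_a) ** 2
    return math.sqrt(squared_error / (2 * len(gold)))


# Hypothetical gold and predicted (valence, arousal) pairs, one per aspect.
gold_scores = [(6.5, 4.0), (2.0, 7.5)]
pred_scores = [(6.0, 4.5), (3.0, 7.0)]
print(round(va_rmse(gold_scores, pred_scores), 4))
```

Lower values indicate predictions closer to the gold annotations; identical predictions give 0.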
———————
*Participation*
———————
*Website* (check out details):
https://github.com/DimABSA/DimABSA2026
*Codabench* (register and submit results)
- Track B: https://www.codabench.org/competitions/11139/
*Discord* (community and discussion)
https://discord.gg/xWXDWtkMzu
*Google Group* (official updates):
https://groups.google.com/g/dimabsa-participants
———————
*Important Dates*
———————
- Sample Data Ready: 15 July 2025
- Training Data Ready: 30 September 2025
- Evaluation Start: 12 January 2026
- Evaluation End: 30 January 2026
- System Description Paper Due: February 2026
- Notification to Authors: March 2026
- Camera Ready Due: April 2026
- SemEval Workshop 2026: co-located with ACL 2026 (San Diego, CA, USA)
We warmly invite the community to participate in this exciting shared task
and contribute to advancing NLP research.
Best regards,
SemEval-2026 Task 3 Organizers
***********************************************************************************
The 6th workshop on: "Resources and ProcessIng of linguistic, para-linguistic and extra-linguistic Data from
people with various forms of cognitive/psychiatric/developmental impairments" in collaboration with the MENTAL.ai consortium
Workshop: co-located with LREC 2026 | Palma de Mallorca, Spain | May 12th, 2026
RaPID-6@MENTAL.ai serves as an interdisciplinary platform for researchers to exchange insights, methods, and experiences related to collecting and processing data from individuals with mental, cognitive, neuropsychiatric, or neurodegenerative impairments. The workshop focuses on creating, processing, and applying such data resources from individuals at different stages and severity levels of these impairments. The ultimate goal of RaPID-6@MENTAL.ai is to facilitate the study of relationships among linguistic, paralinguistic, and extra-linguistic observations, with applications ranging from aiding diagnosis to enhancing monitoring and predicting individuals at higher risk, ultimately promoting multidisciplinary collaboration across clinical/medical, language technology, computational linguistics, and computer science communities.
Workshop date: Tue., 12th of May 2026
Submission deadline: Sun., 22nd of February, 2026 (anywhere on earth)
Paper submission: <SOFTCONF-TBA>
Invited Speakers: Prof. Brian MacWhinney, Carnegie Mellon University, USA, and Assoc. Prof. Sunny X. Tang, MD, Feinstein Institutes for Medical Research, Northwell Health, USA.
Website and details: https://spraakbanken.gu.se/en/rapid-2026
Contact: Dimitrios Kokkinakis
Contact email: dimitrios.kokkinakis(a)gu.se
Organizing committee:
* Dimitrios Kokkinakis, University of Gothenburg, Sweden
* Charalambos Themistocleous, University of Oslo, Norway
* Gaël Dias, University of Caen Normandie, France
* Kathleen C. Fraser, University of Ottawa, Canada
* Fredrik Öhman, University of Gothenburg and Sahlgrenska University Hospital, Sweden
* Sebastião Pais, University of Beira Interior, Portugal
************************************************************************************
We have released a public anonymized dataset and LLMs of mental health support conversations in Hebrew and Arabic. Thanks to the Israeli Innovation Authority for their support!
https://lnkd.in/eYbPhN2y
Sincerely
Kobi Gal
https://ailab.ise.bgu.ac.il/
Workshop on Dialects in NLP: A Resource Perspective
To be held in conjunction with LREC 2026, Palma de Mallorca, Spain on 11, 12 and 16 May 2026.
Website: https://dialres.github.io/dialres/
Overview
DialRes-LREC26 addresses the growing need for high-quality resources supporting dialect-focused NLP. The workshop aims to bring together researchers from linguistics, computational linguistics, digital humanities, and adjacent fields to exchange insights on the creation, documentation, evaluation, and use of dialectal resources.
Topics of Interest
We invite submissions relating to any aspect of developing or using resources for dialectal NLP. Topics include — but are not limited to — the following:
• Creation and evaluation of spoken and written dialect resources
• Orthographic normalization and standardization
• Treatment of dialect–standard distinctions in annotation frameworks for speech and text
• Cross-dialect and cross-lingual transfer; model adaptation methods
• Scalability issues and resource-efficient techniques
• Use of LLMs in resource creation, augmentation, annotation, or processing
• Resources supporting dialect preservation, revitalization, and community engagement
• Pedagogical, sociolinguistic, and linguistic applications viewed through a resource lens
• Practical considerations when working with dialect resources (legal, financial, academic, societal)
• Empowering dialect communities in developing their own resources
Invited Speaker
Prof. Barbara Plank, LMU Munich (https://bplank.github.io/)
Details will be announced on the workshop website.
Submission Guidelines
Papers must be submitted electronically through Softconf: [link to come]. Submissions should:
• Be 4–8 pages, excluding references and optional Ethics Statements
• Follow the LREC 2026 style guidelines, available on the conference website:
https://lrec2026.info/authors-kit/
• Use templates provided here: https://lrec2026.info/calls/second-call-for-papers/
Authors will be asked to supply information on any language resources (broadly defined — data, tools, standards, evaluation sets, etc.) used in or resulting from their work. ELRA strongly encourages sharing such resources to support reproducibility and reuse.
Accepted papers will appear in the workshop proceedings. Presentation format (oral/poster) will be based solely on how best to communicate the work.
For inquiries: dialres-lrec26(a)googlegroups.com
Important Dates
• 20 February 2026 — Submission Deadline
• 11 March 2026 — Notification of Acceptance
• 28 March 2026 — Camera-ready Papers Due
Endorsements
The workshop is endorsed by:
• UniDive COST Action CA21167, which supports work on language diversity and resource development
• Archimedes/Athena RC, a major AI research hub in Greece with strong academic and industrial collaborations
Organizing Committee
• Antonios Anastasopoulos — George Mason University / Archimedes–Athena RC
• Stella Markantonatou — ILSP / Archimedes–Athena RC
• Angela Ralli — University of Patras / Archimedes–Athena RC
• Marcos Zampieri — George Mason University
• Stavros Bompolas — Archimedes–Athena RC
• Vivian Stamou — Archimedes–Athena RC