Dear colleagues,
Please find below a link to the Survey of English Usage Annual Report for 2025:
https://www.ucl.ac.uk/arts-humanities/english/research/survey-english-usage…
All the best,
Bas
Prof. Bas Aarts
Department of English Language and Literature UCL
Substack: https://basaarts.substack.com/
Continuous Professional Development and INSET courses for teachers: https://bit.ly/39qnKIH
X: @UCLEnglishUsage and @EngliciousUCL
Note: I respect your work/life balance. If I send you an email outside of your normal working hours there is no expectation that you will read or respond to the message at that time.
Here, history happens.
[image.png]
The Information Disorder Workshop
Collocated with LREC 2026 in Palma de Mallorca, Spain
https://information-disorder-workshop.github.io/
* February 24: Paper submission
* March 17: Notification of acceptance
* March 30: Camera-ready submission
* May 12, 2026: InDor at LREC!
Online disinformation is a pressing challenge for our societies. Its role in influencing elections (Allcott & Gentzkow, 2017) and behaviours (van der Linden et al., 2020) has gathered the attention of different societal actors aimed at mitigating its negative impact.
The Natural Language Processing (NLP) community is contributing to fighting this phenomenon with a growing number of datasets (Hussain et al., 2025) and technologies (VeraAI, AskVera, Bellingcat) (Lupi et al., 2023; Wuhrl et al., 2023) for the automatic recognition of fake news. However, this field of research suffers from a lack of a common theoretical framework, which causes a fragmentation of approaches. The increasing attention of the NLP community to human-label variation (Plank, 2022) raises additional challenges regarding the cross-cultural and pragmatic implications that determine the spreading of disinformation (Dabbous et al., 2022).
The goal of the Information Disorder (InDor) workshop is to promote an interdisciplinary and intersectorial discussion towards the development of NLP research on disinformation.
Information Disorder is a recent framework introduced by Wardle and Derakhshan (2017) to organize theories, definitions, and approaches for the study of disinformation.
The framework is characterized by two main pillars: 1) acknowledging the need to categorize fake news under a finer-grained taxonomy of disorders (mis-information, dis-information, and mal-information); 2) exploring the role of the contextual factors that determine the spreading of fake news.
InDor aims to
Define a common theoretical ground for the research on disinformation in NLP and beyond
Discuss the cultural factors determining subjectivity to disinformation
Promote interdisciplinarity in the development of datasets and models
Discuss the impact of real-world applications to contrast disinformation
The InDor workshop (half-day duration) will be co-located with the fifteenth biennial Language Resources and Evaluation Conference (LREC) held at the Palau de Congressos de Palma in Palma de Mallorca, Spain, on 11-16 May 2026.
Submissions
When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.) to enable their reuse and replicability of experiments (including evaluation ones). In addition, authors will be required to adhere to ethical research policies on AI and may include an ethics statement in their papers.
The papers should be submitted as a PDF document, conforming to the formatting guidelines provided in the call for papers of the LREC conference. Templates are provided here https://lrec2026.info/authors-kit/
We accept three types of submissions:
Regular research papers;
Non-archival submissions: like research papers, but will not be included in the proceedings;
(Non-archival) research communications: 1-page abstracts summarising relevant research published elsewhere.
InDor will also accept submissions that have been rejected from ACL rolling review, provided they are accompanied by their reviews, and they fit the topic of the workshop.
Research papers (archival or non-archival) may consist of up to 8 pages of content. Research communications may consist of up to 1 page of content. Please make the submission here: https://softconf.com/lrec2026/InDor26/
Topics
We invite original research papers specifically on the following topics, with a particular focus on resources, taxonomies, and benchmarks for the evaluation of NLP systems on Information Disorder:
new interdisciplinary theoretical proposals and foundational aspects
surveys on Information Disorder
multiculturality and multilinguality in datasets and technologies
interdisciplinary computational methods and frameworks
community- and user-centred approaches
real-world applications to contrast false information
experimental applications and projects for social good
evaluation of Information Disorder-focused systems
generative approaches to contrast false information
participatory approaches
positions on Information Disorder
Submissions are open to all and are to be submitted anonymously (and must conform to the instructions for double-blind review). All papers will be refereed through a double-blind peer review process by at least three reviewers, with final acceptance decisions made by the workshop organisers. Scientific papers will be evaluated based on relevance, significance of contribution, impact, technical quality, scholarship, and quality of presentation.
Attendance
At least one author of each accepted paper is required to participate in the conference and present the work, in-person or online.
Workshop organisers:
Simona Frenda, Heriot-Watt University
Marco Antonio Stranisci, University of Turin
Shaina Ashraf, Phillips University of Marburg
Ada Ren, Macquarie University
Ioannis Konstas, Heriot-Watt University
Usman Naseem, Macquarie University
Contact us at s.frenda(a)hw.ac.uk if you have any questions.
Website: https://information-disorder-workshop.github.io/
We'd like to invite you to participate in the following shared tasks at PAN
2026 held in conjunction with the CLEF conference in Jena, Germany.
1. Voight-Kampff Generative AI Detection.
Given a (potentially obfuscated) text, decide whether it was written by a
human or an AI.
https://pan.webis.de/clef26/pan26-web/generated-content-analysis.html
2. Text Watermarking.
Insert a watermark into a given text. Then, after we have attacked the
text, detect the inserted watermark.
https://pan.webis.de/clef26/pan26-web/text-watermarking.html
3. Multi-Author Writing Style Analysis.
Given a document, determine at which positions the author changes.
https://pan.webis.de/clef26/pan26-web/style-change-detection.html
4. Generative Plagiarism Detection.
Given a document and a collection of documents, your task is to identify
all sources in the collection that the document plagiarizes.
https://pan.webis.de/clef26/pan26-web/generated-plagiarism-detection.html
5. Reasoning Trajectory Detection.
Detect the source and safety of LLM-generated and human-written reasoning
trajectories.
https://pan.webis.de/clef26/pan26-web/reasoning-trajectory-detection.html
More information:
https://pan.webis.de/clef26/pan26-web/index.html
Important Dates
--------------------------
now Training Data Released
April 23, 2026: Registration closes
May 07, 2026: Software submission deadline
May 28, 2026: Participant paper submission
June 30, 2026: Peer review notification
July 06, 2026: Camera-ready participant papers submission
September 21-24, 2026: CLEF Conference
Links
--------------------------
PAN: https://pan.webis.de
Contact: pan(a)webis.de
We are looking forward to your submission!
The PAN team
Dear all,
The TurkuNLP research group at the University of Turku, Finland, invites applications for 2–4 fixed-term postdoctoral researcher positions in Digital Linguistics and/or Natural Language Processing (NLP). Employment begins in March–April 2026 at the earliest (or as agreed) and continues for 24 months or until the end of May 2028.
The positions are part of several research projects targeting web language use and massively multilingual web-crawled data. All projects are funded by the Research Council of Finland, including the newly selected Centre of Excellence Human Diversity through Contacts.
The research topics include, but are not limited to:
- Modeling and linguistically analyzing language use across varieties such as registers (genres) in massively multilingual web-scale data, using multilingual deep learning methods, LLMs, and corpus-linguistic statistical approaches
- Developing NLP methods for advanced semantic analysis and labeling of multilingual web data
- Theoretical and methodological advances for web language use and web-as-corpus research
The closing date for applications is February 26, 2026 at 16:00 (Europe/Helsinki).
For more information, please contact me or visit
https://ats.talentadore.com/apply/tutkijatohtorin-maaraaikaisia-tehtavia-di…
and
https://sites.utu.fi/humandiversity/news/positions-open-in-human-diversity-….
Best regards,
Veronika Laippala
11th Symposium on Corpus Approaches to Lexicogrammar (LxGr2026)
CALL FOR PAPERS
Deadline for abstract submission: 1 March 2026
The symposium will take place online on Thursday 2 and Friday 3 July 2026
Invited Speakers
Stefan Gries<https://www.stgries.info/> (University of California, Santa Barbara, USA)
Martin Hilpert<http://members.unine.ch/martin.hilpert> (University of Neuchâtel, Switzerland)
Serge Sharoff<https://ssharoff.github.io/> (University of Leeds, UK)
Organiser: Costas Gabrielatos<https://ehu.ac.uk/gabrielatos> (Edge Hill University)
LxGr primarily welcomes papers reporting on corpus-based research on any aspect of the interaction of lexis and grammar -- particularly studies that interrogate the system lexicogrammatically to get lexicogrammatical answers. However, position papers discussing theoretical or methodological issues, as well as descriptions or demonstrations of tools or resources are also welcome, as long as they are relevant to both lexicogrammar and corpus linguistics.
If you would like to present, send an abstract of 500 words (excluding references) to lxgr(a)edgehill.ac.uk<mailto:lxgr@edgehill.ac.uk>.
* Abstracts for research papers should specify the research focus (research questions or hypotheses), the corpus, the methodology (techniques, metrics), the theoretical orientation, and the main findings.
* Abstracts for position papers should specify the theoretical orientation and the potential contribution to both lexicogrammar and corpus linguistics.
* Abstracts for tools or resources should provide a clear description of the main functions, and specify the potential contribution to both lexicogrammar and corpus linguistics.
Full papers will be allocated 35 minutes (including 10 minutes for discussion).
Work-in-progress reports will be allocated 20 minutes (including 5 minutes for discussion).
There will be no parallel sessions.
Participation is free.
For details, visit the LxGr website: https://sites.edgehill.ac.uk/lxgr
If you have any questions, please contact lxgr(a)edgehill.ac.uk<mailto:lxgr@edgehill.ac.uk>.
________________________________
Edge Hill University<http://ehu.ac.uk/home/emailfooter>
Modern University of the Year, The Times and Sunday Times Good University Guide 2022<http://ehu.ac.uk/tef/emailfooter>
University of the Year, Educate North 2021/21
________________________________
This message is private and confidential. If you have received this message in error, please notify the sender and remove it from your system. Any views or opinions presented are solely those of the author and do not necessarily represent those of Edge Hill or associated companies. Edge Hill University may monitor email traffic data and also the content of email for the purposes of security and business communications during staff absence.<http://ehu.ac.uk/itspolicies/emailfooter>
Dear all,
We are looking to hire a PhD student for a term of three years at the
Chair for Fundamentals of Natural Language Processing
(https://www.uni-bamberg.de/en/nlproc/) at the University of Bamberg in
Germany, starting either May 1 2026 or as soon as possible thereafter.
The position will be part of the project "Relating Probabilities of
Words to Probabilities of Worlds (PoWPoW)", to be funded by the German
Research Foundation (DFG) as part of the Priority Programme
LaSTing (https://www.lasting-spp.org/). The project’s goal is to
investigate how LLMs represent and reason about probabilistic world
knowledge. Within this scope, the project will explore methods for
eliciting probability judgments from LLMs, test how well such judgments
agree with empirical probabilities, investigate the internal consistency
of related probability judgments, and compare LLMs’ probabilistic
reasoning to that of humans.
The full details can be found here:
https://www.uni-bamberg.de/fileadmin/abt-personal/Homepage_ab_2016-03/10_St…
Applicants should have:
* M.Sc. or comparable degree in Computer Science, Computational
Linguistics, or a related area.
* Experience working with NLP methods and LLMs.
* Strong grasp of probability theory – experience with graphical
models or formal logic a bonus.
* Good knowledge and practical experience with deep learning.
* Very good command of English; German knowledge is not essential.
* Ability to work in a team, excellent communication skills,
enthusiasm, and intrinsic motivation.
If you would like to apply, please send your application to Sean Papay
<sean.papay(a)uni-bamberg.de>. Your application should be a single .pdf
file, comprising a brief motivation letter, your CV, the contact details
of two references, and an academic writing sample -- for example, your
Master's Thesis or an academic paper previously written by you.
Best regards,
--Sean Papay
2nd CALL FOR PAPERS
Fourth International Workshop on Gender-Inclusive Translation Technologies (GITT) at EAMT 2026
15 June 2026, Tilburg, The Netherlands
https://sites.google.com/view/gitt2026/
@gitt-workshop.bsky.social
Important Dates (Time zone: Anywhere on Earth)
Submission deadline: 20 April, 2026
Notification of Acceptance: 13 May, 2026
Camera Ready Copy due: 20 May, 2026
Workshop: 15 June, 2026
**Aim and scope**
The Gender-Inclusive Translation Technologies Workshop (GITT) is set out to be the dedicated workshop that focuses on gender-inclusive language in translation and cross-lingual scenarios. The workshop aims to bring together researchers from diverse areas, including industry partners, MT practitioners, and language professionals. GITT aims to encourage multidisciplinary research that develops and interrogates both solutions and challenges for addressing bias and promoting gender inclusivity in MT and translation tools, including LMs applications for the translation task.
**Topics**
GITT invites technical as well as non-technical submissions, which consist of experimental, theoretical or methodological contributions. We explicitly welcome interdisciplinary submissions and submissions that focus on innovative, non-binary linguistic strategies and/or with sociolinguistically-informed perspectives. The topics of interest include, but are not limited to:
- Models or methods for assessing and mitigating gender bias
- New resources for inclusive language and gender translation (e.g., datasets, translation memories, dictionaries)
- Social, cross-lingual, and ethical implications of gender bias
- Qualitative and quantitative analyses on the potential limits of current approaches to gender bias in translation and MT, error taxonomies as well as best practices and guidelines
- User-centric case studies on the impact of biased language and/or mitigating approaches which can include translators, post-editors, or monolingual MT users
GITT is also open to other non-listed topics aligned with the scope of the workshop and works focusing on non-textual modalities (e.g., audiovisual translation)
**Submission**
We welcome four types of submissions, two archival and two non-archival.
ARCHIVAL
- Research papers: of at least 4 up to 10 pages (excluding references)
- Extended Abstracts: up to 2 pages (including references)
Accepted papers and extended abstracts consisting of novel work will be published online as proceedings in the ACL Anthology.
NON-ARCHIVAL
- Research Communications: up to 2 pages (including references).
We include a parallel submission policy in the form of Research Communications for papers related to the topic of GITT that were accepted in other venues in 2025 and 2026.
- Potluck Communications: short abstract up to 500 words (including references).
Potluck Communications offer a space for anyone—especially students and early career researchers—to discuss bold new ideas for collaboration, brainstorm about ongoing work, and explore future research directions.
The communications will not be included in the proceedings, but will serve to promote the dissemination of research aligned with the scope of the workshop.
All submissions should adhere to the EAMT 2026 guidelines and style templates (PDF, LaTeX, Word) and be uploaded on Easychair ( https://easychair.org/conferences?conf=eamt2026)
**Workshop organizers**
Manuel Lardelli, University of Padova
Janiça Hackenbuchner, University of Ghent
Luisa Bentivogli, Fondazione Bruno Kessler
Joke Daems, University of Ghent
Beatrice Savoldi, Fondazione Bruno Kessler
Eleni Gkovedarou, University of Ghent
HUMIC – Humans and Machines in Conversation: Linguistic, Social and Relational Perspectives on Conversational AI
https://www.ias.surrey.ac.uk/event/humic-humans-and-machines-in-conversatio…
University of Surrey | In-person Workshop
16th June 2026
As generative AI and large language models reshape how we interact with chatbots, voice assistants and conversational agents, HUMIC focuses on the linguistic, social and relational dimensions of these technologies—areas often overlooked in technical development. HUMIC<https://www.ias.surrey.ac.uk/event/humic-humans-and-machines-in-conversatio…>, led by Dr. Doris Dippold and supported by the Surrey Institute for Advanced Studies, the BAAL Special Interest Group ‘Humans, Machines, Languages’ and the Surrey Institute for People-Centred AI, aims to foster interdisciplinary dialogue and connect academic insights with industry practice. Such insights are vital for developing conversational technologies that are context-aware, socially responsive, and cater for their users’ rapport needs.
We invite contributions that explore the complex interplay between humans and machines with reference to these factors. We welcome submissions from researchers working across the disciplines, for example but not limited to linguistics, psychology, sociology, natural language processing, UX research, and conversation design. Submissions may focus on any domain. We particularly welcome submissions from industry, focusing for example on common challenges and practices in designing conversational systems with linguistic, social and relational perspectives in focus.
During the workshop, participants will be invited to participate in a collaborative session. The session will encourage the generation of new research ideas and explore how research can respond to industry challenges. Selected works resulting from this workshop will be considered for a potential special issue.
Keynote Speakers:
* Maaike Groonewege (ConvoCat, Netherlands)<https://www.linkedin.com/in/maaikegroenewege/?originalSubdomain=nl> – Linguist and Conversation Designer
* Bettina Migge (University College Dublin, Ireland)<https://people.ucd.ie/bettinamigge> – Language and AI Technology
* Christian Hildebrand (University of St Gallen, Switzerland)<https://www.ibt.unisg.ch/team/christian-hildebrand/> – AI and Language in Consumer Behaviour
We invite 300-word proposal on topics related to the workshop. Themes of interest include, but are not limited to:
* Linguistic and pragmatic dimensions of human-machine dialogue
* Social and relational dynamics in conversational AI, including rapport-building, empathy and trust
* Designing inclusive and accessible conversational systems that account for the needs of diverse users (linguistic, cultural, neurodiverse)
* Linguistic choices and their role in shaping user expectations, satisfaction, engagement and decision-making
* Evaluation methods and metrics for linguistic, social and relational outcomes in human–machine interaction (qualitative, quantitative, or mixed-methods)
* Model training and fine-tuning strategies for enhancing linguistic, social and relational outcomes in human–machine interaction.
* Interdisciplinary and academic-industry collaboration in the development of linguistically, socially and relationally aware conversational technologies
Accepted submissions will be assigned to oral or poster presentation formats according to the mode of presentation best suited to their content.
Submission Details:
* Abstract length: 300 words (excluding title, authors and references)
* Deadline: 16th March 2026
* Notification of acceptance: 27th April 2025
* Submission: HUMIC – Humans and Machines in Conversation<https://forms.office.com/e/gyLXEs9QFi>
ORGANISERS
Dr Doris Dippold<https://www.surrey.ac.uk/people/doris-dippold>, Literature and Languages, University of Surrey
Dr Fabio Fasoli<https://www.surrey.ac.uk/people/fabio-fasoli>, School of Psyschology, University of Surrey
Dr Di Fu<https://www.surrey.ac.uk/people/di-fu>, School of Psychology, University of Surrey
Dr Richard Green<https://www.surrey.ac.uk/people/richard-green>, School of Health Sciences, University of Surrey
Assistant Professor Amal Haddad<https://www.ugr.es/personal/amal-haddad-haddad>, University of Granada
Prof Constantin Orasan<https://www.surrey.ac.uk/people/constantin-orasan>, Literature and Languages, University of Surrey
Dr Valentina Pitardi<https://www.surrey.ac.uk/people/valentina-pitardi>, Strategy, Marketing and International Business, University of Surrey
---
Prof Constantin Orăsan
Professor of Language and Translation Technologies
Centre for Translation Studies<https://www.surrey.ac.uk/centre-translation-studies>
Personal page: https://www.surrey.ac.uk/people/constantin-orasan
Office: 06LC03, Phone: +44 (0) 1483 68 4115
Library and Learning Centre, University of Surrey, Guildford, Surrey, GU2 7XH, UK
Workshop on Structured Linguistic Data and Evaluation (SLiDE)
A full-day workshop at <https://lrec2026.info/> LREC 2026<https://lrec2026.info/>, 11-16 May 2026, Palma, Mallorca (Spain)
The workshop will be held on May 11, 2026
Webpage: https://www.slide-workshop.org/
Third Call For Papers
In the last ten years, significant advances in deep learning models and the development of Large Language Models (LLMs) have revolutionized the fields of computational linguistics (CL) and natural language processing (NLP). In turn, this has led to a complete re-assessment of the language resources and evaluation practices necessary for training LLMs and analyzing their outputs. In particular, the availability of very large amounts of unstructured data for training foundational models has come into focus, while the value of high-quality structured linguistic data with rich annotations at various levels of linguistic analysis has been downplayed by comparison. However, as CL and NLP practitioners engage further with LLMs and debate their strengths and weaknesses, the importance of high-quality, structured linguistic data has been re-emphasized.
The proposed workshop can be seen as related to the Treebanks and Linguistic Theories (TLT) conference series and the more recent SyntaxFest venue. Over the years, these venues have provided a central forum for high-quality research on treebanks, syntactic theory, syntax-semantics interface, structured meaning representations, and annotated linguistic resources. With record participation in recent years, they demonstrate the vitality and relevance of this line of work. The Workshop on Structured Linguistic Data is conceived as both a continuation of this tradition and an adaptation to the new realities of an LLM-dominated research landscape. The workshop will bring together researchers from these overlapping traditions to advance methods, resources, and practices for integrating structured linguistic data into the LLM era.
Topics of interest include but are not limited to:
Linguistic Data Analyses, Language Resources, and Evaluation
*
Grammar processing with NLP and LLM-based tools
*
Phonological and morphological analysis and LLM tokenization
*
Annotation strategies with LLM-empowered methodologies and tools
*
Design principles and annotation schemes for structured linguistic data
*
Multi-lingual and cross-lingual settings
*
Mapping of structured linguistic data to Linked Open Data resources
*
Evaluation informed by language typology
*
Language resources for underresourced and endangered languages
*
The use of structured linguistic data for NLP applications
*
The use of structured linguistic data in acquiring linguistic knowledge
*
(Semi-)automatic methods for creating structured linguistic data
Spoken language Data
*
Speech-to-text applications
*
Speech Generation techniques
*
Speech data preparation, curation and evaluation
Multimodality and Situated Dialogue
*
Structured multimodal resources: gesture AMR (GAMR), gaze and posture annotation, multimodal dialogue corpora.
*
Multimodal grounding: linking language with visual, gestural, and action representations
*
Structured representations for co-attention and alignment in multiparty dialogue
*
Multimodal evaluation resources for LLMs
Pragmatics and Discourse
*
Structured data for discourse and dialogue: discourse relation annotation, coherence structures, dialogue acts
*
Pragmatic annotation (speech acts, presupposition, implicature, politeness, stance)
*
Structured approaches to common ground tracking and Theory of Mind in LLMs
Semantics and Lexical Meaning
*
Dependency analysis and semantic parsing
*
Annotation beyond syntax: semantics, pragmatics and discourse
*
Structured data for lexical semantics: sense inventories, semantic frames, qualia structure, and type-theoretic resources
*
Computational semantics resources: Abstract Meaning Representation (AMR), Universal Meaning Representation (UMR), Discourse, Representation Structures, Minimal Recursion Semantics (MRS), Type Theory with Records (TTR)
*
Distributional and neural-symbolic representations of lexical meaning: (e.g., Holographic Reduced Representations (HRR), hyperdimensional computing) for structured LLM grounding
*
Aligning vector-based meaning representations with symbolic/typed structures
We invite paper submissions in two distinct tracks:
*
regular papers on substantial and original research, including empirical evaluation results, where appropriate – 8 pages excluding references and potential ethics statements;
*
short papers on smaller, focused contributions, work in progress, negative results, surveys, or opinion pieces – 4 pages excluding references and potential ethics statements.
Invited speakers
Naiara Perez (University of the Basque Country)
Shira Wein (Amherst College)
Paper Submission and Templates
*
Submission follows the LREC 2026 conference instructions, using the Softconf START conference management system accessible through the following link: https://softconf.com/lrec2026/SLiDE/
*
Submissions should follow the LREC stylesheet, available on the conference website on the <https://lrec2026.info/authors-kit/> Author’s kit<https://lrec2026.info/authors-kit/> page.
Papers must be anonymized to support double-blind reviewing.
Important Dates
February 22, 2026: Paper submission deadline
March 15, 2026: Notification of acceptance
March 25, 2026: Camera-ready papers
May 2026: Workshop at LREC 2026
All deadlines are 11.59 pm UTC -12h (“anywhere on Earth”).
Workshop Organizers
Jan Hajič (Prague University, Czech Republic)
Erhard Hinrichs (Tübingen University, Germany)
Sandra Kübler (Indiana University, USA)
Joakim Nivre (Uppsala University, Sweden)
Petya Osenova (Sofia University, Bulgaria)
James Pustejovsky (Brandeis University, USA)
Second Call for Papers
LANLP: Bridging Ibero and Latin American NLP communities
16 May 2026, Palma de Mallorca, Spain
http:<http://lanlp>https://sites.google.com/view/lanlp2026/home
Co-located Networking Symposium @ LREC 2026
https://lrec2026.info/
Description and Goals
We organise a Networking Symposium on Latin American NLP (LANLP), focusing on natural language processing for the diverse languages of the Iberian Peninsula and Latin America. This region includes major world languages (e.g. Spanish (~558M speakers), Portuguese (~267M) as well as regional and indigenous languages. For example, Latin America alone hosts tens of millions of speakers of Quechua (~10M), Guaraní (>6M), Nahuatl (~2M), Aymara (~2M), among many others. Such languages are highly under‐resourced: over 88% of the world’s languages remain largely unsupported by language technologies. This networking event addresses that gap by promoting collaboration on ethically and culturally sensitive resource creation, evaluation, and novel methods for low-resource multilingual NLP in Iberian and Latin American languages and varieties. Our goal is to bring together communities (SEPLN<http://www.sepln.org/>, CLARIAH-ES<https://www.clariah.es/>, PROPOR<https://propor2024.citius.gal/>, AmericasNLP<https://turing.iimas.unam.mx/americasnlp/index.html>, and SomosNLP<https://somosnlp.org/>) to share cutting-edge research, language resources, and best practices.
LANLP focuses on community-driven resource development and evaluation for Iberian languages, and diverse Latin American languages (including indigenous and minority languages). We aim to bridge regional communities: for instance, past forums like OpenCor note that “Latin American and Iberian communities... did not have an established event” to share initiatives, corpora and tools. LANLP fills this gap, fostering new contacts between Iberian and Latin American NLP research groups. The goals are to (1) highlight challenges in processing these languages, (2) share novel datasets and models, and (3) catalyze future collaborations and shared tasks. We emphasize both academic rigor and community inclusivity, encouraging contributions from established researchers and grassroots language advocates alike.
Topics of Interest
We invite submissions on topics including (but not limited to):
*
Language resource creation: Corpora, lexicons, and annotations for Iberian and Latin American languages (text, speech, multimodal).
*
LLMs opportunities and challenges: Small Language Models, synthetic data, mitigating biases, linguistic inequalities, data scarcity, language domination.
*
Multilingual transfer & modeling: Cross-lingual and multilingual representations, transfer learning, and embedding methods that bridge Spanish, Portuguese, varieties and minority languages.
*
Machine translation & generation: MT, summarization, and language generation for Spanish, Portuguese, and low-resource languages (e.g., Quechua, Aymara, Nahuatl).
*
Speech and audio processing: ASR, TTS, and spoken language resources for under-resourced languages and regional dialects (e.g. indigenous languages, Brazilian Portuguese, Latin American Spanish).
*
Dialectal and code-switching NLP: Identification and handling of dialectal variation and code-switching (e.g. Spanish–Portuguese code-mixing, Spanish–indigenous language contact).
*
Morphology and syntax: Analysis and tagging for morphologically rich or under-documented languages (e.g. Basque, Mapudungun, Bribri) using universal dependencies or other frameworks.
*
Domain-specific NLP: Social media, sentiment, hate-speech detection, and other tasks in Iberian and Latin American language contexts (e.g. Latin American social media analysis).
*
Digital humanities & cultural heritage: NLP for historical texts, literature, and cultural content in Spanish, Portuguese, and regional languages.
*
Community-driven methods: Crowdsourcing, citizen science, and participatory approaches for data collection and annotation in these languages.
*
Evaluation and benchmarks: Development of evaluation metrics and benchmarks tailored to low-resource Iberian/Latin languages.
*
Ethical and social issues: Fairness, bias, and indigenous language rights in NLP; collaboration with native speaker communities; data governance and sustainability of resources.
Important dates
*
February 18, 2026: Paper submission deadline
*
March 20, 2026 Notification of acceptance
*
March 30, 2026: Camera-ready deadline
*
May 16, 2026: Networking Symposium Date
Submission Instructions
We invite non anonymous submissions in English, Spanish or Portuguese on the topics of interest between 4 and 8 pages of content. The page limit of 8 pages does not include acknowledgements, references, potential Ethics Statements and discussion on Limitations in line with the policy of the main LREC conference. All submissions must follow the LREC stylesheet (https://lrec2026.info/authors-kit/).
Any submissions which are over-length, poorly formatted or make excessive use of appendices to circumvent page limits are liable to desk-rejection.
At the time of submission, authors are offered the opportunity to share related language resources with the community. All repository entries are linked to the LRE Map (https://lremap.elra.info/), which provides metadata for the resource.
Organizing Committee
*
Luis Chiruzzo Inco (AmericasNLP, luischir(a)fing.edu.uy<mailto:luischir@fing.edu.uy>)
*
Pablo Gamallo (PROPOR, CiTIUS, pablo.gamallo(a)usc.gal<mailto:pablo.gamallo@usc.gal>)
*
María Grandury (SomosNLP, EPFL, mariagrandury(a)gmail.com<mailto:mariagrandury@gmail.com>)
*
Rafael Muñoz Guillena (SEPLN, CENID, UA, rafael(a)dlsi.ua.es<mailto:rafael@dlsi.ua.es>)
*
German Rigau Claramunt (CLARIAH-ES. HiTZ Center, EHU, german.rigau(a)ehu.eus<mailto:german.rigau@ehu.eus>)