- Corpora - ELRA lists

Call for abstracts: Second Workshop on Replication in the Language Science (WoReLa 2)
by Elen Le Foll 15 Jun '26

Dear colleagues,

We are pleased to announce that the *Second Workshop on Replication in the Language Science (WoReLa 2 <https://sites.google.com/view/worela2/home>)* will take place in *Cologne, Germany* on *8–9 July 2027* with a fantastic line-up of keynote speakers and an exciting pre-conference reproduction hackathon (more on that soon!).

The call for abstracts <https://sites.google.com/view/worela2/call-for-abstracts> is open and the deadline is *15 August 2026*.

We welcome submissions reporting on the outcomes of replications in all subdisciplines of linguistics using any (set of) method(s), but also reproductions, multiverse analyses, and meta-scientific contributions on the state of replicability and the adoption of open science practices in the language sciences, as well as demonstrations of concrete tools, teaching materials, software, and workflows that can help linguists do better research and build more cumulative knowledge.

WoReLa 1 <https://sites.google.com/view/worela1/home> in Frankfurt last year was a wonderful meeting of like-minded people with positive ideas for change and we hope that you will want to join us for this exciting second edition in Cologne!

We would also be very grateful if you could circulate this call among any interested friends, colleagues, and students.

Many thanks and best wishes,
Elen, on behalf of the entire organising committee

--
*Dr. Elen Le Foll*
/Senior Researcher and Lecturer (Akademische Rätin)/
Department of Romance Studies <https://romanistik.phil-fak.uni-koeln.de/> • Data Center for the Humanities <https://dch.phil-fak.uni-koeln.de/> • University of Cologne <https://portal.uni-koeln.de/en/uoc-home>
Applied Linguistics • Corpus Linguistics • Language Teaching & Learning
ORCID <https://orcid.org/0000-0002-5839-8010> • HAL Science <https://cv.hal.science/elenlefoll>

[CFP] Special issue on ethics in NLP and CL in Computational Linguistics Journal
by ACL Announcements 15 Jun '26

[Apologies for cross-posting]

Dear colleagues!

We are very happy to announce the forthcoming special issue on ethics in natural language processing and computational linguistics in the journal Computational Linguistics ([https://direct.mit.edu/coli](https://direct.mit.edu/coli)).

Language processing technologies from Siri to Google Translate and ChatGPT have become ubiquitous in our societies. Consequently, there has been a decade of attention to algorithmically mediated harms, including harms arising from social biases and discriminatory technologies. In this special issue, we broadly invite contributions that reflect on the ethics of language technologies.

We invite **theoretical** and technical submissions on topics including, but not limited to:

* Reflection on bias and fairness research and its outcomes;
* Interdisciplinary perspectives on bias, fairness, and algorithmically mediated harms;
* Critiques of existing approaches;
* Development of new theories for bias, fairness, and algorithmically mediated harms;
* Analyses of power relationships (including conflicts of interest) and their impacts;
* Multilingual considerations;
* Decolonisation in relation to bias and fairness;
* Reflections on ongoing debates on bias, fairness, and ethics;
* Language technologies in context (e.g., in political/social/cultural tensions);
* Slow science and its relationship to bias and fairness;
* Luddite and decomputing perspectives on language technologies, bias, and fairness;
* Environmental impacts of language technologies;
* The impacts and social considerations of data labour;
* Social and societal harms of language technologies; and
* External perspectives on language technologies.

We particularly encourage **inter-, cross- and transdisciplinary** submissions which center questions around the fairness, bias, justice, and ethics of natural language processing technologies and computational linguistics.

### Important dates and information

* *Submission deadline*: 27 November, 2026
* *Notification*: February, 2027
* *Publication (Expected)*: October, 2027

Submission site: https://submissions.cljournal.org/index.php/cljournal/submission

### Guest Editors

* Karën Fort, Université de Lorraine / LORIA, France
* Margot Mieskes, University of Applied Sciences, Darmstadt, Germany
* Zeerak Talat, University of Edinburgh, Edinburgh, UK

If you have any questions, feel free to reach out to us at [cl\_si\_ethics@inria.fr](mailto:cl_si_ethics@inria.fr).

### About the journal

Computational Linguistics is the longest-running publication devoted exclusively to the computational and mathematical properties of language and the design and analysis of natural language processing systems. This highly regarded quarterly offers university and industry linguists, computational linguists, artificial intelligence and machine learning investigators, cognitive scientists, speech specialists, and philosophers the latest information about the computational aspects of all the facets of research on language.

Computational Linguistics is a diamond open access journal, which means that "there is **no fee to publish and the content is open to anyone to read**. All of these titles employ a Creative Commons license for individual articles." (https://direct.mit.edu/journals/pages/open-access)

Reminder: MIAI–PRAIRIE Online Seminar: Adele Goldberg (Princeton): Compositionality, creativity in natural language and LLMs (15/6, 5pm)
by Thierry Poibeau 15 Jun '26

MIAI–PRAIRIE Online Seminar on LLMs and the Study of Language, Mind, and Society

Our next speaker will be Adele Goldberg, from Princeton, for a talk on “Compositionality, creativity in natural language and LLMs”, on Monday 15 June, 5pm (French time). Online, free access, with no registration.

Organized by Caroline Rossi (Université Grenoble Alpes / MIAI) and Thierry Poibeau (ENS–PSL / PRAIRIE–PSAI).

Next year’s speakers will include Eloïse Boisseau (AMU, Marseille) and Dallas Card (U. Michigan), among others.

----

*** Compositionality, creativity in natural language and LLMs ***

Monday 15 June, 5pm (French time), online (free access, no registration)

Connection link: https://webinaire.numerique.gouv.fr/meeting/signin/invite/78275/creator/433…

Adele Goldberg, Princeton

Abstract: Today’s LLMs interpret and produce familiar and novel language without abstract symbolic rules. An appreciation of the complexity of natural languages indicates this is more a feature than a bug. New evidence demonstrates that LLMs are also at least as creative as the typical person. Parallels between LLMs and human language highlight the statistical and functional aspects of both systems. For cognitive scientists, LLMs promise a deeper understanding of compositionality and creativity.

Bio: Adele Goldberg is the M. Taylor Pyne Professor of Psychology at Princeton University. Her research explores the formal, semantic, social, statistical, and memory-based factors that shape how languages are learned, represented, and used. She is fascinated by what makes human language both creative and constrained, across adults and children, first and second language learners, and neurotypical and atypical populations. Her current work touches on word meaning, language change, island constraints, metaphor and emotion, good-enough language production, and the forms and functions of grammatical constructions. She is a Fellow of the Linguistic Society of America, the Association for Psychological Science, and the Cognitive Science Society, and an elected member of the American Academy of Arts and Sciences.

Lancaster Summer School highlights
by Brezina, Vaclav 13 Jun '26

Dear all,

I hope this message finds you well.

We are pleased to invite you to a series of online live-streamed highlights from the Lancaster Summer Schools in Corpus Linguistics, hosted by Lancaster University. These sessions are open to participants who are unable to attend in person and will provide insights into key topics in corpus linguistics, statistics and data analysis. A certificate of attendance will be provided to registered participants who attend all sessions.

📅 Programme of online sessions (UK time)

* 15 June 2026 | 3:30pm–4:30pm: Introduction to corpus statistics
* 16 June 2026 | 11:00am–12:00pm: Analysing and visualising collocations
* 17 June 2026 | 10:30am–1:00pm: EMI corpus launch
* 18 June 2026 | 11:00am–12:00pm: Statistics and data visualisation

📝 Registration

To attend, please register using the following form: https://forms.office.com/e/7UPvCDfwAX

After registration, further details and access links for the live sessions will be provided.

Vaclav

Professor Vaclav Brezina
Professor in Corpus Linguistics
Co-Director of the ESRC Centre for Corpus Approaches to Social Science

NeTTIT 2026: Call for Participation
by Constantin Orasan 13 Jun '26

International Conference ‘New Trends in Translation and Interpreting Technology’ (NeTTIT 2026)
Dubrovnik, Croatia, 24-27 June 2026
https://nettt-conference.com

Call for Participation

Dear Colleagues,

We are delighted to invite you to participate in the third edition of the International Conference ‘New Trends in Translation and Interpreting Technology’ (NeTTIT 2026), taking place in Dubrovnik, Croatia, from 24 to 27 June 2026.

Following the success of previous editions, NeTTIT 2026 continues to serve as a unique bridge between academia and industry, bringing together researchers, developers, practitioners, language service providers, and vendors interested in the latest technologies for translation and interpreting.

Why Participate?

* Discuss cutting-edge research, tools, and practices in translation, interpreting, subtitling, localisation, and machine translation.
* Attend invited talks by renowned experts, including Yves Champollion (Wordfast LLC) and Marko Grobelnik (Jožef Stefan Institute).
* Join pre-conference tutorials on:
  * Post-editing and AI-augmented translation (Marie Escribe)
  * Machine Translation Quality Evaluation (Tharindu Ranasinghe)
  * Automatic Speech Recognition for interpreters (Constantin Orasan)
* Network with peers and establish research or business collaborations.

Accepted Papers

The list of accepted papers is available on the conference website: https://nettt-conference.com/2026/28003-2/

Important Dates (Participation)

Conference dates: 24-27 June 2026
Early registration finishes on 31 May 2026

Venue

The conference will be held at the Centre for Advanced Academic Studies (CAAS) of the University of Zagreb in Dubrovnik.

Registration

Registration details (fees, deadlines, and accommodation recommendations) are available on the conference website: https://nettt-conference.com/2026/fees-registration/

Follow and Share

Stay updated via social media:
LinkedIn: https://www.linkedin.com/company/nettit2026/
Twitter/X: https://x.com/NeTTIT2026

Contact

For any questions, please contact the organisers at: nettit2026(a)nettt-conference.com

We look forward to welcoming you to Dubrovnik for an inspiring and collaborative event!

NeTTIT 2026 Organising Committee

Funded PhD in NLP/Speech Analysis/Digital Mental Health [University of Birmingham and University of Melbourne]
by Mike Conway 13 Jun '26

The University of Birmingham (UK) and the University of Melbourne (Australia) are offering a full-time funded joint PhD fellowship at the intersection of NLP, speech analysis, and digital mental health. Please find further details below.

To apply, send your CV and academic transcripts to both Dr Melanie Jouaiti (m.jouaiti(a)bham.ac.uk) and Dr Mike Conway (mike.conway(a)unimelb.edu.au).

Please note that as this is an interdisciplinary project, applicants from various disciplinary backgrounds are welcome (e.g. computer science, linguistics, psychology, engineering, cognitive science). However, a first class honours degree (or international equivalent) and some prior research experience (e.g. a master's dissertation) are required.

PROJECT DESCRIPTION

Project title: Automated analysis of clinical interviews

Project Summary: One fully funded project on the study of “Automated analysis of clinical interviews from speech and language” is available. This Joint PhD project will be primarily based at the University of Birmingham, UK, with a minimum 12-month stay at the University of Melbourne, Australia.

Project leaders: Please contact both Dr Melanie Jouaiti (Birmingham, m.jouaiti(a)bham.ac.uk) and A/Prof Mike Conway (Melbourne, mike.conway(a)unimelb.edu.au) with your CV and academic transcripts.

Project description: This project will be developed in close collaboration with clinicians and brings together expertise in speech processing, natural language processing, and machine learning.

Talking therapies can be effective for common mental health and behavioural problems. Although the use of talking therapies has increased substantially in recent years, the mechanisms underlying their effectiveness remain poorly understood, and quality assurance still relies on human assessors, an approach that is both costly and difficult to scale. Achieving a better understanding of the characteristics of different talking-therapy approaches, combined with developing automated assessment methods for gauging therapy quality, could improve therapist training and patient outcomes.

This project will use language analysis (what was said), speech analysis (how it was said), and large language models to address two research questions:

* Can automatic analysis of talking therapy match the effectiveness of human assessment?
* What mechanisms underpin talking therapy?

Qualifications:

* A bachelor's degree in a relevant discipline which includes a substantial research component equivalent to at least 25% of one year of full-time study. Students should have achieved a first class honours degree (or international equivalent)

AND/OR

* A master's degree in a relevant discipline which includes a substantial research component equivalent to at least 25% of one year of full-time study. Students should have achieved a distinction (or international equivalent)

CfP: IMPACT-SPEECH@EMNLP’26
by Monorama Swain 12 Jun '26

Dear Colleagues,

We invite submissions to IMPACT-SPEECH: Identifying, Measuring, Preventing, and Assessing Consequences of Bias in Speech LLMs, a workshop that brings together researchers from speech processing, natural language processing, machine learning, and social sciences to discuss emerging challenges related to bias in speech-enabled large language models and multimodal speech systems.

Website: https://impactspeech.github.io/

The workshop will be co-located with EMNLP 2026 in Budapest, Hungary, October 24–29, 2026.

Topics of interest include, but are not limited to:

- Fairness evaluation for speech models, including methods to measure disparities across accents, genders, speaker identities, socioeconomic groups, and other factors.
- Bias detection and benchmarking datasets for speech systems, including the development of datasets designed to identify and evaluate bias in speech recognition, speech generation, and speech-enabled LLMs.
- Ethical considerations in speech AI development, including responsible data collection, annotation practices, transparency, accountability, and governance of speech technologies.
- Evaluation of Speech LLMs for fairness and inclusivity, including methods to assess bias in speech understanding, speech generation, and multimodal speech-language systems.
- Cross-lingual and multilingual fairness, with a focus on challenges faced by underrepresented languages, dialects, and low-resource speech communities.
- Bias mitigation strategies for speech systems, including algorithmic approaches, training strategies, and fairness-aware adaptation techniques.
- Real-world impacts of biased speech technologies, including implications for accessibility, employment, digital inclusion, and human–AI interaction.
- Responsible deployment and governance of speech technologies, including best practices for monitoring, auditing, and mitigating bias in real-world applications.

Important Dates

- Direct paper submission deadline: 15 July 2026
- Pre-reviewed ARR commitment deadline: 25 August 2026
- Extended abstract deadline: 25 August 2026
- Notification of acceptance: 3 September 2026
- Workshop date: 29 October 2026

All deadlines are 11:59 PM Anywhere on Earth.

IMPACT-SPEECH welcomes both archival and non-archival papers:

- Archival papers: Research papers presenting original empirical or theoretical results
- Non-archival papers: Previously published, work-in-progress, and extended abstracts

Papers may be submitted as long papers (up to 8 pages plus references), short papers (up to 4 pages plus references), and extended abstracts (up to 2 pages plus references). Papers should be submitted in the ACL format [https://github.com/acl-org/acl-style-files] following the ACL Author guidelines [https://www.aclweb.org/adminwiki/index.php?title=ACL_Author_Guidelines]. The review process will be two-way anonymized; therefore, all identifying information must be removed from submissions.

For questions, please contact: impactspeech.workshop(a)gmail.com

On behalf of the workshop organisers:
Ravi Shekhar, Monorama Swain, Jagabandhu Mishra, Sandipan Dhar, Haralambos Mouratidis, Matthew Purver

Special Issue on the Ethics of NLP and Computational linguistics
by z＠zeerak.org 12 Jun '26

[3rd CFP] LM Playschool Workshop at EMNLP 2026 — Submission Deadlines Approaching!
by Sabrina McCallum 12 Jun '26

Call for Participation: LM Playschool (LMP 2026)
Improving Language Models through Learning from Dialogue Interaction
Co-located with EMNLP 2026, 28 October 2026, Budapest

Website: https://lm-playschool.github.io/
Starter Kit: https://github.com/lm-playpen/playpen

The submission deadlines for the LM Playschool Workshop are fast approaching! Please note that some dates have been updated since our previous announcements (see below).

📝 SUBMISSION TRACKS

We welcome either long or short submissions for the following tracks:

1. Challenge track: Technical reports for the LM-Playschool challenge (archival).
2. Paper-only track: Work-in-progress (archival or non-archival) or recently published papers (non-archival).

🏆 CHALLENGE TRACK (SHARED TASK)

The shared task focuses on post-training LLMs to master communicative skills in unseen dialogue games while retaining original language capabilities. Participants are free to choose any base model; evaluation is based on improvement relative to that base model on an unseen test set.

Not signed up yet? Register here: https://forms.gle/fhpXPH5kZk4psPXp9

🛠️ STARTER KIT

Our sandbox environment Playpen is available on GitHub and ready to use. Key features:

* Comprehensive Evaluation: A single command to get your "clemscore" (interactive competence) and "statscore" (classic benchmarks).
* Training Recipes: Example scripts for SFT and GRPO to help models learn from game-state success.
* Resource Friendly: We encourage using the Qwen3.5 family (0.8B to 27B) to ensure participation is possible with modest compute.

🎯 PAPER-ONLY TRACK: TOPICS OF INTEREST

We welcome original research and work-in-progress on:

* Architectures and training regimes for interactive agents.
* Intrinsic rewards and learning signals (RL from game-state success).
* Benchmarking via dialogue games.
* Data efficiency and social interaction.
* Social cognition and Theory of Mind in interactive systems.
* Human-agent collaboration and coordination.
* Embodied interactive agents.
* Communicative and perceptual grounding.

📅 IMPORTANT DATES (updated)

* ARR paper submission deadline: May 25, 2026
* Challenge submission deadline: July 5, 2026 (3 weeks away!)
* Direct paper submission deadline: July 12, 2026 (1 month away!)
* Pre-reviewed ARR commitment deadline: August 2, 2026
* Notification of acceptance: August 8, 2026
* Camera ready due: August 23, 2026
* Challenge winners announced: Early October 2026
* Workshop at EMNLP 2026: October 28, 2026, Budapest

For more information, visit our website: https://lm-playschool.github.io/

We look forward to your submissions!

The LMP 2026 Organizing Committee:
Raffaella Bernardi, Raquel Fernández, Mario Giulianelli, Sherzod Hakimov, Alexander Koller, Dieu-Thu Le, Oliver Lemon, Davide Mazzaccara, Sabrina McCallum, David Schlangen, Alessandro Suglia
