- Corpora - ELRA lists

[ÚLTIMA CHAMADA] XVII Encontro de Linguística de Corpus (ELC) e XIV Escola Brasileira de Linguística Computacional (EBRALC)
by Jackson Souza 19 Jun '26

19 Jun '26

*XVII Encontro de Linguística de Corpus (ELC)* *XIV Escola Brasileira de Linguística Computacional (EBRALC)* *9 a 12 de novembro de 2026, na Universidade Federal da Bahia* *ÚLTIMA CHAMADA* para ELC/EBRALC 2026, um dos principais encontros acadêmicos dedicados às relações entre linguagem, tecnologia e sociedade. Nesta edição, o evento trará como tema central: *“Linguística de Corpus, Inteligência Artificial e Transformação Digital: Dados, Métodos e Impactos Sociais na Era Digital”.* O evento [presencial] reunirá pesquisadores, estudantes e profissionais interessados em discutir os desafios e as oportunidades das tecnologias linguísticas contemporâneas, com foco em inteligência artificial, ética de dados e análise baseada em grandes volumes de informação. O *ELC* consolida-se como um espaço de divulgação científica e debate acadêmico, promovendo o intercâmbio de pesquisas sobre linguagem em contextos digitais e sociais. Já a *EBRALC* possui caráter formativo, oferecendo oficinas e minicursos voltados à aplicação prática de ferramentas de Linguística de Corpus e Processamento de Linguagem Natural. Nesta edição, o encontro será realizado na *Universidade Federal da Bahia (UFBA)*, fortalecendo a presença da pesquisa linguística baseada em dados na região Nordeste e incentivando o diálogo interdisciplinar. Compartilhem a chamada em suas respectivas instituições e programas de pós-graduação! *Indicações de temas* ● Linguística de corpus e análise baseada em dados ● Processamento de Linguagem Natural ● Modelos de linguagem e representação semântica ● Métodos computacionais e análise de dados linguísticos ● Linguagem, discurso e cultura digital ● Ética, governança e impactos sociais da IA ● Linguística de corpus aplicada ● Terminologia, Lexicografia ● Tradução ● Linguística de corpus e ensino ● Linguística de corpus e cultura digital *Datas importantes* *28/06/2026* – Submissão de trabalhos para ELC *28/06/2026* – Submissão de propostas de oficinas/minicursos para EBRALC *27/07/2026* – Resposta aos autores/proponentes *09/08/2026* – Envio das versões finais 09 a 12/11/2026 – Realização do evento *Submissão de propostas* Acesse o formulário: https://forms.gle/d8uHzNY86xWSh7YSA *Dúvidas e contato* Site do evento <https://sites.google.com/view/elc-ebralc2026/in%C3%ADcio>. elcebralc2026(a)gmail.com | acksoncruz(a)ufba.br <jacksoncruz(a)ufba.br> --

1 0

Two PhD positions in Uncertainty Quantification and Communication for LLMs
by Christian Hardmeier 19 Jun '26

19 Jun '26

We have two open PhD positions in our project on Uncertainty Quantification and Communication for Large Language Models. Large language models (LLMs) have become immensely popular tools for laypeople and experts to find quick answers to any questions, including such that require specialized knowledge or complex reasoning. Their replies are far from always correct. Right or wrong, but often couched in a confident and persuasive tone, they may fool even experts into believing entirely false statements. Accurate and trustworthy LLM tools must know how much confidence a model has in its output and communicate this to the user in a nuanced and understandable form, respecting the principles of communication between humans. Our interdisciplinary project unites machine learning, natural language processing and linguistics. We study how LLMs acquire and verbally communicate (un)certainty, develop novel methods in Bayesian uncertainty quantification and apply them to LLMs in health applications. Using methods from linguistics, we emphasize targeted communication that gets interpreted correctly by diverse users, including laypeople and experts, and create LLMs that communicate their uncertainty according to humanlike standards. One position is in natural language processing at the IT University of Copenhagen, supervised by me and Jes Frellsen (DTU). In this position, you will develop methods to help LLMs express their uncertainty in a way that humans understand, while matching the perceived uncertainty to the model's internal state. Application deadline 24 June (very soon!). This position was advertised before. Apply here: https://candidate.hr-manager.net/ApplicationInit.aspx?cid=119&ProjectId=181… The second position is in machine learning at the Technical University of Denmark, supervised by Jes Frellsen and me. In this position, you will develop efficient machine learning methods to quantify the uncertainty of large language models. This position is new. Application deadline 7 August. Apply here: https://efzu.fa.em2.oraclecloud.com/hcmUI/CandidateExperience/da/sites/CX_2… Both positions include an extended research stay at the University of Edinburgh, the first with Prof. Hannah Rohde (Linguistics) and the second with Prof. Ivan Titov (NLP/machine learning). -- Christian Hardmeier Associate Professor, IT University of Copenhagen https://christianhardmeier.rax.ch/

1 0

CFP: 47th Translating and the Computer Conference (TC47)
by Amal Haddad 19 Jun '26

19 Jun '26

Call for Presentations and papers 47th Translating and the Computer Conference (TC47) Luxembourg, 8 to 10 December 2026 https://asling.org/tc47/ [1] AI-assisted or AI-eclipsed? Language services between promise and pressure AsLing invites submissions for the 47th edition of the Translating and the Computer Conference (TC47), to be held from 8 to 10 December 2026 in Luxembourg. The TC conference series brings together professionals, researchers, developers and decision-makers from the language industry, academia and public institutions. TC47 will explore how technological innovation - particularly AI - is reshaping multilingual communication, raising new questions about human agency, professional ethics, and sustainable practices in the language services sector. Conference theme _AI-assisted or AI-eclipsed? Language Services between Promise and Pressure_ _ _ From Machine Translation and LLMs applied to translation, language professionals face unprecedented change. TC47 invites reflection on how to navigate this evolving landscape - to ensure that technology empowers rather than eclipses, and that multilingual communication remains inclusive, trusted and professionally grounded. We especially welcome contributions exploring: * Synergy between human expertise and AI-powered tools * The role of AI in promoting or undermining inclusion and equity * Strategies for sustainable and ethical language services * Cross-sector collaboration between academia, industry, and institutions Submissions not focused on AI are equally welcome, particularly those addressing broader trends in multilingual communication, training, translation workflows, and evolving professional practices. We also welcome critical reviews and discussions on: * The broader impact of AI and automation on the language industry * Implications for training, education and career development of language professionals * Coexistence of AI and traditional practices * Impact of AI on language professionals * Adoption barriers and risks for LSPs new to AI * Future trends in translation, interpreting, and localisation - with or without AI * Responsible and sustainable development in language technologies (environmental, social, professional) Key areas of interest Include, but are not limited to: * Multilingual NLP and large language models * Human-in-control systems vs. human-in-the-loop AI * Terminology management and controlled language * AI readiness and digital transformation in LSPs * NLP, semantic technologies and linked data * Collaborative translation tools and environments * Quality assurance, benchmarking and evaluation * Training, professional development and digital upskilling * Inclusive and culturally aware AI systems * Sustainable practices across the language lifecycle * Language policy and digital language equality * FAIR data, corpora and infrastructure * Ethical implications and human oversight * Empowering language professionals to shape - not just use - AI tools * Non-AI innovations and evolutions in translation, interpreting, localisation or terminology work We invite: * Innovative research: studies that expand the boundaries of language technologies, multilingual NLP, or AI ethics. * Practical applications: case studies from public or private sector stakeholders showcasing language technology use and development. * Workshops and panels: interactive formats encouraging dialogue on timely, challenging or divisive issues in AI and language work. * Critical reflections: well-argued contributions questioning current uses of AI and proposing alternative, human-centred approaches. * Posters and short talks: snapshots of emerging projects, tools, or preliminary research. Submission tracks All submissions are for talks, within the following categories: * Research track (Academic) * 20-minute talk * Followed by a paper (max. 5,000 words) presenting original, unpublished research * User experience track (Non-academic) * 20-minute talk * Optional post-facto paper (max. 5,000 words) detailing workflows, tools or implementation cases * Posters / Short talks * 7-8-minute talk * Followed by a paper (max. 2,000 words) outlining a project, experiment, or tool * Workshops and panels * Interactive sessions with multiple speakers * Moderators may submit an optional post-facto paper summarising key takeaways Submission instructions Submissions must be made via the START conference submission system: https://www.softconf.com/p/tc2026 [2] Important dates * Deadline for research/user experience talks: 30 June 2026 ➤ Notification of acceptance: 31 August 2026 * Deadline for workshops and panels: 31 July 2026 ➤ Notification of acceptance: 15 September 2026 * Deadline for posters and short talks: 15 September 2026 ➤ Notification of acceptance: 30 September 2026 * Final paper submission (except post facto workshop and panel papers): 31 October 2026 * Conference dates: 8-10 December 2026 Submission guidelines Detailed submission guidelines, including templates and formatting instructions, will be available on the TC47 conference website. We look forward to your contributions that will help shape the future of language services through innovation, collaboration, and inclusivity. Why submit to TC47? TC47 offers a unique opportunity to engage in a multi-stakeholder dialogue that bridges research, practice and policy. It is a space for shared reflection on what language professionals need, what tools actually deliver and how we co-create a future where humans and AI work better together. For any questions, reporting of problems concerning submissions or the Conference at least, please email tc47-info(a)asling.org. Let's explore, challenge and shape the future of multilingual communication together! -- Amal Haddad Haddad (She/her) Facultad de Traducción e Interpretación Universidad de Granada |https://www.ugr.es/personal/amal-haddad-haddad Lexicon Research Group |http://lexicon.ugr.es/haddad Co-Convenor, BAAL SIG 'Humans, Machines, Language'|https://r.jyu.fi/humala Event Coordinator, BAAL SIG 'Language, Learning and Teaching' =============== Cláusula de Confidencialidad: "Este mensaje se dirige exclusivamente a su destinatario y puede contener información privilegiada o confidencial. Si no es Ud. el destinatario indicado, queda notificado de que la utilización, divulgación o copia sin autorización está prohibida en virtud de la legislación vigente. Si ha recibido este mensaje por error, se ruega lo comunique inmediatamente por esta misma vía y proceda a su destrucción. This message is intended exclusively for its addressee and may contain information that is CONFIDENTIAL and protected by professional privilege. If you are not the intended recipient you are hereby notified that any dissemination, copy or disclosure of this communication is strictly prohibited by law. If this message has been received in error, please immediately notify us via e-mail and delete it" =============== Links: ------ [1] https://asling.org/tc47/ [2] https://www.softconf.com/p/tc2026/

1 0

Computational Linguistics Fall School in Nuremberg, Germany (Sep 21-Oct 2)
by Michael Roth 19 Jun '26

19 Jun '26

We are excited to announce the Computational Linguistics Fall School 2026! From 21 September to 2 October 2026, the CL Fall School will take place at the University of Technology Nuremberg in Germany. The Fall School is aimed at Master's students and early-stage PhD students from linguistics and computer science who want to deepen their knowledge of Computational Linguistics and Natural Language Processing. The focus is on courses covering topics that are not typically offered in regular degree programs. The event is sponsored by the German Society for Computational Linguistics and Language Technology (GSCL) and the Special Interest Group CL of the Linguistic Society of Germany (DGfS-CL). This year, participants will join in-person courses over two weeks on the following topics: * Programming for CL and NLP * Corpus linguistics methodology * Explainable NLP * NLP and sociolinguistics Registration is now open, with early bird registration available until 15 July 2026. More information and registration: https://cl-fallschool-2026.github.io/ We look forward to welcoming you in Nuremberg! --- Prof. Michael Roth [he/him] Natural Language Understanding Lab University of Technology Nuremberg Technische Universität Nürnberg

1 0

PhD Position in Speech Processing
by Cieliebak Mark (ciel) 19 Jun '26

19 Jun '26

Dear List Moderator It would be great if you could forward the position below to your mailing list. Best wishes Mark Dear All The Centre for AI at Zurich University of Applied Sciences is offering a 3-year PhD position in Speech Processing for Swiss German: https://www.zhaw.ch/de/jobs/offene-stellen/stelleninserat/job/detail/3800605 Start: September 2026 Requirement: Good skills in German, but Swiss German not mandatory Best wishes Mark ----------------------------------------------------------------------------------------- ZHAW Zurich University of Applied Sciences CAI Centre for Artificial Intelligence Prof. Dr. Mark Cieliebak Technikumstrasse 71, TN 03.64 CH - 8401 Winterthur, Switzerland Phone: +41 58 934 72 39 Email: ciel(a)zhaw.ch<mailto:ciel@zhaw.ch> Web: https://www.zhaw.ch/de/ueber-uns/person/ciel

1 0

2nd CFP: Data-driven Storytelling Workshop (DDS 2026) at ISWC – Submit by July 24th
by Pasquale Lisena 19 Jun '26

19 Jun '26

Dear colleagues, This is a second call for papers for the Data-driven Storytelling: Bridging Semantics, AI, and Narrative (DDS 2026) workshop, co-located with ISWC 2026. We invite submissions from researchers and practitioners in knowledge graphs, NLP, HCI, and generative AI to explore the role of semantic technologies in enhancing narrative creation and engagement. Important Dates Submission Deadline: July 24th, 2026 (23:59 AoE) Notifications: August 21st, 2026 Camera-ready Version: September 18th, 2026 Workshop Dates: October 25-26, 2026 We welcome submissions of research papers, demos, and short papers addressing topics such as: • Knowledge graphs and ontologies for storytelling • AI-driven narrative generation (LLMs, GenAI) • Benchmarking narrative quality and coherence • Interactive and participatory storytelling tools • Ethics and explainability in automated storytelling Website and submission: https://data-driven-storytelling-workshop.replit.app/ We look forward to your contributions! Best regards, -- Pasquale Lisena EURECOM, Campus SophiaTech 450 route des Chappes, 06410 Biot, France e-mail: pasquale.lisena(a)eurecom.fr site: http://pasqlisena.github.io/

1 0

Save the date – COLING 2027
by Joakim Nivre 19 Jun '26

19 Jun '26

Preliminary Call for Papers We announce that the 32nd International Conference on Computational Linguistics (COLING 2027) will take place in Macau, China, May 9-14 2027. COLING 2027 invites the submission of long and short papers featuring substantial, original, and unpublished research in all aspects of computational linguistics and natural language processing. More details soon. * Conference website: https://2027.coling-iccl.org/ Paper Submission Information This year, COLING will adopt the ARR system for reviewing. Papers may be submitted to the ARR 2026 October cycle. Papers that have already received reviews and a meta-review from ARR from earlier cycles can also be committed to COLING 2027. Please note that NAACL 2027 will also use the October ARR cycle for reviewing. Important Dates * ARR submission deadline (long & short papers): October 12, 2026 * Main Conference: May 9–14, 2027 General Chairs * Katrin Erk, University of Massachusetts Amherst * Chengqing Zong, Institute of Automation, Chinese Academy of Sciences Program Chairs * Minlie Huang, Tsinghua University * Joakim Nivre, Uppsala University * Alexis Palmer, University of Colorado Boulder * Sina Zarrieß, University of Bielefeld * Email: coling2027-pcchairs(a)googlegroups.com<mailto:coling2027-pcchairs@googlegroups.com> När du har kontakt med oss på Uppsala universitet med e-post så innebär det att vi behandlar dina personuppgifter. För att läsa mer om hur vi gör det kan du läsa här: http://www.uu.se/om-uu/dataskydd-personuppgifter/ E-mailing Uppsala University means that we will process your personal data. For more information on how this is performed, please read here: http://www.uu.se/en/about-uu/data-protection-policy

1 0

[Second Call for Participation] WMT 2026 Shared Task on Multitask LLMs with Limited Resources
by daryna.dementieva＠tum.de 18 Jun '26

18 Jun '26

Our WMT 2026 Shared Task on Multitask LLMs with Limited Resources, co-located with EMNLP 2026 is entering the second phase! Task website: https://www2.statmt.org/wmt26/limited-resources-llm.html?utm_source=chatgpt… Google Group: https://groups.google.com/g/llms-with-limited-resources-2026?utm_source=cha… This shared task extends the WMT 2025 edition on low-resource LLMs by introducing three additional multitask settings: * Spell Checking * Grammar Checking * modern evaluation of Maths Reasoning The benchmark focuses on low-resource evaluation for three languages: Ukrainian (uk), Upper Sorbian (hsb), andLower Sorbian (dsb). Participants are required to train a single multitask model based on the Qwen3.5-2B family and jointly evaluate it on all tasks: * Machine Translation * Multiple-Choice Question Answering * Math Reasoning (new) * Spell Checking (new) * Grammar Checking (new) The task aims to study: * Cross-task transfer in compact LLMs * Multitask learning under strict parameter constraints * Efficient adaptation for low-resource Slavic languages * Interactions between translation, linguistic correction, QA, and reasoning capabilities So, if you would like to challenge yourself in developing LLMs with restricted resources -- consider our shared task! Important dates (AoE): * Training/development data release: Now! * Test data release: end of June 2026 * Submission deadline: July 2026 We welcome participation from both academia and industry, especially teams interested in multilingual NLP, efficient LLM adaptation, low-resource machine translation, and multitask learning. Organisers (in alphabetical order): * Daryna Dementieva * Marion Di Marco * Lukas Edman * Alexander Fraser * Kathy Hämmerl * Shu Okabe All good vibes, Daryna Dementieva On behalf of LLMs with Limited Resources Shared task organisers

1 0

Two fully funded positions at UEF in Finland
by Mikko Laitinen 18 Jun '26

18 Jun '26

Two Fully Funded Doctoral Positions in Languages, linguistics and discourse analysis The University of Eastern Finland is pleased to announce two fully funded doctoral positions within the Marie Skłodowska‑Curie Doctoral Programme (MSCA) in Finland. While the programme broadly focuses on interdisciplinary research on the sustainable forest‑based bioeconomy, two positions are specifically dedicated to language‑related research on sustainability, nature, and discourses. Key Facts * Application period: 11 June – 10 August 2026 * Employment duration: 48 months (fulltime) * Mobility: International mobility required (MSCA rules apply) * Total project duration: April 2026 – March 2031 * More information of the programme: https://sites.uef.fi/dp-fobi/ Position 1 Environmental Discourses and the Spread of Multilingual (Dis)information in Social Networks This project invites a doctoral candidate interested in how environment‑related discourses circulate in digital social networks. The research aims to examine how discourses and (dis)information about climate, the environment, and climate‑smart solutions spread across digital media platforms. Ideal Candidate Profile * Master’s degree in linguistics (general or subfields), computational linguistics, translation studies, or a closely related discipline * Competence with large‑scale datasets and computational tools such as R or Python * Strong expertise in the nuances of language use * Excellent written and oral English skills * Ability to work in more than one language is an advantage * Must meet the language requirements of the Doctoral Programme of the Philosophical Faculty at UEF (see: https://www.uef.fi/en/degree-programme/doctoral-programme-of-the-philosophi…) Further details: https://sites.uef.fi/dp-fobi/environmental-discourses/ Position 2 Contested Conceptualizations of Forest in Bioeconomy‑Related Discourses This project focuses on how forest and nature are conceptualized in public discourses related to bioeconomy. It examines the rhetorical, metaphorical, and affective strategies used to frame forest‑based solutions to societal and environmental challenges. Ideal Candidate Profile * Master’s degree in languages, translation studies, linguistics, media studies, communication studies, cultural studies, environmental humanities, or a related field * Excellent written and oral English skills * Must meet the language requirements of the Doctoral Programme of the Philosophical Faculty at UEF (see: https://www.uef.fi/en/degree-programme/doctoral-programme-of-the-philosophi…) * Ability to work in more than one language is an advantage * Prior methodological knowledge of metaphor analysis, rhetorical analysis, or other language‑data methods is beneficial but not required Further details: https://sites.uef.fi/dp-fobi/contested-conceptualizations/ More information: Position 1: Prof. Mikko Laitinen (mikko.laitinen(a)uef.fi<mailto:mikko.laitinen@uef.fi>) and Prof. Maarit Koponen (maarit.koponen(a)uef.fi<mailto:maarit.koponen@uef.fi>) Position 2: Dr. Esa Penttilä (esa.penttila(a)uef.fi<mailto:esa.penttila@uef.fi>) and Prof. Tuija Saresma (tuija.saresma(a)uef.fi<mailto:tuija.saresma@uef.fi>) -- Mikko Laitinen, PhD Professor of English mikko.laitinen(a)uef.fi PI of COMET Weak-tie hypothesis in complex digital networks (Research Council of Finland 2024-28) PI of FIN-CLARIAH Common Language Resources and Technology Infrastructure (2022-2029) https://uefconnect.uef.fi/en/person/mikko.laitinen/ --

1 0

PostDoc position: AI for Science, Saarland University, Germany
by Dietrich Klakow 17 Jun '26

17 Jun '26

PostDoc position: AI for Science Saarland University, Germany =========================================== (Computer Science, Computational Linguistics, Bioinformatics, Physics, Mathematics or similar) In recent years a new field has emerged: AI for Science. Our group has a long and strong tradition in language modelling (several best paper awards at major NLP conferences) and is building on top of this. Our research is done in very close collaboration with the Helmholtz Institute for Pharmaceutical Research as well as the University Hospital, giving us access to exciting unique data. This position is topically open as long as it matches our group and the environment at Saarland University well. Fundamental and applied topics are equally welcome. The ideal candidate for the position would have: 1. Excellent knowledge of machine learning and deep learning 2. Excellent programming skills 3. PhD degree in Computer Science, Computational Linguistics, Bioinformatics, Physics, Mathematics, or similar 4. Successful experience working both independently and as part of a small team Salary: The PhD position will be full time on the German E13 scale (TV-L) which is about 4967€ per month before tax and social security contributions. The appointments will be for two years with a possible extension. Saarland University has a very strong computer science department (two Max Planck institutes, German Research Center for Artificial Intelligence) and is strong in medical/pharmaceutical research (CBI, HIPS, UKS). In both areas it is a leading location in Germany and Europe and therefore perfect for bridging from AI to science. How to apply: Please send us a letter of motivation, your CV, your transcripts, if available a list of publications, and the names and contact information of at least two references, as a single PDF or a link to a PDF if the file size is more than 5 MB. Please apply latest by July 1st, 2026. Contact: Applications and any further inquiries regarding our research should be directed to dietrich.klakow(a)lsv.uni-saarland.de <mailto:dietrich.klakow@lsv.uni-saarland.de>

1 0

2026

2025

2024

2023

2022