- Corpora - ELRA lists

10th Workshop on Online Abuse and Harms (WOAH) @EMNLP: 3rd CFP
by Agostina Calabrese 27 May '26

27 May '26

*** Third Call for Papers *** We invite paper submissions to the 10th Workshop on Online Abuse and Harms (WOAH), which will take place on the 29th of October at EMNLP 2026. Website: https://www.workshopononlineabuse.com/cfp.html Important Dates * Registration deadline for mentorship programme: April 10, 2026 * Notification of mentor/mentee match: April 25, 2026 * Submission due: June 26, 2026 * ARR reviewed submission due: August 3, 2026 * Notification of acceptance: August 15, 2026 * Camera-ready papers due: September 10, 2026 * Workshop: 29th October 2026 Overview Digital technologies have brought significant benefits to society, transforming how people connect, communicate, and interact. However, these same technologies have also enabled the widespread dissemination and amplification of abusive and harmful content, such as hate speech, harassment, and misinformation. Given the sheer volume of content shared online, addressing abuse and harm at scale requires the use of computational tools. Yet, detecting and moderating online abuse remains a complex task, fraught with technical, social, legal, and ethical challenges. The 10th Workshop on Online Abuse and Harms (WOAH) invites paper submissions from a diverse range of fields, including but not limited to natural language processing, machine learning, computational social science, law, political science, psychology, sociology, and cultural studies. We explicitly encourage interdisciplinary research, technical and non-technical contributions, and submissions that focus on under-resourced languages. Non-archival papers and civil society reports are also welcome. Topics covered by WOAH include, but are not limited to: * New models or methods for detecting abusive and harmful online content, including misinformation; * Biases and limitations in existing detection models or datasets for abusive and harmful content, especially those in commercial use; * Development of new datasets and taxonomies for online abuse and harms; * Novel evaluation metrics and procedures for detecting harmful content; * Analyses of the dynamics of online abuse, its propagation, and its impact on different communities; * Social, legal, and ethical considerations in detecting, monitoring, and moderating online abuse. Special Theme: “Ten Years of WOAH: Reflecting on Progress and New Frontiers” In its 10th edition, WOAH highlights the theme “Ten Years of WOAH: Reflecting on Progress and New Frontiers”. Over the past decade, WOAH has become a central interdisciplinary venue for online harms research. As harms and enabling technologies have evolved, the field has moved beyond an early focus on textual hate speech and harassment to address more complex phenomena. Advances in AI and online ecosystems have expanded the scale and diversity of harms. Transformer models, multimodal platforms, and recommendation systems have contributed to the escalation of issues like misinformation, radicalisation, child sexual exploitation, identity-based abuse, algorithmic bias, privacy violations, and AI-mediated harms. Methods tackling this have evolved from monolingual lexicon-based approaches to deep learning, multilinguality, multimodality, interpretability, and interdisciplinarity. Despite this progress, fundamental challenges remain. There is limited consensus on what constitutes “harm”, how context and thresholds should be defined, or how harms vary across cultures and modalities. These ambiguities affect datasets and models, constrain comparability, and often marginalise affected communities. The past decade also calls for critical self-reflection. Research has frequently prioritised detection, high-resource languages, and narrowly defined phenomena over intervention, global perspectives, and systemic or structural harms, with insufficient attention to user agency, platform incentives, lived experience, and participatory approaches. Finally, ten years of work have underscored that interdisciplinarity is essential for addressing the sociotechnical nature of the phenomenon. Addressing future online harms will require deeper integration across NLP, ML, social sciences, law, policy, and HCI. WOAH 10 seeks to consolidate lessons from the past decade, identify enduring gaps, and connect research, practice, and policy to guide the next generation of work on online harms. Submission Submission is electronic, using the Softconf START conference management system. Submission link: https://softconf.com/emnlp2026/woah2026/ The workshop will accept three types of papers. 1) Academic Papers (long and short): Long papers of up to 8 pages, excluding references, and short papers of up to 4 pages, excluding references. Unlimited pages for references and appendices. Accepted papers will be given an additional page of content to address reviewer comments. Previously published papers cannot be accepted. 2) Non-Archival Submissions: Up to 2 pages, excluding references, to summarise and showcase in-progress work and work published elsewhere. 3) Civil Society Reports: Non-archival submissions, with a minimum of 2 pages and no upper limit. Can include work published elsewhere. All submissions must use the official ACL style files<https://github.com/acl-org/acl-style-files>. Submissions that do not conform to the required styles, including paper size, margin width, and font size restrictions, will be rejected without review. All submissions should adhere to the workshop policies https://www.workshopononlineabuse.com/policies.html. WOAH Community We are excited to share the WOAH community Slack channel — a workspace for researchers interested in or working on understanding and addressing online abuse and harms! Join us here: https://join.slack.com/t/hatespeechdet-47d7560/shared_invite/zt-2a8d96j4z-g… Contact Info Please send any questions about the workshop to organizers(a)workshopononlineabuse.com<mailto:organizers@workshopononlineabuse.com> Organisers Agostina Calabrese, Cohere Thomas Davidson, Rutgers University-New Brunswick Christine de Kock, University of Melbourne Urja Khurana, Delft University of Technology Marta Marchiori Manerba, University of Turin Paloma Piot, Universidade da Coruña Zeerak Talat, University of Edinburgh The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336. Is e buidheann carthannais a th’ ann an Oilthigh Dhùn Èideann, clàraichte an Alba, àireamh clàraidh SC005336.

1 0

CALL FOR PARTICIPATION - NLPAICS 2026
by Ranasinghe, Tharindu 27 May '26

27 May '26

Second International Conference on Natural Language Processing and Artificial Intelligence for Cyber Security (NLPAICS 2026) The Second International Conference on Natural Language Processing and Artificial Intelligence for Cyber Security (NLPAICS 2026) invites researchers, practitioners, industry experts, and students to participate in this international forum dedicated to the latest advances in NLP, Artificial Intelligence, and Cyber Security. The will take place in Alicante, Spain, on 11–12 June 2026. The list of accepted papers is published at: https://nlpaics2026.gplsi.es/ . Further details about keynote speakers, and submissions are also available on the official conference website. The conference programme will be announced shortly. Researchers and attendees interested in participating are encouraged to register as soon as possible to secure the reduced rates and join the NLPAICS 2026 community. Early registration must be completed before Monday, 25 May 2026 in order to benefit from the discounted registration fees. Special fee rates are also available for participants attending NLPAICS + Summer School: “The Paradigm Shift: From Rules to Models in Natural Language Processing” (https://summer-school.gplsi.es/) that will be held from 15-17 June 2026 in Alicante. For updates and additional information, please visit the official conference website. Best Regards Organising committee, NLPAICS

1 0

AthNLP2026 – Call for Participation
by ACL Announcements 27 May '26

27 May '26

10 days left to apply for AthNLP 2026 (Sept 2-8, Athens)! CALL FOR PARTICIPATION ATHNLP 2026 - 4TH ATHENS NLP SUMMER SCHOOL =========================================== We invite everyone interested in Natural Language Processing and Machine Learning to participate in the 4th Athens Natural Language Processing Summer School taking place in Athens, Greece at NCSR Demokritos Campus between 2-8 September 2026: https://athnlp.github.io/2026/ Important Dates -------------------------- * Application Deadline: May 31, 2026 * Decision Announcement: June 10, 2026 * Registration: June 30, 2026 * Summer School: September 2-8, 2026 Description ------------------ Following successful AthNLP editions in 2019, 2024, and 2025, AthNLP 2026 returns to the campus of NCSR Demokritos in Athens. The summer school is organized jointly by RC "Athena", NCSR "Demokritos", Athens University of Economics and Business (Department of Informatics), Heriot-Watt University, Archimedes Unit / Athena RC, and ELLIS - University of Manchester, in close collaboration with LxMLS (Lisbon, 20-25 July 2026). The school focuses on Machine Learning methods for NLP, especially Deep Learning and Large Language Models (LLMs), offering: Morning lectures on theory, afternoon hands-on lab sessions, evening research talks, poster sessions, and demos. Our target audience is: * Students and researchers in NLP/Computational Linguistics and Machine Learning; * Computer scientists with interest in NLP and ML; * Industry professionals seeking deeper understanding of these fields. While previous experience with the topics will be helpful, the school assumes no previous knowledge of Natural Language Processing and Machine Learning. The only background assumed is basic mathematics and Python programming. Features of AthNLP: * Attendance at the Social Event, daily lunch as well as morning and afternoon coffee breaks are included in the registration fee. * Lecturers are leading researchers in Machine Learning and NLP. * Students will be able to (optionally) show their current work in poster sessions during coffee breaks. Confirmed Speakers --------------------------------- * Antonis Anastasopoulos, George Mason Computer Science * Isabelle Augenstein, University of Copenhagen * Desmond Elliott, University of Copenhagen * Nizar Habash, NYU Abu Dhabi * Lingpeng Kong, University of Hong Kong * Julia Kreutzer, Cohere * Ryan McDonald * Dong Nguyen, Utrecht University * Anna Rogers, IT University of Copenhagen * Emine Yilmaz, University College London Participation --------------------- To apply, please fill this [1] form: https://ijerm0co.forms.app/athens-nlp-2026-summer-school-final The fees are the following: * 300 EUR for students * 400 EUR for university professors or researchers at a public institute * 500 EUR for everyone else Links: ------ [1] https://ijerm0co.forms.app/athens-nlp-2026-summer-school-final

1 1

Conference anouncement and CfP: Translating and Interpreting in the Era of Algorithms (TIERA)
by VILELMINI SOSONI 27 May '26

27 May '26

Translating and Interpreting in the Era of Algorithms (TIERA) Department of Foreign Languages, Translation and Interpreting Ionian University, Corfu, Greece 9–11 October 2026 As translation and interpreting practices are reshaped by the accelerating advances of artificial intelligence, neural networks, and automation, new questions arise: What remains distinctly human in the act of translation? How can creativity and critical reflection coexist with algorithmic efficiency? Organised by the Department of Foreign Languages, Translation and Interpreting and the MA Science of Translation of the Ionian University, this international conference — Translating and Interpreting in the Era of Algorithms (TIERA)— invites scholars, researchers, professionals, and students to explore the emerging interfaces between technology, translation and interpreting, and language mediation in the 21st century. The event is held within the framework of the 40th anniversary of the Department’s founding, marking four decades of innovation, education, and scholarship in translation and interpreting. 📝 Submit your abstract for papers, posters and panels by 20 June 2026 here: https://conferences.ionio.gr/tiera/en/submission/ 📍 Corfu, Greece 📅 9–11 October 2026 See more: https://conferences.ionio.gr/tiera/en/about/ We can't wait to welcome you to Corfu! Join us in Corfu, 9–11 October 2026, to celebrate this milestone and collectively explore the future of translation and interpreting. Dr Vilelmini Sosoni Associate Professor Associate Head of the Department of Foreign Languages, Translation and Interpreting Director of the Laboratory for Specialised Translation and Language Technologies Ionian University Ippokratis Building, Chr. Tsirigotis Square 49100 Corfu Greece M: +306932623733 Email: Vilelmini(a)hotmail.com<mailto:Vilelmini@hotmail.com>, sosoni(a)ionio.gr<mailto:sosoni@ionio.gr> Skype: vilelmini.sosoni LinkedIn: https://www.linkedin.com/in/vilelmini-sosoni-74000910/

1 0

Second International Conference on Natural Language Processing and Artificial Intelligence for Cyber Security (NLPAICS 2026)
by Amal Haddad 27 May '26

27 May '26

NLPAICS 2026: CALL FOR PARTICIPATION Second International Conference on Natural Language Processing and Artificial Intelligence for Cyber Security (NLPAICS 2026) The Second International Conference on Natural Language Processing and Artificial Intelligence for Cyber Security (NLPAICS 2026) invites researchers, practitioners, industry experts, and students to participate in this international forum dedicated to the latest advances in NLP, Artificial Intelligence, and Cyber Security. The conference will take place in Alicante, Spain, on 11-12 June 2026. The list of accepted papers is published at: https://nlpaics2026.gplsi.es/accepted-papers/ [1]. Further details about keynote speakers, and submissions are also available on the official conference website https://nlpaics2026.gplsi.es/ [2]. The conference programme will be announced shortly. Researchers and attendees interested in participating are encouraged to register as soon as possible to secure the reduced rates and join the NLPAICS 2026 community. Early registration must be completed before Monday, 25 May 2026 in order to benefit from the discounted registration fees. Special fee rates are also available for participants attending NLPAICS + Summer School: "The Paradigm Shift: From Rules to Models in Natural Language Processing" (https://summer-school.gplsi.es/ [3]) that will be held from 15-17 June 2026 in Alicante. -- Amal Haddad Haddad (She/her) Facultad de Traducción e Interpretación Universidad de Granada |https://www.ugr.es/personal/amal-haddad-haddad Lexicon Research Group |http://lexicon.ugr.es/haddad Co-Convenor, BAAL SIG 'Humans, Machines, Language'|https://r.jyu.fi/humala Event Coordinator, BAAL SIG 'Language, Learning and Teaching' =============== Cláusula de Confidencialidad: "Este mensaje se dirige exclusivamente a su destinatario y puede contener información privilegiada o confidencial. Si no es Ud. el destinatario indicado, queda notificado de que la utilización, divulgación o copia sin autorización está prohibida en virtud de la legislación vigente. Si ha recibido este mensaje por error, se ruega lo comunique inmediatamente por esta misma vía y proceda a su destrucción. This message is intended exclusively for its addressee and may contain information that is CONFIDENTIAL and protected by professional privilege. If you are not the intended recipient you are hereby notified that any dissemination, copy or disclosure of this communication is strictly prohibited by law. If this message has been received in error, please immediately notify us via e-mail and delete it" =============== Links: ------ [1] https://urldefense.com/v3/__https://nlpaics2026.gplsi.es/accepted-papers/__… [2] https://urldefense.com/v3/__https://nlpaics2026.gplsi.es/__;!!D9dNQwwGXtA!S… [3] https://urldefense.com/v3/__https://summer-school.gplsi.es/__;!!D9dNQwwGXtA…

1 0

Relaxed CfP: Mechanistic Interpretability & Neuro-symbolic Approaches by-design: the MeMo Workshop
by fabio.massimo.zanzotto＠uniroma2.it 27 May '26

27 May '26

Are you stressfully working for the May 25th ARR deadline? Don't worry, you can relax afterwards by writing a short paper on work in progress and blue sky ideas over Mechanistic Interpretability and Neuro-symbolic Approaches by-design ( <https://sites.google.com/view/memo-workshop/> MeMo Workshop) (deadline: 31/5/2026 12:00 PM AoE) Are you surprised about the ability of LLMs to perform complex reasoning tasks? Do you have a particular keenness for cracking the code of their inner workings? If the answer to the three questions is yes, this is the workshop for you! Observing properties and understanding the inner workings are two sides of the same coin. Another way is possible (e.g., <https://aclanthology.org/2025.findings-acl.785/> Position Paper: MeMo: Towards Language Models with Associative Memory Mechanisms - ACL Anthology)! The workshop aims to accept focused contributions on: * Mechanistic Interpretability by-design * Neuro-symbolic approaches by-design Work in progress and blue sky ideas are in the spirit of the workshop. Your contribution must be between four pages and six pages, excluding references, and should be prepared using the <https://www.overleaf.com/read/gyxkgssbstfr#85e729> Overleaf Submission Template. The proceedings will be submitted to <http://ceur-ws.org/> CEUR-WS.org for online publication. Please submit your contribution via EasyChair at the designated submission <https://easychair.org/conferences/?conf=memomina2026> link. The reviewing process is single-blind. Submission Deadline: 31/5/2026 12:00 PM AoE Acceptance Notification Date: 14/6/2026 Camera ready: 21/6/2026 Workshop Date: 26/6/2026 (MODIFIED, so that you can plan your WE in Rome) Venue: University of Rome Tor Vergata (Rome, Italy) & Online Registration fees: cheaper than cheap (it's free) Web Site: <https://sites.google.com/view/memo-workshop/> MeMo Workshop

1 0

First CFP: The Second Tokenization Workshop @ COLM 2026
by Jindrich Libovicky 27 May '26

27 May '26

*First Call for Papers* TokShop: Second Tokenization Workshop (COLM 2026) https://tokenization-workshop.github.io **Important days** - Deadline for submissions is June 23, 2026, at 11:59 pm (anywhere on earth) - Notifications of acceptance will be sent out on July 24, 2026 - Camera-ready papers will be due shortly afterward at 11:59 pm (anywhere on earth) The workshop will take place at the Hilton Union Square in San Francisco, CA, USA on October 9, 2026. ***Workshop Description*** The Second Tokenization Workshop (TokShop) at COLM 2026 aims to bring together researchers and practitioners from across machine learning to explore tokenization in its broadest sense. We will discuss innovations, challenges, and future directions for tokenization across diverse data types and modalities. ***Call for Papers*** Topics of interest include: - Subword Tokenization in NLP: Analysis of techniques such as BPE, WordPiece, and UnigramLM, as well as improvements for efficiency, interpretability, and adaptability. - Multimodal Tokenization: Tokenization strategies for images, audio, video, and other modalities, including methods to align representations across different types of data. - Multilingual Tokenization: Development of tokenizers that work robustly across languages and scripts, and investigation into failure modes tied to tokenization. - Tokenizer Modification Post-Training: Methods for updating tokenizers after model training to boost performance and/or efficiency without retraining from scratch. - Alternative Input Representations: Exploration of non-traditional tokenization approaches, such as byte-level, pixel-level, or patch-based representations. - Statistical Perspectives on Tokenization: Empirical analysis of token distributions, compression properties, and correlations with model behavior. By broadening the scope of tokenization research beyond language, this workshop seeks to foster cross-disciplinary dialogue and inspire new advances at the intersection of representation learning, data efficiency, and model design. ***Submission Guidelines*** Our author guidelines follow the COLM requirements unless otherwise specified. - Paper submission is hosted on OpenReview: https://openreview.net/group?id=colmweb.org/COLM/2026/Workshop/TokShop#tab-… - We accept non-archival submissions of two types: - Research papers (up to 9 pages, not including references or appendices) - Extended abstracts (up to 2 pages) - Please use the provided LaTeX template (Style Files) for your submission. Please follow the general paper formatting guidelines for COLM, as specified in the style files. - You may use as many pages of references and appendix as you wish, but reviewers are not required to read the appendix. - Posting papers on preprint servers like ArXiv is permitted. - We encourage each submission to discuss the limitations as well as ethical and societal implications of their work, wherever applicable (but neither are required). These sections do not count towards the page limit. - The paper should be anonymized and uploaded to OpenReview as a single PDF. - The review process will be double-blind. Read more: https://tokenization-workshop.github.io/

1 0

2nd CfP for EMNLP Workshop on Multimodal Interaction in Face-to-Face Dialogue (MINT)
by Takmaz, E.K. (Ece) 27 May '26

27 May '26

Second CfP for EMNLP Workshop on Multimodal Interaction in Face-to-Face Dialogue (MINT) We invite submissions to MINT: Multimodal Interaction in Face-to-Face Dialogue, a workshop that brings together researchers from computational linguistics, NLP, computer vision, HCI, robotics, and cognitive science working on multimodal face-to-face communication. Workshop website: https://mintworkshop.github.io/2026/ The Workshop will be co-located with EMNLP 2026 in Budapest, Hungary, October 24–29, 2026 (exact date within this period to be decided). We welcome work on topics including: - computational models that integrate verbal and non-verbal cues such as speech, text, gesture, facial expression, gaze, and body pose; - cognitive and linguistic insights about face-to-face communication that can inform AI systems; - multimodal datasets with synchronized speech, video, and motion data; - evaluation methods for multimodal interaction; - applications and tools for embodied conversational agents, social robots, annotation, and behavioural analysis. Papers should be prepared using the official ACL formatting guidelines and ACL style files. MINT welcomes both archival and non-archival papers: - Archival papers: Submissions must be anonymous and report original, unpublished research to appear in the workshop proceedings. - Non-archival papers: Submissions reporting previously published work, preliminary research, or demos to be presented at the workshop and not published in the MINT proceedings. Papers may be submitted as long papers (up to 8 pages plus references) or short papers (up to 4 pages plus references). Non-archival submissions do not need to be anonymous. We allow cross-submissions to other venues. However, to be included in the proceedings, authors of accepted papers must withdraw them from any other venue where they remain under consideration. MINT will accept submissions through two channels: 1. Direct submission: The dedicated OpenReview portal for this is available at https://openreview.net/group?id=EMNLP/2026/Workshop/MINT. Archival papers submitted through this channel will be reviewed by the MINT programme committee. 2. ACL Rolling Review (ARR): Authors may submit through ARR and commit their paper together with the ARR reviews to MINT later at https://openreview.net/group?id=EMNLP/2026/Workshop/MINT_ARR_Commitment **Important dates (11:59 pm AOE)** - ARR paper submission deadline: May 25, 2026 - Direct paper submission deadline: July 8, 2026 - Pre-reviewed ARR commitment deadline: August 24, 2026 - Notification of acceptance: August 31, 2026 - Camera-ready paper due: September 14, 2026 Accepted contributions will be required to be presented at the MINT workshop as posters or talks. The MINT workshop is sponsored by the Max Planck Institute for Psycholinguistics: https://www.mpi.nl/ For questions, please contact: mint.organizers(a)gmail.com On behalf of the workshop organisers: - Raquel Fernández (University of Amsterdam) - Diego Frassinelli (LMU Munich) - Esam Ghaleb (Max Planck Institute for Psycholinguistics) - Bulat Khaertdinov (Maastricht University) - Asli Ozyurek (Max Planck Institute for Psycholinguistics / Radboud University) - Ece Takmaz (Utrecht University) - Zerrin Yumak (Utrecht University)

1 0

ESU in Digital Humanities 2026 – New Scholarship & Deadline Extension
by Elisabeth Burr 27 May '26

27 May '26

Dear colleagues, We are happy to announcea new scholarship opportunity for MA students <https://esudh.github.io/ScholarshipsandFunding/#graduate-school-translation…>coming to European Summer University in Digital Humanities from African and European universities. Ten scholarships will cover the participation fee and are offered by the newly established graduate programme “Translation” at the Université Marie et Louis Pasteur in Besançon. To give students time for their applications, we are extending the deadline to May 31 2026. Apply for school and scholarships via ConfTool <https://www.conftool.org/esudh2026/>. Please find more about the workshop offer and application process on our website <https://esudh.github.io/WorkshopsandLectures/>. ESU DH will be held at the Université Marie et Louis Pasteur in Besançon, France, from July 6 to July 18. We look forward to welcoming you in Besançon. On behalf of Prof. Frederic Spagnoli, Head of the ESU 2026, Dr. Artjoms Šeļa, Chair of the ESU Steering Committee

1 0

Online courses on Deep Learning, LLMs and Speech by HiTZ Chair. Registration is now open!!!
by Eneko Agirre 27 May '26

27 May '26

HiTZ Chair of Artificial Intelligence and Language Technology It is a pleasure to inform you that *registration* for the online training courses organized by the HiTZ center is now open. The courses, offered in June and July, include: *_Deep Learning for NLP (code: DL4NLP)_ *June 01th to 05th, 20 hours (2 ECTS). 15th edition. * _Large Language Models (code: LLMS)_ *June 15th to 19th, 20 hours (2 ECTS). 15th edition. * _Generative Playground: LLMs made easy (code: GPLLMME) _ *June 29th to July 03th, 20 hours (2 ECTS). 3th edition. * * *_Deep Learning for Speech Processing (code: DL4SP)_* July 13th to 16th, 10 hours (1 ECTS). 1st edition. For further information and registration <https://www.hitz.eus/training/>: https://www.hitz.eus/training/ Best regards, Olatz Arregi -------------------- HiTZ Chair of Artificial Intelligence and Language Technology -- Eneko Agirre HiTZ Hizkuntza Teknologiako Zentroa - Ixa Taldea Centro Vasco de Tecnología de la Lengua - Grupo Ixa Basque Center for Language Technology - Ixa NLP Group University of the Basque Country (UPV/EHU) hitz.ehu.eus/eneko <https://hitz.ehu.eus/eneko>

1 0

2026

2025

2024

2023

2022

Corpora