- Corpora - ELRA lists

1 PhD position in discourse studies at the University of Lisbon
by Amália Mendes 16 Apr '25

16 Apr '25

The Centre of Linguistics at the University of Lisbon (School of Arts and Humanities) is opening 1 PhD position in Discourse Studies. Research focus: questions in monologic discourse, at the crossroad between pragmatics, textual coherence, argumentation, and corpus linguistics We welcome applicants with a Master’s degree (MA) in Linguistics, languages, or related areas. Deadline: 29 April 2025 Email for application: clul(a)letras.ulisboa.pt<mailto:clul@letras.ulisboa.pt> For more details: https://euraxess.ec.europa.eu/jobs/334941 For any questions about the position, feel free to contact us at clul(a)letras.ulisboa.pt<mailto:clul@letras.ulisboa.pt>

1 0

Assistant Professorship with Tenure Track at TU Wien (Deadline May 22, 2025)
by Pia Pachinger 16 Apr '25

16 Apr '25

1 0

CALL FOR PAPERS: Special issue on Machine Translation for Low-Resource Languages@Language Resources and Evaluation Journal
by Atul K. Ojha 16 Apr '25

16 Apr '25

Apologies for cross-posting. --------------------------------------------------------------------------- *CALL FOR PAPERS: Language Resources and Evaluation Journal- Special Issue on Machine Translation for Low-Resource Languages* https://link.springer.com/collections/gbdgacbgbg *Guest Editors:* - Atul Kr. Ojha (Insight Research Ireland Centre for Data Analytics, DSI, University of Galway, Ireland) - Chao-Hong Liu (Industrial Technology Research Institute, Potamu Research Ltd.) - Ekaterina Vylomova (University of Melbourne, Australia) - Flammie Pirinen (UiT The Arctic University of Norway, Tromsø) - Jonathan Washington (Swarthmore College, USA) - Nathaniel Oco (De La Salle University, Philippines) - Xiaobing Zhao (Minzu University of China) Machine translation (MT) technologies have been improved significantly in the last decade using neural MT (NMT) approaches. However, most of these methods rely on the availability of large parallel data for training the MT systems, resources which are not available for the majority of language pairs. Hence, current technologies often fall short in their ability to be applied to low-resource languages. Developing MT technologies using relatively small corpora still presents a major challenge for the MT community. In addition, many methods for developing MT systems still rely on several natural language processing (NLP) tools to pre-process texts in source languages and post-process MT outputs in target languages. The performance of these tools often has a great impact on the quality of the resulting translation. The availability of MT technologies and NLP tools can facilitate equal access to information for the speakers of a language and determine on which side of the digital divide they will end up. The lack of these technologies for many of the world's languages provides opportunities both for the field to grow and for making tools available for speakers of low-resource languages. In the past few years, several workshops and evaluations have been organized to promote research on low-resource languages. NIST has been conducting Low Resource Human Language Technology evaluations (LoReHLT) annually from 2016 to 2019. In LoReHLT evaluations, there is no training data in the evaluation language. Participants receive training data in related languages but need to bootstrap systems in the surprise evaluation language at the start of the evaluation. Methods for this include pivoting approaches and taking advantage of linguistic universals. The evaluations are supported by DARPA's Low Resource Languages for Emergent Incidents (LORELEI) program, which seeks to advance technologies that are less dependent on large data resources and that can be quickly pivoted to new languages within a very short amount of time so that information from any language can be extracted in a timely manner to provide situation awareness to emergent incidents. There are also the Workshop on Technologies for MT of Low-Resource Languages (LoResMT), Special Interest Group on Under-resourced Languages (SIGUL), Workshop on Resources and Technologies for Indigenous, Endangered and Lesser-resourced Languages in Eurasia (EURALI), the Workshop on Deep Learning Approaches for Low-Resource Natural Language Processing (DeepLo). AfricaNLP, TurkLang, Conference on Machine Translation (WMT), and International Conference on Spoken Language Translation (IWSLT) workshop, which provide a venue for sharing research and working on research and development in this field. This topical collection solicits original research papers on MT systems/methods and related NLP tools for low-resource languages in general. LoReHLT, LORELEI, LoResMT, SIGUL, EURALI, DeepLo, WMT, and IWSLT participants are very welcome to submit their work to the special issue. Summary papers on MT research for specific low-resource languages, as well as extended versions (>40% difference) of published papers from relevant conferences/workshops, are also welcome. Topics of the special issue include, but are not limited to: * Research and review papers on MT systems/methods for low-resource languages * Research and review papers on pre-processing and/or post-processing NLP tools for MT * Word tokenizers/de-tokenizers for low-resource languages * Word/morpheme segmenters for low-resource languages * Use of morphological analyzers and/or morpheme segmenters in MT * Multilingual/cross-lingual NLP tools for MT * Review of available corpora of low-resource languages for MT * Pivot MT for low-resource languages * Zero-shot MT for low-resource languages * Fast building of MT systems for low-resource languages * Re-usability of existing MT systems and/or NLP tools for low-resource languages * Machine translation for language preservation * Techniques that work across many languages and modalities * Techniques that are less dependent on large data resources * Use of language-universal resources * Bootstrap-trained resources for the short development cycle * Entity, relation- and event-extraction * Sentiment detection in MT * MT Summarisation * Processing diverse languages, genres (news, social media, etc.) and modalities (text, speech, video, etc.) * Speech Translation for low-resource languages * Multimodal MT for low-resource languages * MT models using LLMs for low-resource languages * Generative AI models for low-resource languages * Evaluation metrics and datasets for low-resource languages For further information on this initiative, please refer to https://link.springer.com/collections/gbdgacbgbg *IMPORTANT DATES* May 26, 2025: Expression of interest (EOI) via this form: https://forms.gle/QqeqxZgGfsxP6rZ77 August 26, 2025: Paper submission deadline December 05, 2025: Revised papers due March 2026: Publication * SUBMISSION GUIDELINES* Authors should follow the "Instructions for Authors <https://link.springer.com/journal/10579/submission-guidelines> ( https://link.springer.com/journal/10579/submission-guidelines)" on the LRE journal website <https://link.springer.com/journal/10579>. Thanks, Atul

1 0

First call for papers DHASA Conference 2025
by Menno Van Zaanen 16 Apr '25

16 Apr '25

First call for papers DHASA Conference 2025 https://dh2025.digitalhumanities.org.za Theme: The role of humanities in digital humanities and artificial intelligence The Digital Humanities Association of Southern Africa (DHASA) is pleased to announce its fifth conference, focusing on the theme The role of humanities in digital humanities and artificial intelligence. In a region where the field of Digital Humanities is still relatively underdeveloped, this conference aims to address this gap and foster growth and collaboration in the field. The conference offers an opportunity for researchers interested in showcasing their work in the broad field of Digital Humanities to come together. By doing so, the conference provides a comprehensive overview of the current state-of- the-art in Digital Humanities, particularly within the Southern Africa region. As such, we welcome submissions related to Digital Humanities research conducted by individuals from Southern Africa or research focused on the geographical area of Southern Africa in the broad sense. Furthermore, the conference serves as a platform for information sharing and networking among researchers passionate about Digital Humanities. By bringing together experts working on Digital Humanities in Southern Africa or with a focus on Southern Africa, we aim to promote collaboration and facilitate further research in this dynamic field. In addition to the main conference, affiliated workshops and tutorials will be organised, providing researchers with valuable insights into novel technologies and tools. These supplementary events are designed for researchers interested in specific aspects of Digital Humanities or seeking practical information to enter or advance their knowledge in the field. The DHASA conference welcomes interdisciplinary contributions from researchers in various domains of Digital Humanities, including, but not limited to, language, literature, visual art, performance and theatre studies, media studies, music, history, sociology, psychology, language technologies, library studies, philosophy, methodologies, software and computation, AI, and more. Our goal is to cultivate an inclusive scientific community of practice within Digital Humanities. Suggested topics include the following: * The role of AI in digital humanities, the role of Digital Humanities in shaping AI, and the broader role of the humanities in both AI and DH projects; * Digital archives and the preservation of marginalised voices; * Intersectionality and the digital humanities: exploring the intersections of race, gender, sexuality, culture, and class in digital research and activism; * Activism and social change through digital media: how digital humanities tools and methodologies can be used to promote inclusion; * Engaging marginalised communities in the creation and use of digital tools, resources, and AI; * Exploring the role of digital humanities in decolonising knowledge and promoting indigenous perspectives; * The ethics of data collection and analysis in digital humanities and AI research; * The role of digital humanities and AI in promoting inclusive and equitable pedagogy; * Digital humanities and inclusion in the context of African and global perspectives and international collaborations; * Critical approaches to digital humanities and inclusion: examining the limitations and possibilities of digital tools and methodologies in promoting inclusion; and * Collaborative digital humanities projects with non-profit organisations, community groups, and cultural institutions; * Development of digital and AI tools for supporting digital humanities; * Novel utilisation of digital and AI tools for performing digital humanities research; * The role of digital humanities in the classroom: reimagining literacy and AI fluency * Digital humanities data and project management; * The role of librarians in the digital humanities project; * Any other digital humanities-related topic that serves the Southern African community. Submission Guidelines The DHASA conference 2025 asks for three types of submissions: * Long papers: Authors may submit long papers with a maximum of 8 content pages and unlimited pages for references and appendices. The final versions of accepted long papers will be granted an additional page (leading to a total of up to 9 content pages) to incorporate reviewers' comments. Long papers accepted for the conference will be presented in 30-minute time slots (which includes 10 minutes for questions). * Short papers: Authors may submit short papers with a maximum of 5 content pages and unlimited pages for references and appendices. The final versions of accepted short papers will be allowed an extra page (leading to a total of up to 6 content pages) to accommodate reviewers' comments. Short papers accepted for the conference will be presented in 15-minute time slots (which includes 5 minutes for questions). * Executive summaries: Authors can submit an executive summary for work in progress, limited to 1 page. Executive summaries accepted for the conference will be presented as posters during a dedicated poster presentation slot. All accepted long and short paper submissions that are presented at the conference will be published in the JDHASA journal, see https://upjournals.up.ac.za/index.php/dhasa. In addition, the executive summaries for the poster presentations will be published in a book of executive summaries before the conference. We particularly encourage student submissions where the first author is a student. All submissions should adhere to the ACL style guide: https://acl-org.github.io/ACLPUB/formatting.html Submissions should be submitted in PDF format. Submissions that do not adhere to the prescribed style guide will be rejected. Follow this link to go to the submission platform: https://dh2025.digitalhumanities.org.za/submission/ Authors are encouraged to upload their datasets to the SADiLaR repository: https://repo.sadilar.org/. In case of difficulties uploading the datasets, please reach out to Benito Trollip (benito.trollip(a)nwu.ac.za). Important dates Submission deadline: 14 July 2025 Date of notification: 16 September 2025 Camera-ready copy deadline: 24 October 2025 Conference: 10 November 2025 – 14 November 2025 Conference venue: CSIR ICC, Pretoria, South Africa Co-located events Several co-located events are currently being prepared, including workshops and tutorials. These will be updated on the conference website. Organising Committee Aby Louw, Council for Scientific and Industrial Research Andiswa Bukula, South African Centre for Digital Language Resources Avi Moodley, Council for Scientific and Industrial Research Franco Mak, Council for Scientific and Industrial Research Franziska Pannach, Rijksuniversiteit Groningen Ilana Wilken, Council for Scientific and Industrial Research Johannes Sibeko, Nelson Mandela University Juan Steyn, South African Centre for Digital Language Resources Laurette Marais, Council for Scientific and Industrial Research Marissa Griesel, South African Centre for Digital Language Resources Menno van Zaanen, South African Centre for Digital Language Resources Privolin Naidoo, Council for Scientific and Industrial Research Sthembiso Mkhwanazi, Council for Scientific and Industrial Research -- Prof Menno van Zaanen menno.vanzaanen(a)nwu.ac.za Professor in Digital Humanities South African Centre for Digital Language Resources https://www.sadilar.org ________________________________ NWU PRIVACY STATEMENT: http://www.nwu.ac.za/it/gov-man/disclaimer.html DISCLAIMER: This e-mail message and attachments thereto are intended solely for the recipient(s) and may contain confidential and privileged information. Any unauthorised review, use, disclosure, or distribution is prohibited. If you have received the e-mail by mistake, please contact the sender or reply e-mail and delete the e-mail and its attachments (where appropriate) from your system. ________________________________

1 0

CIRCE Online Seminar | 28/04/2025 | Dr. Alice Henderson | "Ten things everybody should know about (spoken) language"
by Claudia Soria 16 Apr '25

16 Apr '25

*🎓 *We are happy to remind you about the next webinar in the CIRCE online seminar series, organized by the CIRCE <https://www.circe-project.eu/> project in collaboration with DFCLAM University of Siena <https://www.dfclam.unisi.it/it>, H2IOSC <https://www.h2iosc.cnr.it/> project and CNR-ILC. *Speaker*: _Alice Henderson_ (Université de Grenoble Alpes, France) *Title*: Learning to listen: Coping with spoken variation in the workplace *Date*: Monday, April 28, 2025 - 16:30 CET *Venue*: Online Attendees: Secondary school teachers, researchers, language instructors *Summary*: The university workplace is representative of international-ised/-ising workplaces in general, where different communities, languages, and cultures coexist. Staff encounter their colleagues’ and students’ accents – of Italian, of English, and in my case, French - and sometimes the result is that communication can be quite hard work. Even with the best intentions, sometimes we just cannot understand a speaker. However, when we think about spoken interactions, we have to accept that it is not just about how the speaker produces a language; the actions and skills of listeners should also be addressed. This flip or change of perspective begs two questions: can we, as listeners, learn to cope better with spoken variation? And if so, how? In this talk I’ll summarize speech research findings about how listeners can improve their ability to adapt to new speakers and new accents. I’ll look at listener accommodation and accentism, as well as the conceptual trio of accentedness, comprehensibility and intelligibility. I’ll describe concrete ways to prepare listeners to cope with accented speech, with a primary focus on listeners instead of speakers. Examples will come mainly from my work with non-academic staff at a large, French public university; my 1-hour format for listener training can be reused in other professional contexts. If possible, I’ll also describe the next steps in this work, as I prepare to continue training previous workshop participants as part of a longitudinal study. *Bio*: Alice Henderson is a Professor at Université Grenoble - Alpes, France where she teaches English for Specific Purposes to Science & Technology students. She taught English phonetics and phonology for 24 years and has been involved in training teachers in France, Norway, Poland, and Spain. In 2009 she initiated the international bi-annual conference English Pronunciation: Issues & Practices. Her research interests include English pronunciation teaching and learning, the perception of foreign-accented speech, and English Medium Instruction (EMI). Much of her research has focused on speakers, but she is also intrigued by listeners’ roles, from an intercultural and sociolinguistic perspective. Upcoming webinars: - Ana Tankosic, /Intersectionality in translingual spaces: Migrant experiences from ‘down-under’/ (Monday, May 12, 2025) - Giuliana Regnoli, /Unveiling linguistic bias: Approaches to accent perception and discrimination/ (Monday, May 26, 2025) - Clara Molina, Unlearning Accentism: Action Research and Critical Pedagogies (Monday, June 30, 2025 The seminar is free of charge, but participants must register. To access this and next events, you should create an account on theH2IOSC Training Environment <https://h2iosc-training-platform.ilc4clarin.ilc.cnr.it/registration>. Once logged in with your credentials, choose the course “Language and Accent Discrimination - Online Seminar Series” and activate it with the code PbK837GtE. Make sure to have the Teams platform installed. The registrations of the previous CIRCE Seminars are also available on the H2IOSC Training Environment <https://h2iosc-training-platform.ilc4clarin.ilc.cnr.it/>. For any inquiry, write to contact(a)circe-project.eu <mailto: contact(a)circe-project.eu>.

1 1

Second Call for Special Issue Proposals - Journal Natural Language Processing
by Ranasinghe, Tharindu 16 Apr '25

16 Apr '25

Journal Natural Language Processing (formerly Journal of Natural Language Engineering) *** Second Call for Special Issue Proposals *** In recent years the area of Natural Language Processing (NLP) has enjoyed unprecedented developments since the emergence of Deep Learning and, lately, Large Language Models. At the same time, NLP is following the trend of many other areas in becoming highly specialised, with a number of application-orientated and narrow-domain topics emerging or growing in importance. These developments, often coinciding with a lack of related literature, necessitate and warrant the publication of specialised volumes focusing on a specific topic of interest to the NLP research community. The Journal Natural Language Processing (formerly Journal of Natural Language Engineering), which features six 160-page issues per year and has had its impact factor increase yearly, invites proposals for special issues on a competitive basis covering any topics in applied NLP which have emerged as important recent developments and have attracted the attention of a number of researchers. The Journal Calls for Proposals for special issues have resulted in high-quality outputs and this year we look forward to another successful competition. Proposals on topics covering a variety of methods, tasks, resources and applications from Natural Language Processing, Computational Linguistics, Speech and Language Processing, Text Analytics and related areas are eligible. Special issues on timely NLP topics such as latest language models including Large Language Models/Generative AI, are welcome. Special issue proposals may be based on a successful workshop or a body of work associated with a particular group or section of the community. In the case of papers previously submitted to workshops, the Guest Editors will not be able to re-use previous workshop reviews. In addition, the call for papers of the accepted proposals must be open to all interested parties and all authors will be given equal treatment; in the case of proposals based on previous workshops, submissions cannot be limited to workshop participants only. Prospective proposers are also encouraged to consult the successful Journal columns "Industry Watch" and "Emerging Trends" for additional inspiration. Interested parties have the option of preliminary feedback by emailing expressions of interest accompanied by a brief description of the intended special issue to the Executive Editor (Ruslan.Mitkov(a)ua.es). He will give a brief indication of whether the topic is appropriate to the Journal. In the case of initial positive feedback, the prospective Guest Editors will be asked to submit a proposal for a special issue that will be reviewed by the Editors of the Journal and by other members of the Journal Editorial Board. The proposal for a special issue should include a brief outline of the field and rationale as to why it is important to launch a special issue on the particular topic of interest at the current time. It should include a relevant literature survey (related previous special issues, volumes, workshop and conference proceedings) and should explain the added value of the proposed special issue against the background of other relevant or competing publications and volumes (if applicable). It is desirable that evidence for the estimate of expected submissions to the special issue be provided and justified. The proposals should also include a tentative Guest Editorial Board. It is desirable that at least one (preferably two) of the members of the Guest Editorial Board is on the Editorial Board of the Journal Natural Language Processing. The proposal should also include a tentative time-scale for the production of the special issue (the time-scale committed to in the proposal should be adhered to, if the proposal is accepted), and information about the prospective Guest Editors such as relevant experience, publications etc. Time-scale - Deadline for submission of special issue proposals: 28 April 2025 (proposals to be emailed to Ruslan.Mitkov(a)ua.es with a copy to NLP(a)cambridge.org) - Notification of acceptance/rejection: 19 May 2025 - Calls for papers related to the successful proposals (at least 2 calls are recommended): 7 June 2025 first call July-September 2025 second (and third call, if applicable) Once the special issue is approved and launched, Guest Editors are expected to adhere to the same reviewing and acceptance standards as regular issues of the Journal. In particular, each submission needs to be reviewed by three members of the Guest Editorial Board or other experts in the field. To ensure geographical diversity and balance, and to avoid over-reliance on the same reviewers, each submission must not be reviewed by three experts from the same country, and no single reviewer should evaluate more than two submissions. If the Executive Editor is not satisfied with the review process for a special issue paper, he may either reject the paper or send it for additional review. As a last resort, the Executive Editor has the discretion to reject the entire special issue if the reviewing practices are found to be flawed. All special issues are required to include a survey of the field (at least 15 pages) as its first article, which can be written either by the Guest Editors or experts in the field commissioned by the Guest Editors. This is in addition to a 1-2 page preface by the Guest Editors. Best Regards Dr Tharindu Ranasinghe | Lecturer in Security and Protection Science School of Computing and Communications | Lancaster University Contact me on Teams<https://teams.microsoft.com/l/chat/0/0?users=t.ranasinghe@lancaster.ac.uk> www.lancaster.ac.uk<https://www.lancaster.ac.uk/>

1 0

Call for participation: Automatic Detection of Borrowings (ADoBo 2025)
by ELENA ALVAREZ MELLADO 16 Apr '25

16 Apr '25

We are pleased to announce the second edition of ADoBo, the shared task on automatic detection of borrowings (automatically retrieving words from one language that are incorporated into another language.) For ADoBo 2025 we propose a shared task on retrieving anglicisms from Spanish text, i.e. words borrowed specifically from English that have recently been imported into the Spanish language (words like "running", "smartwatch", "influencer" or "youtuber"). The task will run from April 2025 to June 2025 and is part of IberLEF 2025, which will take place in September 2025 in Zaragoza, Spain. WEBSITE https://adobo-task.github.io/ TIMELINE April 21: Dev set released. May 6: Test set released. May 19: Systems output submissions. May 26: Results posted and Test set with GS annotations released. June 2: Working notes paper submission. June 16: Notification of acceptance (peer-reviews). June 23: Camera ready paper submission. September: ADoBo results to be presented at IberLEF 2025. ORGANIZATION COMMITTEE Elena Álvarez-Mellado, Universidad Nacional de Educación a Distancia (UNED). Julio Gonzalo, Universidad Nacional de Educación a Distancia (UNED). Constantine Lignos, Brandeis University. Jordi Porta-Zamorano, Universidad Autónoma de Madrid (UAM). AVISO LEGAL. Este mensaje puede contener información reservada y confidencial. Si usted no es el destinatario no está autorizado a copiar, reproducir o distribuir este mensaje ni su contenido. Si ha recibido este mensaje por error, le rogamos que lo notifique al remitente. Le informamos de que sus datos personales, que puedan constar en este mensaje, serán tratados en calidad de responsable de tratamiento por la UNIVERSIDAD NACIONAL DE EDUCACIÓN A DISTANCIA (UNED) c/ Bravo Murillo, 38, 28015-MADRID-, con la finalidad de mantener el contacto con usted. La base jurídica que legitima este tratamiento, será su consentimiento, el interés legítimo o la necesidad para gestionar una relación contractual o similar. En cualquier momento podrá ejercer sus derechos de acceso, rectificación, supresión, oposición, limitación al tratamiento o portabilidad de los datos, ante la UNED, Oficina de Protección de datos<https://www.uned.es/dpj>, o a través de la Sede electrónica<https://sede.uned.es/> de la Universidad. Para más información visite nuestra Política de Privacidad<https://descargas.uned.es/publico/pdf/Politica_privacidad_UNED.pdf>.

1 0

Lectures on Computational Linguistics 2025- Italian Association of Computational Linguistics, Milan 18-20 Jnee
by Elisabetta Jezek 16 Apr '25

16 Apr '25

We are pleased to announce that the 2025 edition of the *Lectures on Computational Linguistics*, a series of lectures dedicated to central topics in the field of Computational Linguistics and Natural Language Processing, will be held in *Milan* from *18 to 20 June*. The programme and all information are available on the Lectures website <https://www.ai-lc.it/lectures/lectures-2025/> The 2025 edition is organized by the *Italian Association of Computational Linguistics*/*Associazione Italiana di Linguistica Computazionale* (*AILC*) with the Department of Linguistic Sciences and Foreign Literatures at *Università Cattolica del Sacro Cuore*, Milan, and the Department of Informatics, Systems, and Communication at the *University of Milano-Bicocca*. The interdisciplinary nature of the school crosses several areas, particularly the Humanities, Computer Science and Artificial Intelligence. The program includes tutorials, labs, evening lectures and two student presentation sessions. The 2025 edition is focused on *Linguistic Linked Open Data*, *Neuro-symbolic AI* and *their interplay*. Programme June 18th 2025: Linguistic Linked Open Data • 9:00– 9:30: Welcome and opening • 9:30 – 11:30: *Tutorial 1 – Fundamentals of Linguistic Linked Open Data, Jorge Gracia, University of Zaragoza* • 11:30 – 12:00: BREAK • 12:00 – 13:30: Student session • 13:30 – 15:00: LUNCH • 15:00 – 17:00: *Tutorial 2 – Advanced Topics of Linguistic Linked Open Data, Max Ionov, University of Cologne* • 17:00 – 17:30: BREAK • 17:30 – 18:30: *Evening lecture – LiLa Now: Lessons Learned and the Road Ahead for a Large-Scale Linguistic Linked Data Project, Francesco Mambrini, Università Cattolica del Sacro Cuore, Milan* • 19:30: Welcome drink June 19th 2025: Neuro-symbolic AI • 9:00 – 11:00: *Tutorial 3 –Beyond Naive RAG: How Entities and Graphs Enhance Retrieval-Augmented Generation, Matteo Palmonari, University of Milano-Bicocca* • 11:00 – 11:30: BREAK • 11:30 – 13:30: *Lab. 1 (part 1) – Retrieval Augmented Generation, Large Language Models and Knowledge Bases, Marco Cremaschi, University of Milano-Bicocca* • 13:30 – 15:00: LUNCH • 15:00 – 17:00: *Lab. 1 (part 2) – Evaluating Large Language Models for Linguistic Linked Data Generation, Blerina Spahiu, University of Milano-Bicocca* • 17:00 – 17:30: BREAK • 17:30 – 18:30: *Evening lecture – Memorization or Generalization? Exploring Transformer-based Large Language Models and, possibly, novel approaches, Fabio Massimo Zanzotto, University of Roma Tor Vergata* June 20th 2025: Resources and Evaluation • 9:00 – 11:00: *Tutorial 4 – From Corpora to Capabilities: Rethinking Language Resources in the LLM Era, Zheng Yuan, University of Sheffield* • 11:00 – 11:30: BREAK • 11:30 – 13:00: Student session • 13:00 – 14:00: LUNCH • 14:00 – 16:00: *Lab 2 – The LiLa Knowledge Base: Hands-on Session, Eleonora Litta and Federica Iurescia, Università Cattolica del Sacro Cuore, Milano* Registration The school is mainly aimed at Doctoral and Master's degree students, although a minimum qualification is not required for access. Participation is *free* but subject to *registration *and* association to AILC**, *and places are limited to 200. Register at: https://www.ai-lc.it/en/lcl-registration-procedure/. Membership fees to AILC at: https://www.ai-lc.it/en/memberships/ Students wishing to present aspects of their work in the "Student Presentations" sessions are asked to send a *500-word abstract* to ailc. lectures(a)gmail.com by *May 10, 2025*. Notifications of acceptance will be sent by *May 20, 2025*. Scientific Committee Pierpaolo Basile, University of Studi di Bari Aldo Moro Raffaella Bernardi, University of Trento Tommaso Caselli, University of Groningen Elisabetta Fersini, University of Milano-Bicocca Elisabetta Jezek, University of Pavia Local Organizing Committee Marco Passarotti, Università Cattolica del Sacro Cuore, Milan Federica Iurescia, Università Cattolica del Sacro Cuore, Milan Matteo Pellegrini, Università Cattolica del Sacro Cuore, Milan Elisabetta Fersini, University of Milano-Bicocca Giulia Rizzi, University of Milano-Bicocca Contacts: ailc.lectures(a)gmail.com -- [image: LOGO-UNIPV] PhD ELISABETTA JEZEK Dipartimento di Studi Umanistici PROFESSORE ASSOCIATO IN LINGUISTICA E GLOTTOLOGIA Presidente del corso di laurea magistrale internazionale in European Languages, Cultures and Societies in Contact Membro del Consiglio Direttivo dell'Associazione Italiana di Linguistica Computazionale <https://firmamail.unipv.it/index.php/firme/genera> https://unipv.unifind.cineca.it/resource/person/659960 Elisabetta Jezek's Personal Meeting Room https://us02web.zoom.us/j/7814331810 -- Le informazioni contenute nella presente comunicazione sono di natura privata e come tali sono da considerarsi riservate ed indirizzate esclusivamente ai destinatari indicati e per le finalità strettamente legate al relativo contenuto. Se avete ricevuto questo messaggio per errore, vi preghiamo di eliminarlo e di inviare una comunicazione all’indirizzo e-mail del mittente. -- The information transmitted is intended only for the person or entity to which it is addressed and may contain confidential and/or privileged material. If you received this in error, please contact the sender and delete the material. <http://lettere.unipv.it/diplinguistica/docenti.php>

1 1

Deadline extension: The 20th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2025)
by Ekaterina Kochmar 16 Apr '25

16 Apr '25

Final Call for Papers: *The 20th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2025)* *Location*: Vienna, Austria and online (co-located with ACL 2025) *Date*: Thursday, July 31 and Friday, August 1, 2025 *Website*: https://sig-edu.org/bea/2025 <https://sig-edu.org/bea/2025> *Submission Deadline*: Thursday, April 24, 2025, 11:59pm UTC-12 *Submission Link*: https://softconf.com/acl2025/bea2025/ Dear all, due to multiple requests, we will be extending the submission deadline for BEA 2025 by one week – the submissions are due by April 24, 2025, 11:59pm UTC-12. WORKSHOP DESCRIPTION The BEA Workshop is a leading venue for NLP innovation in the context of educational applications. It is one of the largest one-day workshops in the ACL community with over 100 registered attendees in the past several years. The growing interest in educational applications and a diverse community of researchers involved resulted in the creation of the Special Interest Group in Educational Applications (SIGEDU) (https://sig-edu.org<https://sig-edu.org/>) in 2017, which currently has over 400 members. The 20th BEA workshop will be the first edition of BEA as *a 2-day workshop*, and it will feature a keynote by *Kostiantyn Omelianchuk (Grammarly)*, oral presentation sessions and large poster sessions to facilitate the presentation of a wide array of original research. This year, the workshop is also hosting *a shared task on Pedagogical Ability Assessment of AI-powered Tutors*, and *a half-day tutorial on LLMs for Education: Understanding the Needs of Stakeholders, Current Capabilities and the Path Forward *(more details on both to follow). We expect that the workshop will continue to highlight novel technologies and opportunities for educational NLP in English as well as other languages. The workshop will accept submissions of both full papers and short papers, eligible for either oral or poster presentation at https://softconf.com/acl2025/bea2025/. We solicit papers that incorporate NLP methods, including, but not limited to: - use of generative AI in education and its impact; - automated scoring of open-ended textual and spoken responses; - automated scoring/evaluation for written student responses (across multiple genres); - game-based instruction and assessment; - educational data mining; - intelligent tutoring; - collaborative learning environments; - peer review; - grammatical error detection and correction; - learner cognition; - spoken dialog; - multimodal applications; - annotation standards and schemas; - tools and applications for classroom teachers, learners and/or test developers; and - use of corpora in educational tools. INVITED TALKS The workshop will feature a keynote by Kostiantyn Omelianchuk (Grammarly), and an invited talk by a speaker from one of the IAALDE (https://alliancelss.com<https://alliancelss.com/>) societies. SHARED TASK The workshop will also host a shared task on Pedagogical Ability Assessment of AI-powered Tutors. See more details here: https://sig-edu.org/sharedtask/2025 IMPORTANT DATES All deadlines are 11.59 pm UTC-12 (anywhere on earth). - Submission deadline: *Thursday, April 24, 2025* - Notification of acceptance: *Thursday, May 22, 2025* - Camera-ready papers due: *Monday, June 9, 2025* - Workshop: *Thursday, July 31, and Friday, August 1, 2025* SUBMISSION INFORMATION We will be using the ACL Submission Guidelines for the BEA Workshop this year. Authors are invited to submit a long paper of up to eight (8) pages of content, plus unlimited references; final versions of long papers will be given one additional page of content (up to 9 pages) so that reviewers’ comments can be taken into account. We also invite short papers of up to four (4) pages of content, plus unlimited references. Upon acceptance, short papers will be given five (5) content pages in the proceedings. Authors are encouraged to use this additional page to address reviewers’ comments in their final versions. We generally follow ACL submission guidelines and will require that all submitted papers should include a dedicated "Limitations" section, which does not count toward the page limit. Papers which describe systems are also invited to give a demo of their system. If you would like to present a demo in addition to presenting the paper, please make sure to select either “long paper + demo” or “short paper + demo” under “Submission Category” in the START submission page. Previously published papers cannot be accepted. The submissions will be reviewed by the program committee. As reviewing will be blind, please ensure that papers are anonymous. Self-references that reveal the author’s identity, e.g., “We previously showed (Smith, 1991) …”, should be avoided. Instead, use citations such as “Smith previously showed (Smith, 1991) …”. We have also included conflict of interest in the submission form. You should mark all potential reviewers who have been authors on the paper, are from the same research group or institution, or who have seen versions of this paper or discussed it with you. We will be using the START conference system to manage submissions: https://softconf.com/acl2025/bea2025/ DOUBLE SUBMISSION POLICY We will follow the official ACL double-submission policy. Specifically, papers being submitted both to BEA and another conference or workshop must: - Note on the title page the other conference or workshop to which they are being submitted. - State on the title page that if the authors choose to present their paper at BEA (assuming it was accepted), then the paper will be withdrawn from other conferences and workshops. ORGANIZING COMMITTEE - Ekaterina Kochmar, MBZUAI - Andrea Horbach, Hildesheim University - Ronja Laarmann-Quante, Ruhr University Bochum - Marie Bexte, FernUniversität in Hagen - Anaïs Tack, KU Leuven, imec - Victoria Yaneva, National Board of Medical Examiners - Bashar Alhafni, New York University (NYU) & CAMeL Lab in NYUAD - Zheng Yuan, King’s College London - Jill Burstein, Duolingo Workshop contact email address: bea.nlp.workshop(a)gmail.com<mailto:bea.nlp.workshop@gmail.com> PROGRAM COMMITTEE https://sig-edu.org/bea/2025#program-committee

1 0

Job opening (Vienna, Austria): PhD Student, Researcher & Lecturer in Applied Data Science
by Ingo Frommholz 15 Apr '25

15 Apr '25

PhD Student, Researcher & Lecturer in Applied Data Science (all genders) Full-time (40 h/w) in Vienna, Austria Relevant for those interested in computer science, data science, artificial intelligence, information retrieval, natural language processing, or a related discipline. Temporary, Full-time - Modul University Vienna, Austria More info and application link at https://modul-university-vienna-gmbh.jobs.personio.com/job/2053035?language… JOB POSITION AND DEPARTMENT´S OVERVIEW Modul University Vienna is seeking an outstanding scholar for a Researcher & Lecturer position for its School of Applied Data Science to join its research team and teach undergraduate courses in the broader areas of data science (www.modul.ac.at/study-programs). This position is at the Researcher & Lecturer level and foresees candidates to develop their PhD research and participate in university self-governance. The School of Applied Data Science seeks candidates whose doctoral research will be related to one or more of the main research areas of the school. YOUR RESPONSIBILITIES Have a career ambition to conduct internationally recognized research, i.e. by publishing research articles and writing a doctoral thesis; Enthusiastically engage in teaching undergraduate courses; Assist in the general administration of the university; YOUR PROFILE A completed Master (by August 2025) in any of the fields of computer science, data science, artificial intelligence, information retrieval, natural language processing or a related discipline; Strong background in quantitative research methods; Excellent analytical skills and evidence of the potential to undertake cutting-edge research; Excellent written and spoken English (German is not a requirement as English is the working language). The minimum scores are: TOEFL: 100 Internet-based test (IBT) with no individual section score less than 20; or IELTS: overall band score between 7 and 7.5 with no sub-score below 6.0. The Admissions Committee may decide upon the recognition of other evidence of language skills. WHY JOIN US? Cutting-edge and interdisciplinary research approach Individual professional development An international and multicultural working environment FURTHER INFORMATION This is a full‐time, limited contract position where applicants are prepared to work up to 40 hours per week (with overtime hours). The position remains open until filled; the review of applications will commence immediately after the announcement. Starting date: 1 September 2025. The position is available as of September 1st for four years. Successful candidates will use about 50% of their working hours (40h/week) for their PhD studies. The starting salary is EUR 30,000 gross (annual teaching load: two units, @45 min per unit, of weekly teaching) plus approx. 30% Arbeitgeberanteil (= contribution to the social security and pension fund paid by the employer according to Austrian Law), with additional remuneration for extra teaching or supervision of theses. From the 2nd year onwards, the salary increases to EUR 39,350 gross with an annual teaching load of six units. Tuition for the PhD program is covered by Modul University Vienna. For questions related to this position, please contact the Dean of the PhD program astrid.dickinger(a)modul.ac.at or the Head of the School of Applied Data Science ingo.frommholz(a)modul.ac.at. Please submit on our job portal your complete application including cover letter, curriculum vitae, a motivation letter, two recommendation letters, academic transcripts, a statement of your research interests referring to any of the research profiles of Sustainability, Governance, and Methods Professors, and a statement of your teaching interests referring to one of the above-mentioned teaching areas (in English; preferably as a single PDF file not exceeding 8 MB). ABOUT US Modul University Vienna is an international private university and offers cutting-edge education at BBA, BSc, MSc, MBA and PhD levels. Students and staff come from over 70 countries around the world, providing for a multicultural and diverse work environment. As an employer, Modul University Vienna offers: Opportunities to work from home Flexible working hours International and dynamic team Personal and professional development Independent working environment Public transport ticket Modul University Vienna is an equal opportunity employer with a strong commitment to equality and diversity that does not discriminate on the basis of, among other factors, age, color, disability, ethnicity, gender or gender expression, national origin, race, sexual orientation, or social class. We especially encourage women and people who belong minority groups to apply and welcome all applications that can contribute to a diverse working culture. -- Prof. Dr. Ingo Frommholz (he/him), PhD, Dipl.-Inform., FBCS, FHEA Professor of Applied Data Science, Modul University Vienna, Austria Adjunct Professor, Bern University of Applied Sciences, Switzerland Web: http://www.frommholz.org/ | Email: ifrommholz(a)acm.org Bluesky: @ifromm.bsky.social | Mastodon: @ingo@idf.social

1 0

2025

2024

2023

2022

Corpora