Dear All,
We are pleased to invite submissions to the special issue of Natural
Language Engineering, now called *Natural Language Processing* (Cambridge
University Press), on the theme: "*Revolutionising Natural Language
Processing with Quantum Computing*"
*Cambridge NLE - Special Issue*
*Revolutionising Natural Language Processing with Quantum Computing*
*The deadline for submissions is November 30, 2025.*
*Call for Papers*
We welcome papers dealing with one or more of the following topics:
- Quantum computing Platforms for Algorithm execution
- Quantum-enhanced NLP models and embeddings
- Quantum Algorithms towards Natural Language Processing
- Design and Development of Quantum Machine Learning Hybrid
quantum-classical Models
- Applications of Quantum Machine Learning for Natural Language
Processing
- Future Direction of Quantum Machine Learning in Natural Language
Processing
- Quantum Large Language Models - Architectures and Mechanisms and
Challenges
- Quantum Machine Learning for Low-Resource Language Modeling
- Hybrid Quantum-Classical LLMs for Low-Resource Languages
- Quantum transformers and compression of large language models
- Quantum cognitive NLP for contextual and semantic understanding
- Quantum dialogue systems and conversational AI
- Quantum knowledge graphs for reasoning and inference
- Quantum semantic search and information retrieval
- Quantum adversarial learning and robustness in NLP
- *any other, which is related to Quantum Computing*
*Important Dates:*
- *Deadline for submissions*: November 30, 2025
- *First-round author notification*: Between April 30, 2026 and May 30,
2026
- *Submission of revised versions*: Between July 30, 2026 and August 30,
2026
- *Second-round author notification*: Between Nov 30, 2026 and Dec 30,
2026
- *Submission of revised second versions*: Between Dec 31, 2026 and Feb
28, 2027
- *Final Version*: Between April 30, 2027 and May 30, 2027
*Submissions:*
Instructions for preparing your manuscript for the *Journal of Natural
Language Engineering* are available
<https://www.cambridge.org/core/journals/natural-language-engineering/inform…>
here
<https://www.cambridge.org/core/journals/natural-language-engineering/inform…>
.
Please submit your article through the NLE
<https://mc.manuscriptcentral.com/nle>manuscript submission system
<https://mc.manuscriptcentral.com/nle>. When submitting your manuscript,
please select "*Revolutionising Natural Language Processing with Quantum
Computing"* in the field *Special Issue Section.*
*Any questions: *Dr. Partha Pakray National Institute of Technology
Silchar, India Email: partha(a)cse.nits.ac.in *[Guest Editor]*
Thanks & Regards,
Thanks & Regards,
*Dr. Partha Pakray*
Associate Professor
Department of Computer Science & Engineering
National Institute of Technology Silchar
Silchar, Assam, India
partha(a)cse.nits.ac.in
[Posting again with the correct time - apologies]
The next meeting of the Edge Hill Corpus Research Group will take place online (MS Teams) on Thursday 13 November 2025, 3:00-5:00 pm (GMT<https://time.is/United_Kingdom>).
Topic: LLMs and Corpus Tools
Speaker: Mark Davies<https://www.mark-davies.org/> (English-Corpora.org<https://www.english-corpora.org/>, USA)
Title: Integrating information from AI / LLMs into English-Corpora.org
The abstract and registration link are here: https://sites.edgehill.ac.uk/crg/next
Attendance is free. Registration closes on Tuesday 11 November.
If you have problems registering, or have any questions, please send an email to the organiser, Costas Gabrielatos (gabrielc(a)edgehill.ac.uk<mailto:gabrielc@edgehill.ac.uk>).
________________________________
Edge Hill University<http://ehu.ac.uk/home/emailfooter>
Modern University of the Year, The Times and Sunday Times Good University Guide 2022<http://ehu.ac.uk/tef/emailfooter>
University of the Year, Educate North 2021/21
________________________________
This message is private and confidential. If you have received this message in error, please notify the sender and remove it from your system. Any views or opinions presented are solely those of the author and do not necessarily represent those of Edge Hill or associated companies. Edge Hill University may monitor email traffic data and also the content of email for the purposes of security and business communications during staff absence.<http://ehu.ac.uk/itspolicies/emailfooter>
The next meeting of the Edge Hill Corpus Research Group will take place online (MS Teams) on Thursday 13 November 2025, 2:00-4:00 pm (GMT<https://time.is/United_Kingdom>).
Topic: LLMs and Corpus Tools
Speaker: Mark Davies<https://www.mark-davies.org/> (English-Corpora.org<https://www.english-corpora.org/>, USA)
Title: Integrating information from AI / LLMs into English-Corpora.org
The abstract and registration link are here: https://sites.edgehill.ac.uk/crg/next
Attendance is free. Registration closes on Tuesday 11 November.
If you have problems registering, or have any questions, please send an email to the organiser, Costas Gabrielatos (gabrielc(a)edgehill.ac.uk<mailto:gabrielc@edgehill.ac.uk>).
________________________________
Edge Hill University<http://ehu.ac.uk/home/emailfooter>
Modern University of the Year, The Times and Sunday Times Good University Guide 2022<http://ehu.ac.uk/tef/emailfooter>
University of the Year, Educate North 2021/21
________________________________
This message is private and confidential. If you have received this message in error, please notify the sender and remove it from your system. Any views or opinions presented are solely those of the author and do not necessarily represent those of Edge Hill or associated companies. Edge Hill University may monitor email traffic data and also the content of email for the purposes of security and business communications during staff absence.<http://ehu.ac.uk/itspolicies/emailfooter>
[Apologies for multiple postings]
Dear all,
We are pleased to announce that the submission deadline for the 16th
International Workshop on Spoken Dialogue Systems (IWSDS 2026)
<https://sites.google.com/unitn.it/iwsds26/> has been extended:
*Important Dates (Extended):*
* Paper Submission Deadline: October 12 → *Extended to October 22, 2025*
* Paper Update Deadline: October 18 →*Extended to October 28, 2025*
* Acceptance Notification: December 10, 2025
*Workshop Dates: February 26 – March 1, 2026*
We invite submissions of long papers, short papers, position papers,
industry track papers, and demonstrations on a broad range of topics
related to the Theoretical Foundations, Systems and Methods, and
Applications of spoken and multimodal dialogue systems.
Accepted papers will be included in the ACL Anthology.
This year’s theme is:
🎯“Human-Machine Dialogue in the Era of Multimodal Foundation Models”
Location: Trento, Italy – the gateway to the Dolomites, right after the
Milano Cortina 2026 Winter Olympics
Website & CfP: https://sites.google.com/unitn.it/iwsds26/
Twitter/X: https://x.com/iwsdsmeeting
Bluesky: https://bsky.app/profile/iwsdsmeeting.bsky.social
(s.) Mahed Mousavi, Ph.D.
IWSDS'26 Program Chair
Assistant Professor (RTD-A), University of Trento
https://sites.google.com/unitn.it/iwsds26/
====================================================================
CFP III Andaluz.IA Forum
December 19, 2025, Escuela Técnica Superior de Ingeniería Informática de la
Universidad de Sevilla
====================================================================
From ten Andalusian universities and the Joint Research Centre of the
European Commission we are organizing the III Andaluz.IA Forum
<https://sites.google.com/view/andaluzia/home>, a meeting whose main
objective is to show the potential and give visibility to the academic and
research community in Artificial Intelligence in our region. This forum
seeks to highlight the work of Andalusian scientists, both those who are
currently working in Andalusia, as well as those who have spent part of
their training or career in the region, regardless of their current place
of work.
The first edition of the Andaluz.IA forum was organized at Universidad
Pablo Olavide in Seville, the second edition at Universidad de Jaén and
this year it will take place at Escuela Técnica Superior de Ingeniería
Informática (Universidad de Sevilla). With this third edition, we want to
continue highlighting the great potential for research and academic
development in Artificial Intelligence that Andalusia has, in areas such as
machine learning, deep learning, robotics and natural language processing.
The event will be held in person on December 19, 2025 at Escuela Técnica
Superior de Ingeniería Informática(Universidad de Sevilla). Interested
researchers can participate by presenting their results in oral or poster
format, provided that they have been accepted in relevant conferences or
journals in the area. In addition, professionals and companies wishing to
participate may do so through sponsorship or direct participation by
registering on the event's website.
For more information on registration, submission of papers and forms of
sponsorship, please consult the following link:
https://sites.google.com/view/andaluzia/call-for-papers.
IMPORTANT DATES
● Submission deadline: October 20, 2025 October 26, 2025
● Notification of acceptance: November 3, 2025
● Registration deadline: November 17, 2025
● III Andaluz.IA Forum: December 19, 2025
ORGANIZING COMMITTEE
https://sites.google.com/view/andaluzia/organisers
[image: Universidad de Jaén] <https://www.ujaen.es/> *Salud María Jiménez
Zafra*
sjzafra(a)ujaen.es
Universidad de Jaén
Grupo de Investigación SINAI <http://sinai.ujaen.es/> | Departamento de
Informática
EPS Jaén, Edificio A3, Despacho 326
Campus Las Lagunillas s/n 23071 - Jaén | +34 953212992
[image: Universidad de Jaén] <https://www.ujaen.es/>
**
*15th Workshop on Computational Approaches to Subjectivity,
Sentiment & Social Media Analysis (WASSA 2026) – 1st Call for Papers*
*EACL’26, March 24–29, 2026, Rabat, Morocco, Half Day Workshop*
*Background and Envisaged Scope*
*Subjectivity and Sentiment Analysis has become a highly developed
research area, ranging from binary classification of reviews to the
detection of complex emotion structures between entities found in
text. This field has expanded both on a practical level, finding
numerous successful applications in business, as well as on a
theoretical level, allowing researchers to explore more complex
research questions related to affective computing. Its continuing
importance is also shown by the interest it generates in other
disciplines such as Economics, Sociology, Psychology, Marketing,
Crisis Management & Digital Humanities, where it can support the
study of online interactions, group dynamics, and public discourse.*
*The aim of WASSA 2026 is to bring together researchers working on
Subjectivity, Sentiment Analysis, Emotion Detection and
Classification and their applications to other NLP or real-world
tasks (e.g. public health messaging, fake news, media impact
analysis, social media mining, computational literary studies) and
researchers working on interdisciplinary aspects of affect
computation from text. We encourage the submission of long and short
research and demo papers including, but not restricted to the
following topics:*
*
*Resources for subjectivity, sentiment, emotion and social media
analysis*
*
*Opinion retrieval, extraction, categorization, aggregation and
summarization*
*
*Humor, Irony and Sarcasm detection*
*
*Mis- and disinformation analysis and the role of affective attributes*
*
*Aspect and topic-based sentiment and emotion analysis*
*
*Analysis of stable traits of social media users, incl. personality
analysis and profiling*
*
*Transfer learning for domain, language and genre portability of
sentiment analysis*
*
*Modelling commonsense knowledge for subjectivity, sentiment or
emotion analysis*
*
*Improvement of NLP tasks using subjectivity and/or sentiment analysis*
*
*Intrinsic and extrinsic evaluation of subjectivity and/or sentiment
analysis*
*
*Application of theories from related fields to subjectivity and
sentiment analysis*
*
*Multimodal emotion detection and classification*
*
*Social Groups analysis and their interactions in Social Media*
*
*Generation, detection, and evaluation of subjectivity, sentiment,
and emotion in NLP tasks with LLMs*
*
*Risks, challenges, and ethical implications of affective uses of LLMs*
*
*The role of emotions in argument mining*
*
*Applications of sentiment and emotion mining*
*
*Public sentiments and communication patterns of public health
emergencies.*
*
*The analysis of pretrained small and large language models.*
*Finally, this year we also propose a special trackon multilinguality
and socio-cultural adaptation to lesser-resourced languages/communities. *
*In general, we particularly invite contributions from young
researchers, work on low-resource languages, multilingual methods, and
interdisciplinary work.*
*Important dates*
*
*December 17, 2025: Direct submission deadline*
*
*January 2, 2026: ARR submission deadline*
*
*January 23, 2026: Notification of acceptance*
*
*February 3, 2026: Camera Ready Papers due*
*
*March 24–29, 2026: EACL with WASSA workshop on one of the days.*
*Shared tasks*
*We do not offer a shared task this year.*
*Papers*
*At WASSA 2026, we will accept three types of submissions:*
*For the regular research track we accept long& shortpapers.*
*Additionally, we accept double submissions and double commitment of ARR
reviews in parallel to WASSA and another venue. Please note that you
must immediately withdraw your paper from WASSA if you decide to publish
it elsewhere. Check with the other venue if they also allow double
submissions.*
*Longpapers may consist of up to eight (8) pages of content, with any
number of additional pages of references. A- subset of these papers will
be presented orally.*
*Shortpapers may consist of up to four (4) pages of content, with any
number of additional pages of references. Most of these papers will be
presented as posters.*
*Also this year there is an industry track, for which we accept demo
papers. Demo papersdescribe system demonstrations, ranging from early
prototypes to mature production-ready systems. Commercial sales and
marketing activities are not appropriate for this track. Demo papers may
consist of up to six (6) pages of content, these will be presented as a
poster and should include a live demonstration.*
*Submission procedure and templates*
*Submissions without reviews can be done directly through our Open
Review side
(https://openreview.net/group?id=eacl.org/EACL/2026/Workshop/WASSA).*
*Authors who received reviews already through the ACL Rolling Review
process are invited to commit their reviewed paper to WASSA. To do so,
please go to our ARR Website and click on “ACL 2026 Workshop WASSA
Commitment Submission”. You will then need to add the title, the URL to
the ARR submission with reviews + metareview, and other information.*
*Both long and short papers must be anonymised for double-blind
reviewing, must follow the ACL Author Guidelines
<https://www.aclweb.org/adminwiki/index.php?title=ACL_Author_Guidelines>,
and must use the ACL templates
(https://github.com/acl-org/acl-style-files). The submitting author must
have an OpenReview profile*
*Optional Supplementary Materials: Appendices, Software and Data*
*ARR encourages the submission of these supplementary materials to
improve the reproducibility of results, and to enable authors to provide
additional information that does not fit in the paper. Supplementary
materials may include appendices, software or data. For example, pre
processing decisions, model parameters, feature templates, lengthy
proofs or derivations, pseudocode, sample system inputs/outputs, and
other details that are necessary for the exact replication of the work
described in the paper can be put into appendices. However, if the
pseudo-code or derivations or model specifications are an important part
of the contribution, or if they are important for the reviewers to
assess the technical correctness of the work, they should be a part of
the main paper, and not appear in appendices. Reviewers are not required
to consider material in appendices. Appendices should come after the
references in the submitted pdf, but do not count towards the page
limit. Software should be submitted as a single .tgz or .zip archive,
and data as a separate single .tgz or .zip archive. Supplementary
materials must be fully anonymized to preserve the two-way anonymized
reviewing policy and must not exceed 100MB.*
*Organizers*
*
Jeremy Barnes, University of the Basque Country
Valentin Barriere, University of Chile
Orphée De Clercq, Ghent University
Roman Klinger, University of Bamberg
Célia Nouri, Inria and Sciences Po
Debora Nozza, Bocconi University
Pranaydeep Singh, Ghent University
Contact
Email: wassaworkshop(a)gmail.com
Website: https://workshop-wassa.github.io/
*
--
Prof. Dr. Roman Klinger (he/him)
Professor for Fundamentals of Natural Language Processing
Faculty Information Systems and Applied Computer Sciences (WIAI)
Otto-Friedrich-University Bamberg
Office: Room 02.10; Gutenbergstr. 13; 96050 Bamberg; Germany
Phone: +49 951 863 3320
Mail:roman.klinger@uni-bamberg.de
WWW:
https://www.uni-bamberg.de/nlproc/https://www.romanklinger.de/
======================================
8-12 June 2026
Venice, Italy
http://unive.it/avi2026
======================================
IMPORTANT DATES
- Workshop proposals submission: December 14, 2025
- Notification: December 21, 2025
- Workshop website online: January 12, 2026
(all deadlines are 23:59, AoE)
---------------------------------------------------------------------------
WORKSHOP PROPOSALS
We invite proposals for workshops that will facilitate the exchange of new ideas in all areas related to Advanced Visual Interfaces and Human-Computer Interaction. We invite organizers to propose either half-day or one-day long workshops held between June 8 and 12, 2026, at the AVI2026 venue.
Workshop proposals must be submitted using the following form:
https://forms.gle/v7gEKFchxAGzMJtj8
Workshop Chairs
- Daniela Fogli, Università di Brescia, Italy (daniela.fogli(a)unibs.it)
- Kyle Montague, Northumbria University, UK (kyle.montague(a)northumbria.ac.uk)
---------------------------------------------------------------------------
18th International Conference on Advanced Visual Interfaces (AVI) 2026
Interactive Creativity: Agencies, Interfaces, and Ethics
Since its first edition in 1992, AVI has become an influential space for encounters among scholars and practitioners interested in interfaces, interactions, and experiences. Rooted in pioneering research on visual interfaces characterized by a distinctive attention to the human factor, the conference has evolved across the different waves of Human-Computer Interaction (HCI). It has addressed the pragmatic and hedonic needs of heterogeneous groups of users up to the current challenge of self-actualization. Creativity is a core behavior that leads to the realization of a person’s full potential and the explosion of generative AI presents both opportunities and challenges to human creativity. They address fundamental issues related to agencies, interfaces, and ethics.
AVI 2026 will take place in San Servolo, a small island of Venice. This delicate and fragile ecosystem provides the ideal venue to reflect, reframe, and speculate about creative solutions to more sustainable and rewarding technological futures. AVI is an International Conference considering the nationality of participants, authors, and organizing committees. However, it has always taken place in Italy, thus complementing a strong and diverse research program with carefully selected cultural and social activities alongside a distinct sense of hospitality and conviviality.
The conference is held under the patronage of the Free University of Bozen-Bolzano and Ca’ Foscari University of Venice, Italy.
We look forward to your participation in AVI 2026!
Antonella De Angeli, AVI 2026 General Co-Chair
Albrecht Schmidt, AVI 2026 General Co-Chair
Rosella Gennari, AVI 2026 Program Co-Chair
Fabio Pittarello, AVI 2026 Program Co-Chair
---------------------------------------------------------------------------
MAIN THEMES AND TOPICS
Themes and topics of interest include (but are not limited to) the following:
/ Theme: Interaction Paradigms and Modalities /
Brain-Computer Interaction
Embodied and Tangible Interaction
Material-Centric Interaction
Information Visualization
Screen-based Interaction
Voice and Sound-based Interaction
Multimodal Interaction
/ Theme: Interaction Spaces /
Augmented Reality
Cross Reality
Virtual Reality
Interaction between Black-Boxes
Dynamic Physical Environments
Natural Environments
Urban Places
/ Theme: Human-System Interaction /
Adaptive and Context-Aware Interfaces
Affective Interfaces
Human-Robot Interaction
Intelligent Interfaces
Interfaces and Recommender Systems
/ Theme: Ecosystems of People, Groups and Societies /
Computer-Supported Cooperative Work
Learning Ecosystems
Game and Play Ecosystems
Social Interaction and Cooperation Systems
/ Theme: Values and Moral Principles /
Beyond Human Interaction
Critical Computing
Critical Data Science
Diversity, Equity, and Inclusion
Responsible Design
/ Theme: Applications /
Cultural Heritage and Digital Humanities
End User Development
AI Art and Creativity
Human Factors in Security Systems
Health, Well-being, and Self-Actualization
---------------------------------------------------------------------------
AVI 2026 ORGANIZING COMMITTEE
General Chairs
Antonella De Angeli, Free University of Bozen-Bolzano, Italy (antonella.deangeli(a)unibz.it)
Albrecht Schmidt, Ludwig-Maximilians Munich University, Germany (albrecht.schmidt(a)um.ifi.lmu.de)
Program Chairs
Rosella Gennari, Free University of Bozen-Bolzano, Italy (gennari(a)inf.unibz.it)
Fabio Pittarello, Ca’ Foscari University of Venice, Italy (pitt(a)unive.it)
Long Papers Chairs
Paloma Diaz, Carlos III University, Madrid, Spain (pdp(a)inf.uc3m.es)
Alessandra Melonio, Ca’ Foscari University of Venice, Italy (alessandra.melonio(a)unive.it)
Short Papers Chairs
Luigi De Russis, Politecnico di Torino, Italy (luigi.derussis(a)polito.it)
María Menéndez Blanco, Free University of Bozen-Bolzano, Italy (Maria.MenendezBlanco(a)unibz.it)
Proceedings Chairs
Niccolò Pretto, Free University of Bozen-Bolzano, Italy (niccolo.pretto(a)unibz.it)
Workshops Chairs
Daniela Fogli, Università di Brescia, Italy (daniela.fogli(a)unibs.it)
Kyle Montague, Northumbria University, UK (kyle.montague(a)northumbria.ac.uk)
Interactive Experiences and Demos Chairs
Stefania De Vincentis, Ca’ Foscari University of Venice, Italy (stefania.devincentis(a)unive.it)
Florian Michahelles, TU Wien, Austria (florian.michahelles(a)tuwien.ac.at)
Sebastiano Vascon, Ca’ Foscari University of Venice, Italy (sebastiano.vascon(a)unive.it)
Posters Chairs
Alba Bisante, Sapienza University of Rome, Italy (bisante(a)di.uniroma1.it)
Tanja Doering, TU Berlin, Germany (tanja.doering(a)mms.tu-berlin.de)
Doctoral Consortium Chairs
Rosa Lanzilotti, University of Bari, Italy (rosa.lanzilotti(a)uniba.it)
Monica Divitini, Norwegian University of Science and Technology, Norway (divitini(a)ntnu.no)
Industry Chairs
Fabio Morreale, Sony, Spain (Fabio.Morreale(a)sony.com)
Emanuele Pucci, Politecnico di Milano (emanuele.pucci(a)polimi.it)
Accessibility and Inclusion Chairs
Marco Mores, Free University of Bozen-Bolzano, Italy (marco.mores(a)student.unibz.it)
Teresa Scantamburlo, University of Trieste, Italy (teresa.scantamburlo(a)units.it)
Web Chair
Tommaso Pellegrini, Ca’ Foscari, University of Venice, Italy (tommaso.pellegrini(a)unive.it)
Publicity & Social Networks Chairs
Andrea Rezzani, Free University of Bozen-Bolzano, Italy (andrea.rezzani(a)student.unibz.it)
Mehdi Rizvi, University of East Anglia, UK (mehdi.rizvi(a)uea.ac.uk)
Student Volunteers Chairs
Daniel Bermudez, Free University of Bozen-Bolzano, Italy (JulioDaniel.BermudezChinea(a)student.unibz.it)
Bilal Khan, Free University of Bozen-Bolzano, Italy
(MuhammadBilal.Khan(a)student.unibz.it)
---------------------------------------------------------------------------
AVI STEERING COMMITTEE
Paolo Bottoni,
Sapienza University of Rome, Italy
Cristina Conati
University of British Columbia, Canada
Emanuele Panizzi,
Sapienza University of Rome, Italy
Ilaria Torre
University of Genoa, Italy
Genny Tortora
University of Salerno, Italy
Giuliana Vitiello
University of Salerno, Italy
Gualtiero Volpe
University of Genoa, Italy
Marco Winckler
Université Côte d'Azur, France
--
Informativa sulla Privacy: https://www.unibs.it/it/node/1452
<https://www.unibs.it/it/node/1452>
In this newsletter:
Membership year 2026 publication preview
Fall 2025 data scholarship recipients
New publications:
KAIROS Phase 2 Quizlet<https://catalog.ldc.upenn.edu/LDC2025T15>
BOLT CTS CALLFRIEND CALLHOME Egyptian Arabic Audio<https://catalog.ldc.upenn.edu/LDC2025S09>
BOLT CTS CALLFRIEND CALLHOME Egyptian Arabic Transcripts and Translations<https://catalog.ldc.upenn.edu/LDC2025T14>
________________________________
Membership year 2026 publication preview
The 2026 membership year is approaching and plans for next year's publications are in progress. Among the expected releases are:
* 2012 NIST Speaker Recognition Evaluation Test Set: 10,000+ hours of English conversational telephone speech following the Mixer collection protocol, used in NIST's 2012 speaker recognition evaluation
* KAIROS schema learning corpus background data and Phase 1 evaluation datasets: multimodal English and Spanish source data and annotations for reasoning about complex real-world events
* CALL MY NET 2: 800+ hours of Tunisian-Arabic conversational telephone speech from over 400 speakers to support text independent speaker recognition, used in the 2018 NIST Speaker Recognition Evaluation
* Multi-language conversational telephone speech: multiple releases, hundreds of hours of speech from speakers of confusable linguistic varieties (Arabic, Chinese, English, French, Slavic, Spanish) to support language identification
* CALLHOME Omnibus releases: combined speech and transcript datasets with updated directory structure, file formats and documentation, and lexicons (Chinese, English, German, Japanese, Spanish)
* IARPA MATERIAL language packs: conversational telephone speech, transcripts, English translations, annotations, and queries in multiple languages (e.g., Lithuanian, Pashto, Swahili, Tagalog)
Check your inbox for more information about membership renewal.
Fall 2025 data scholarship recipients
Congratulations to the recipients of LDC's Fall 2025 data scholarships:
Lasidu Dilshan: University of Moratuwa (Sri Lanka): BSc, Electronic and Telecommunication Engineering. Lasidu is awarded a copy of Asian Elephant Vocalizations LDC2010S05 for his work in elephant voice enhancement and classification.
Máté Gedeon: Budapest University of Technology and Economics (Hungary): PhD candidate, Department of Telecommunications and Artificial Intelligence. Máté is awarded a copy of Switchboard-1 Release 2 LDC97S62 for his work in simulated conversation generation.
Ping He: Northeastern University (USA): Student, Khoury College of Computer Sciences. Ping is awarded a copy of ETS Corpus of Non-Native Written English LDC2014T06 for their work in native language identification.
Thiyazen Iskander: Maulana Azad College of Arts, Science & Commerce (India), affiliated with Babasaheb Ambedkar Technological University (India): PhD candidate, Linguistics, Department of English. Thiyazen is awarded copies of Arabic Morphological Analyzer (SAMA) Version 3.1 LDC2010L01 and Arabic Treebank Part 1 v. 4.1 LDC2010T13 for his work in morphosyntactic analysis of short passives in Standard Arabic.
Michael Mooney: University of Glasgow (United Kingdom): PhD candidate, School of Computing Sciences. Michael is awarded copies of Treebank-2 LDC95T7 and BLLIP 1987-89 WSJ Corpus Release LDC2000T43 for their work in eye-tracking for text-centered modeling.
Abraham Sanders: Rensselaer Polytechnic Institute (USA): PhD candidate, Cognitive Science. Abraham is awarded a copy of Switchboard-1 Release 2 LDC97S62 for his work in spoken dialogue systems.
________________________________
New publications:
KAIROS Phase 2 Quizlet<https://catalog.ldc.upenn.edu/LDC2025T15> was developed by LDC and contains English and Spanish text, video and image data, and annotations used for pre-evaluation research and system development during Phase 2 of the DARPA KAIROS program. KAIROS Quizlets were a series of narrowly defined tasks designed to explore specific evaluation objectives enabling KAIROS system developers to exercise individual system components on a small data set prior to the full program evaluation. This corpus contains the complete set of Quizlet data used in Phase 2 which focused on five real-world complex events within the Disease Outbreak scenario.
Source data was collected from the web; 66 root web pages were collected and processed, yielding 65 text data files, 890 image files and 10 video files. Annotation steps included labeling scenario-relevant events and relations for each document to develop a structured representation of temporally ordered events, relations and arguments; generating a reference knowledge graph; and linking labeled entries to a knowledge base derived from a Wikidata-based ontology.
The DARPA KAIROS (Knowledge-directed Artificial Intelligence Reasoning Over Schemas) program aimed to build technology capable of understanding and reasoning about complex real-world events in order to provide actionable insights to end users. KAIROS systems utilized formal event representations in the form of schema libraries that specified the steps, preconditions and constraints for an open set of complex events; schemas were then used in combination with event extraction to characterize and make predictions about real-world events in a large multilingual, multimedia corpus.
2025 members can access this corpus through their LDC accounts. Non-members may license this data for a fee.
*
BOLT CTS CALLFRIEND CALLHOME Egyptian Arabic Audio<https://catalog.ldc.upenn.edu/LDC2025S09> was developed by LDC and consists of 116 hours of speech from 274 unscripted telephone conversations between native speakers of the Arabic dialect spoken in Egypt. The calls were collected by LDC in the CALLFRIEND and CALLHOME series where participants called family members or close friends and spoke on topics of their choice. Around 33% of the recordings (92 calls) are publicly released for the first time. The remaining 182 recordings were previously published by LDC in various CALLFRIEND, CALLHOME, and HUB5 Arabic datasets.
The DARPA BOLT (Broad Operational Language Translation) program developed machine translation and information retrieval for less formal genres, focusing particularly on user-generated content. LDC supported the BOLT program by collecting informal data sources -- discussion forums, conversational telephone speech, text messaging, and chat -- in Chinese, Egyptian Arabic, and English. The material in this release represents the unannotated Egyptian Arabic source conversational telephone speech. The telephone data was transcribed, translated, and annotated for various tasks in the BOLT program including word alignment, treebanking, and co-reference.
2025 members can access this corpus through their LDC accounts. Non-members may license this data for a fee.
*
BOLT CTS CALLFRIEND CALLHOME Egyptian Arabic Transcripts and Translations<https://catalog.ldc.upenn.edu/LDC2025T14> contains transcripts and corresponding English translations for the conversational telephone speech in BOLT CTS CALLFRIEND CALLHOME Egyptian Arabic Audio<https://catalog.ldc.upenn.edu/LDC2025S09> and was developed by LDC to support the DARPA BOLT program.
Transcribers were required to produce a verbatim transcript of all speech within a file using the CODA<https://aclanthology.org/L12-1328/> orthographic approach; diacritics were not included. Some transcripts contain redactions for potential personally identifying information. All speech data was transcribed and is divided into training, development, and evaluation partitions.
The goal of the BOLT translation task was to translate the Arabic transcripts into fluent English while preserving the meaning present in the original Arabic text. Transcripts in the development and evaluation partitions received first pass and gold standard translations. 99% of the transcripts were translated into English.
2025 members can access this corpus through their LDC accounts. Non-members may license this data for a fee.
To unsubscribe from this newsletter, log in to your LDC account<https://catalog.ldc.upenn.edu/login> and uncheck the box next to "Receive Newsletter" under Account Options or contact LDC for assistance.
Membership Coordinator
Linguistic Data Consortium<ldc.upenn.edu>
University of Pennsylvania
T: +1-215-573-1275
E: ldc(a)ldc.upenn.edu<mailto:ldc@ldc.upenn.edu>
M: 3600 Market St. Suite 810
Philadelphia, PA 19104
Call for Papers - AbjadNLP 2026
The 2nd Workshop on NLP for Languages Using Arabic Script
Rabat, Morocco (in-person)
Co-located with EACL 2026
_https://wp.lancs.ac.uk/abjad/_ [1]_ _
AbjadNLP 2026 invites submissions on all aspects of Natural Language
Processing (NLP) for Arabic-script languages, including Arabic and its
dialects, Perso-Arabic languages, and Ajami traditions across Africa and
Asia. Building on the success of AbjadNLP 2025 at COLING, the 2026
edition will be held in Rabat, Morocco, co-located with EACL.
The workshop provides a platform for research in Arabic NLP -- covering
Modern Standard Arabic, Classical Arabic, and dialectal varieties --
while also supporting work on related languages such as Persian, Urdu,
Pashto, Kurdish, Azeri Turkish, Ottoman Turkish, Sindhi, Uyghur, and
African Ajami languages (e.g. Hausa, Fula, Wolofal, Swahili, Kanuri,
Mandingo, and Tamazight).
We welcome contributions on all NLP topics, including morphological
analysis, parsing, translation, LLM adaptation, resources and corpora,
code-switching, sociolinguistic analysis, and low-resource methods.
Submission Deadlines:
Direct submissions: 8 January 2026
ARR submissions: 10 January 2026
Notification: 24 January 2026
Camera-ready: 3 February 2026
Workshop: 24-29 March 2026 (TBC)
AbjadNLP 2026 follows the EACL 2026 submission and formatting
guidelines. Templates are available at
https://github.com/acl-org/acl-style-files [2].
General Chair: Dr Mo El-Haj (VinUniversity & Lancaster University)
More information: https://wp.lancs.ac.uk/abjad/ [3]
--
Amal Haddad Haddad (She/her)
Ayudante doctora
Facultad de Traducción e Interpretación
Universidad de Granada |https://www.ugr.es/personal/amal-haddad-haddad
Lexicon Research Group |http://lexicon.ugr.es/haddad
Co-Convenor, BAAL SIG 'Humans, Machines,
Language'|https://r.jyu.fi/humala
Event Coordinator, BAAL SIG 'Language, Learning and Teaching
===============
Cláusula de Confidencialidad: "Este mensaje se dirige exclusivamente a
su destinatario y puede contener información privilegiada o
confidencial. Si no es Ud. el destinatario indicado, queda notificado de
que la utilización, divulgación o copia sin autorización está prohibida
en virtud de la legislación vigente. Si ha recibido este mensaje por
error, se ruega lo comunique inmediatamente por esta misma vía y proceda
a su destrucción.
This message is intended exclusively for its addressee and may contain
information that is CONFIDENTIAL and protected by professional
privilege. If you are not the intended recipient you are hereby notified
that any dissemination, copy or disclosure of this communication is
strictly prohibited by law. If this message has been received in error,
please immediately notify us via e-mail and delete it"
===============
Links:
------
[1] https://wp.lancs.ac.uk/abjad/
[2]
https://urldefense.com/v3/__https:/github.com/acl-org/acl-style-files__;!!D…
[3]
https://urldefense.com/v3/__https:/wp.lancs.ac.uk/abjad/__;!!D9dNQwwGXtA!Xa…
Dear all,
Please feel free to share this with your contacts.
I am seeking motivated students for a fully funded 4-year PhD in Social NLP. Social NLP is an emerging interdisciplinary field that combines natural language processing (NLP) and social computing.
The scope of doctoral thesis proposals (but is not restricted to) is based on the following topics:
- Information Disorder on Social Media (Fake news, Cyberbullying, Hate Speech, misinformation, etc.)
- Disaster Management Using Social Media
- Explainable AI in Social NLP
Why the University of Tartu<https://ut.ee/en>?
- The University of Tartu is one of the oldest universities (founded in 1632) in Northern and Eastern Europe. The University of Tartu is known for its high-quality studies and research. The unique intellectual atmosphere and fine living and working environment that characterise Tartu have attracted prominent scientists and experts for almost 390 years.
- As of 2025, the University of Tartu belongs to the top 1% of the world's universities.
- The University of Tartu is the strongest research university in Estonia. We belong to various networks of renowned research universities, such as ENLIGHT, the Coimbra Group, LERU, etc.
- Here, 13 000 students and 3000 staff members study and work in a diverse range of fields connected by the power of curiosity.
- Estonia is famous for its digital society and has the highest number of startups per capita in Europe. Estonia is famous for its digital innovations, and it is home to many startups like Skype, Transferwise, and Bolt to name a few.
Scholarship:
- Scholarship approximately 2300 Euro/month gross amount.
- Covers health insurance
- Additional benefits: Travel grant for attending conferences and workshops, etc.
If you are interested, please drop me an email (somnath.banerjee(a)ut.ee) with the following:
- CV which must include:
- final year undergraduate/graduate project (brief description: max 150 words)
- pursued relevant subjects at undergraduate and/or Master level (if any)
- relevant work experience (if any)
- publications (if any)
- The name and email contacts of academic referees(optional)
Please Note:
* Master-level students, who are in their final semester, can also apply.
* Applications will be considered until the position is filled. However, you are encouraged to apply early as the processing of the applications will start soon after receiving them.
* Do not send academic certificates, motivation letters, letters of recommendation, etc.
*
Upon initial selection based on CVs, additional documents (such as motivation letters, certificates) will be requested for further processing.
For queries: If you need more information, please contact me with the subject line "SocialNLP: PhD position queries".
Kind regards,
Somnath Banerjee
Assistant Professor,
Chair of NLP,
Institute of Computer Science,
University of Tartu,
51009 Tartu, ESTONIA