====================================================================
CFP III Andaluz.IA Forum
December 19, 2025, Escuela Técnica Superior de Ingeniería Informática de la
Universidad de Sevilla
====================================================================
From ten Andalusian universities and the Joint Research Centre of the
European Commission we are organizing the III Andaluz.IA Forum
<https://sites.google.com/view/andaluzia/home>, a meeting whose main
objective is to show the potential and give visibility to the academic and
research community in Artificial Intelligence in our region. This forum
seeks to highlight the work of Andalusian scientists, both those who are
currently working in Andalusia, as well as those who have spent part of
their training or career in the region, regardless of their current place
of work.
The first edition of the Andaluz.IA forum was organized at Universidad
Pablo Olavide in Seville, the second edition at Universidad de Jaén and
this year it will take place at Escuela Técnica Superior de Ingeniería
Informática (Universidad de Sevilla). With this third edition, we want to
continue highlighting the great potential for research and academic
development in Artificial Intelligence that Andalusia has, in areas such as
machine learning, deep learning, robotics and natural language processing.
The event will be held in person on December 19, 2025 at Escuela Técnica
Superior de Ingeniería Informática(Universidad de Sevilla). Interested
researchers can participate by presenting their results in oral or poster
format, provided that they have been accepted in relevant conferences or
journals in the area. In addition, professionals and companies wishing to
participate may do so through sponsorship or direct participation by
registering on the event's website.
For more information on registration, submission of papers and forms of
sponsorship, please consult the following link:
https://sites.google.com/view/andaluzia/call-for-papers.
IMPORTANT DATES
● Submission deadline: October 20, 2025 October 26, 2025
● Notification of acceptance: November 3, 2025
● Registration deadline: November 17, 2025
● III Andaluz.IA Forum: December 19, 2025
ORGANIZING COMMITTEE
https://sites.google.com/view/andaluzia/organisers
[image: Universidad de Jaén] <https://www.ujaen.es/> *Salud María Jiménez
Zafra*
sjzafra(a)ujaen.es
Universidad de Jaén
Grupo de Investigación SINAI <http://sinai.ujaen.es/> | Departamento de
Informática
EPS Jaén, Edificio A3, Despacho 326
Campus Las Lagunillas s/n 23071 - Jaén | +34 953212992
[image: Universidad de Jaén] <https://www.ujaen.es/>
The Seventh Workshop on Teaching NLP
Call for Papers
Educators designing Natural Language Processing (NLP) and/or
Computational Linguistics (CL) courses and degree programs face unique
challenges due to the rapid progress of the field, particularly with the
impact of generative AI and Large Language Models (LLMs). Here, the
challenge is two-fold: A) courses need to keep up with the pace in terms
of the content covered, while B) it will be crucial for educators to
adapt the course design accordingly, acknowledging the existence and the
use of LLMs by students. To support all those who are facing these
challenges, we are planning a discussion-heavy one-day workshop to bring
together the communities of NLP research and education, and facilitate
active discussion on questions such as (but not limited to):
*
How can we balance technical details, linguistic background, and domain
knowledge in NLP-related courses?
*
How do we keep the human in the loop?
*
How can the community support educators at different institutions and
career levels?
This timely seventh edition of the Teaching NLP Workshop builds on prior
successful offerings [1] to tackle the most pressing issues in how to
design NLP courses and bring together instructors from various
backgrounds to discuss, create, and refine instructional design and
material.
Submission Information
We invite submissions in two categories: short papers (2 pages) on
teaching materials and full papers (8 pages) on original, unpublished
research (both regular research papers and position papers).
Format
All submissions must use the official ACL LaTeX style template [2] and
follow the standard ACL submission requirements [3]. References and
appendices do not count against the page limit (8 pages). Limitations
and ethical considerations are optional and do not count against these
limits either. Papers that do not conform to these requirements will be
desk-rejected without review.
Submission Process
For submission and the review process, we will use OpenReview. If you do
not have an OpenReview account yet, make sure to create it well in
advance of the deadline. This is especially important, as in some cases,
the approval of the account may take some time. The reviewing process
will be single-blind.
Submission Link (OpenReview): tbd
Submission Type 1: Short Papers on Teaching Materials
We invite submissions of short papers of 1-2 pages that describe
teaching materials such as curricula, course GitHub repositories,
Jupyter notebooks, slides, homework, programming assignments, or
projects. These short papers need not be anonymized, but will be
peer-reviewed and published as part of the workshop proceedings, and
presented as posters and/or demos. The associated teaching materials,
while not being part of the proceedings, should be submitted in addition
to the short paper. We will create a Teaching NLP repository where
authors may opt in to make their materials available for reuse after the
workshop.
Submission Type 2: Full Papers
We invite papers of up to 8 pages discussing pedagogical aspects of NLP,
focusing on (but not limited to) any of the following general topics:
*
Tools and methodologies (e.g., teaching with code, active learning,
flipped classroom)
*
Scaling curricula to fit large class sizes
*
Adapting existing curricula to incorporate new NLP advancements
*
Teaching online NLP courses or adjusting courses to become remote
*
Challenges of designing the first NLP course or related degree program
at a college, university, or on a MOOC platform
*
Teaching heterogenous groups of students (e.g., with respect to prior
experience in computer science and linguistics, with respect to their
social and cultural background, etc.)
*
Teaching underrepresented students
*
Bridging the gap between academic training and industry needs
*
Incorporating ethics, reproducibility, and responsible practices in NLP
courses
*
Teaching multilingual NLP
Important Dates
*
First Call for Papers: October 15, 2025
*
Paper Submission: December 19, 2025
*
Notification of Acceptance: January 23, 2026
*
Camera-Ready Deadline: February 3, 2026
*
Teaching NLP Workshop: 1 day workshop co-located with EACL (March 24 to
29, 2026)
All deadlines are 23:59 AoE (anywhere on earth).
Links:
------
[1] https://sites.google.com/view/teachingnlpacl2024
[2] https://github.com/acl-org/acl-style-files
[3] https://aclrollingreview.org/cfp
**
*15th Workshop on Computational Approaches to Subjectivity,
Sentiment & Social Media Analysis (WASSA 2026) – 1st Call for Papers*
*EACL’26, March 24–29, 2026, Rabat, Morocco, Half Day Workshop*
*Background and Envisaged Scope*
*Subjectivity and Sentiment Analysis has become a highly developed
research area, ranging from binary classification of reviews to the
detection of complex emotion structures between entities found in
text. This field has expanded both on a practical level, finding
numerous successful applications in business, as well as on a
theoretical level, allowing researchers to explore more complex
research questions related to affective computing. Its continuing
importance is also shown by the interest it generates in other
disciplines such as Economics, Sociology, Psychology, Marketing,
Crisis Management & Digital Humanities, where it can support the
study of online interactions, group dynamics, and public discourse.*
*The aim of WASSA 2026 is to bring together researchers working on
Subjectivity, Sentiment Analysis, Emotion Detection and
Classification and their applications to other NLP or real-world
tasks (e.g. public health messaging, fake news, media impact
analysis, social media mining, computational literary studies) and
researchers working on interdisciplinary aspects of affect
computation from text. We encourage the submission of long and short
research and demo papers including, but not restricted to the
following topics:*
*
*Resources for subjectivity, sentiment, emotion and social media
analysis*
*
*Opinion retrieval, extraction, categorization, aggregation and
summarization*
*
*Humor, Irony and Sarcasm detection*
*
*Mis- and disinformation analysis and the role of affective attributes*
*
*Aspect and topic-based sentiment and emotion analysis*
*
*Analysis of stable traits of social media users, incl. personality
analysis and profiling*
*
*Transfer learning for domain, language and genre portability of
sentiment analysis*
*
*Modelling commonsense knowledge for subjectivity, sentiment or
emotion analysis*
*
*Improvement of NLP tasks using subjectivity and/or sentiment analysis*
*
*Intrinsic and extrinsic evaluation of subjectivity and/or sentiment
analysis*
*
*Application of theories from related fields to subjectivity and
sentiment analysis*
*
*Multimodal emotion detection and classification*
*
*Social Groups analysis and their interactions in Social Media*
*
*Generation, detection, and evaluation of subjectivity, sentiment,
and emotion in NLP tasks with LLMs*
*
*Risks, challenges, and ethical implications of affective uses of LLMs*
*
*The role of emotions in argument mining*
*
*Applications of sentiment and emotion mining*
*
*Public sentiments and communication patterns of public health
emergencies.*
*
*The analysis of pretrained small and large language models.*
*Finally, this year we also propose a special trackon multilinguality
and socio-cultural adaptation to lesser-resourced languages/communities. *
*In general, we particularly invite contributions from young
researchers, work on low-resource languages, multilingual methods, and
interdisciplinary work.*
*Important dates*
*
*December 17, 2025: Direct submission deadline*
*
*January 2, 2026: ARR submission deadline*
*
*January 23, 2026: Notification of acceptance*
*
*February 3, 2026: Camera Ready Papers due*
*
*March 24–29, 2026: EACL with WASSA workshop on one of the days.*
*Shared tasks*
*We do not offer a shared task this year.*
*Papers*
*At WASSA 2026, we will accept three types of submissions:*
*For the regular research track we accept long& shortpapers.*
*Additionally, we accept double submissions and double commitment of ARR
reviews in parallel to WASSA and another venue. Please note that you
must immediately withdraw your paper from WASSA if you decide to publish
it elsewhere. Check with the other venue if they also allow double
submissions.*
*Longpapers may consist of up to eight (8) pages of content, with any
number of additional pages of references. A- subset of these papers will
be presented orally.*
*Shortpapers may consist of up to four (4) pages of content, with any
number of additional pages of references. Most of these papers will be
presented as posters.*
*Also this year there is an industry track, for which we accept demo
papers. Demo papersdescribe system demonstrations, ranging from early
prototypes to mature production-ready systems. Commercial sales and
marketing activities are not appropriate for this track. Demo papers may
consist of up to six (6) pages of content, these will be presented as a
poster and should include a live demonstration.*
*Submission procedure and templates*
*Submissions without reviews can be done directly through our Open
Review side
(https://openreview.net/group?id=eacl.org/EACL/2026/Workshop/WASSA).*
*Authors who received reviews already through the ACL Rolling Review
process are invited to commit their reviewed paper to WASSA. To do so,
please go to our ARR Website and click on “ACL 2026 Workshop WASSA
Commitment Submission”. You will then need to add the title, the URL to
the ARR submission with reviews + metareview, and other information.*
*Both long and short papers must be anonymised for double-blind
reviewing, must follow the ACL Author Guidelines
<https://www.aclweb.org/adminwiki/index.php?title=ACL_Author_Guidelines>,
and must use the ACL templates
(https://github.com/acl-org/acl-style-files). The submitting author must
have an OpenReview profile*
*Optional Supplementary Materials: Appendices, Software and Data*
*ARR encourages the submission of these supplementary materials to
improve the reproducibility of results, and to enable authors to provide
additional information that does not fit in the paper. Supplementary
materials may include appendices, software or data. For example, pre
processing decisions, model parameters, feature templates, lengthy
proofs or derivations, pseudocode, sample system inputs/outputs, and
other details that are necessary for the exact replication of the work
described in the paper can be put into appendices. However, if the
pseudo-code or derivations or model specifications are an important part
of the contribution, or if they are important for the reviewers to
assess the technical correctness of the work, they should be a part of
the main paper, and not appear in appendices. Reviewers are not required
to consider material in appendices. Appendices should come after the
references in the submitted pdf, but do not count towards the page
limit. Software should be submitted as a single .tgz or .zip archive,
and data as a separate single .tgz or .zip archive. Supplementary
materials must be fully anonymized to preserve the two-way anonymized
reviewing policy and must not exceed 100MB.*
*Organizers*
*
Jeremy Barnes, University of the Basque Country
Valentin Barriere, University of Chile
Orphée De Clercq, Ghent University
Roman Klinger, University of Bamberg
Célia Nouri, Inria and Sciences Po
Debora Nozza, Bocconi University
Pranaydeep Singh, Ghent University
Contact
Email: wassaworkshop(a)gmail.com
Website: https://workshop-wassa.github.io/
*
--
Prof. Dr. Roman Klinger (he/him)
Professor for Fundamentals of Natural Language Processing
Faculty Information Systems and Applied Computer Sciences (WIAI)
Otto-Friedrich-University Bamberg
Office: Room 02.10; Gutenbergstr. 13; 96050 Bamberg; Germany
Phone: +49 951 863 3320
Mail:roman.klinger@uni-bamberg.de
WWW:
https://www.uni-bamberg.de/nlproc/https://www.romanklinger.de/
======================================
8-12 June 2026
Venice, Italy
http://unive.it/avi2026
======================================
IMPORTANT DATES
- Workshop proposals submission: December 14, 2025
- Notification: December 21, 2025
- Workshop website online: January 12, 2026
(all deadlines are 23:59, AoE)
---------------------------------------------------------------------------
WORKSHOP PROPOSALS
We invite proposals for workshops that will facilitate the exchange of new ideas in all areas related to Advanced Visual Interfaces and Human-Computer Interaction. We invite organizers to propose either half-day or one-day long workshops held between June 8 and 12, 2026, at the AVI2026 venue.
Workshop proposals must be submitted using the following form:
https://forms.gle/v7gEKFchxAGzMJtj8
Workshop Chairs
- Daniela Fogli, Università di Brescia, Italy (daniela.fogli(a)unibs.it)
- Kyle Montague, Northumbria University, UK (kyle.montague(a)northumbria.ac.uk)
---------------------------------------------------------------------------
18th International Conference on Advanced Visual Interfaces (AVI) 2026
Interactive Creativity: Agencies, Interfaces, and Ethics
Since its first edition in 1992, AVI has become an influential space for encounters among scholars and practitioners interested in interfaces, interactions, and experiences. Rooted in pioneering research on visual interfaces characterized by a distinctive attention to the human factor, the conference has evolved across the different waves of Human-Computer Interaction (HCI). It has addressed the pragmatic and hedonic needs of heterogeneous groups of users up to the current challenge of self-actualization. Creativity is a core behavior that leads to the realization of a person’s full potential and the explosion of generative AI presents both opportunities and challenges to human creativity. They address fundamental issues related to agencies, interfaces, and ethics.
AVI 2026 will take place in San Servolo, a small island of Venice. This delicate and fragile ecosystem provides the ideal venue to reflect, reframe, and speculate about creative solutions to more sustainable and rewarding technological futures. AVI is an International Conference considering the nationality of participants, authors, and organizing committees. However, it has always taken place in Italy, thus complementing a strong and diverse research program with carefully selected cultural and social activities alongside a distinct sense of hospitality and conviviality.
The conference is held under the patronage of the Free University of Bozen-Bolzano and Ca’ Foscari University of Venice, Italy.
We look forward to your participation in AVI 2026!
Antonella De Angeli, AVI 2026 General Co-Chair
Albrecht Schmidt, AVI 2026 General Co-Chair
Rosella Gennari, AVI 2026 Program Co-Chair
Fabio Pittarello, AVI 2026 Program Co-Chair
---------------------------------------------------------------------------
MAIN THEMES AND TOPICS
Themes and topics of interest include (but are not limited to) the following:
/ Theme: Interaction Paradigms and Modalities /
Brain-Computer Interaction
Embodied and Tangible Interaction
Material-Centric Interaction
Information Visualization
Screen-based Interaction
Voice and Sound-based Interaction
Multimodal Interaction
/ Theme: Interaction Spaces /
Augmented Reality
Cross Reality
Virtual Reality
Interaction between Black-Boxes
Dynamic Physical Environments
Natural Environments
Urban Places
/ Theme: Human-System Interaction /
Adaptive and Context-Aware Interfaces
Affective Interfaces
Human-Robot Interaction
Intelligent Interfaces
Interfaces and Recommender Systems
/ Theme: Ecosystems of People, Groups and Societies /
Computer-Supported Cooperative Work
Learning Ecosystems
Game and Play Ecosystems
Social Interaction and Cooperation Systems
/ Theme: Values and Moral Principles /
Beyond Human Interaction
Critical Computing
Critical Data Science
Diversity, Equity, and Inclusion
Responsible Design
/ Theme: Applications /
Cultural Heritage and Digital Humanities
End User Development
AI Art and Creativity
Human Factors in Security Systems
Health, Well-being, and Self-Actualization
---------------------------------------------------------------------------
AVI 2026 ORGANIZING COMMITTEE
General Chairs
Antonella De Angeli, Free University of Bozen-Bolzano, Italy (antonella.deangeli(a)unibz.it)
Albrecht Schmidt, Ludwig-Maximilians Munich University, Germany (albrecht.schmidt(a)um.ifi.lmu.de)
Program Chairs
Rosella Gennari, Free University of Bozen-Bolzano, Italy (gennari(a)inf.unibz.it)
Fabio Pittarello, Ca’ Foscari University of Venice, Italy (pitt(a)unive.it)
Long Papers Chairs
Paloma Diaz, Carlos III University, Madrid, Spain (pdp(a)inf.uc3m.es)
Alessandra Melonio, Ca’ Foscari University of Venice, Italy (alessandra.melonio(a)unive.it)
Short Papers Chairs
Luigi De Russis, Politecnico di Torino, Italy (luigi.derussis(a)polito.it)
María Menéndez Blanco, Free University of Bozen-Bolzano, Italy (Maria.MenendezBlanco(a)unibz.it)
Proceedings Chairs
Niccolò Pretto, Free University of Bozen-Bolzano, Italy (niccolo.pretto(a)unibz.it)
Workshops Chairs
Daniela Fogli, Università di Brescia, Italy (daniela.fogli(a)unibs.it)
Kyle Montague, Northumbria University, UK (kyle.montague(a)northumbria.ac.uk)
Interactive Experiences and Demos Chairs
Stefania De Vincentis, Ca’ Foscari University of Venice, Italy (stefania.devincentis(a)unive.it)
Florian Michahelles, TU Wien, Austria (florian.michahelles(a)tuwien.ac.at)
Sebastiano Vascon, Ca’ Foscari University of Venice, Italy (sebastiano.vascon(a)unive.it)
Posters Chairs
Alba Bisante, Sapienza University of Rome, Italy (bisante(a)di.uniroma1.it)
Tanja Doering, TU Berlin, Germany (tanja.doering(a)mms.tu-berlin.de)
Doctoral Consortium Chairs
Rosa Lanzilotti, University of Bari, Italy (rosa.lanzilotti(a)uniba.it)
Monica Divitini, Norwegian University of Science and Technology, Norway (divitini(a)ntnu.no)
Industry Chairs
Fabio Morreale, Sony, Spain (Fabio.Morreale(a)sony.com)
Emanuele Pucci, Politecnico di Milano (emanuele.pucci(a)polimi.it)
Accessibility and Inclusion Chairs
Marco Mores, Free University of Bozen-Bolzano, Italy (marco.mores(a)student.unibz.it)
Teresa Scantamburlo, University of Trieste, Italy (teresa.scantamburlo(a)units.it)
Web Chair
Tommaso Pellegrini, Ca’ Foscari, University of Venice, Italy (tommaso.pellegrini(a)unive.it)
Publicity & Social Networks Chairs
Andrea Rezzani, Free University of Bozen-Bolzano, Italy (andrea.rezzani(a)student.unibz.it)
Mehdi Rizvi, University of East Anglia, UK (mehdi.rizvi(a)uea.ac.uk)
Student Volunteers Chairs
Daniel Bermudez, Free University of Bozen-Bolzano, Italy (JulioDaniel.BermudezChinea(a)student.unibz.it)
Bilal Khan, Free University of Bozen-Bolzano, Italy
(MuhammadBilal.Khan(a)student.unibz.it)
---------------------------------------------------------------------------
AVI STEERING COMMITTEE
Paolo Bottoni,
Sapienza University of Rome, Italy
Cristina Conati
University of British Columbia, Canada
Emanuele Panizzi,
Sapienza University of Rome, Italy
Ilaria Torre
University of Genoa, Italy
Genny Tortora
University of Salerno, Italy
Giuliana Vitiello
University of Salerno, Italy
Gualtiero Volpe
University of Genoa, Italy
Marco Winckler
Université Côte d'Azur, France
--
Informativa sulla Privacy: https://www.unibs.it/it/node/1452
<https://www.unibs.it/it/node/1452>
In this newsletter:
Membership year 2026 publication preview
Fall 2025 data scholarship recipients
New publications:
KAIROS Phase 2 Quizlet<https://catalog.ldc.upenn.edu/LDC2025T15>
BOLT CTS CALLFRIEND CALLHOME Egyptian Arabic Audio<https://catalog.ldc.upenn.edu/LDC2025S09>
BOLT CTS CALLFRIEND CALLHOME Egyptian Arabic Transcripts and Translations<https://catalog.ldc.upenn.edu/LDC2025T14>
________________________________
Membership year 2026 publication preview
The 2026 membership year is approaching and plans for next year's publications are in progress. Among the expected releases are:
* 2012 NIST Speaker Recognition Evaluation Test Set: 10,000+ hours of English conversational telephone speech following the Mixer collection protocol, used in NIST's 2012 speaker recognition evaluation
* KAIROS schema learning corpus background data and Phase 1 evaluation datasets: multimodal English and Spanish source data and annotations for reasoning about complex real-world events
* CALL MY NET 2: 800+ hours of Tunisian-Arabic conversational telephone speech from over 400 speakers to support text independent speaker recognition, used in the 2018 NIST Speaker Recognition Evaluation
* Multi-language conversational telephone speech: multiple releases, hundreds of hours of speech from speakers of confusable linguistic varieties (Arabic, Chinese, English, French, Slavic, Spanish) to support language identification
* CALLHOME Omnibus releases: combined speech and transcript datasets with updated directory structure, file formats and documentation, and lexicons (Chinese, English, German, Japanese, Spanish)
* IARPA MATERIAL language packs: conversational telephone speech, transcripts, English translations, annotations, and queries in multiple languages (e.g., Lithuanian, Pashto, Swahili, Tagalog)
Check your inbox for more information about membership renewal.
Fall 2025 data scholarship recipients
Congratulations to the recipients of LDC's Fall 2025 data scholarships:
Lasidu Dilshan: University of Moratuwa (Sri Lanka): BSc, Electronic and Telecommunication Engineering. Lasidu is awarded a copy of Asian Elephant Vocalizations LDC2010S05 for his work in elephant voice enhancement and classification.
Máté Gedeon: Budapest University of Technology and Economics (Hungary): PhD candidate, Department of Telecommunications and Artificial Intelligence. Máté is awarded a copy of Switchboard-1 Release 2 LDC97S62 for his work in simulated conversation generation.
Ping He: Northeastern University (USA): Student, Khoury College of Computer Sciences. Ping is awarded a copy of ETS Corpus of Non-Native Written English LDC2014T06 for their work in native language identification.
Thiyazen Iskander: Maulana Azad College of Arts, Science & Commerce (India), affiliated with Babasaheb Ambedkar Technological University (India): PhD candidate, Linguistics, Department of English. Thiyazen is awarded copies of Arabic Morphological Analyzer (SAMA) Version 3.1 LDC2010L01 and Arabic Treebank Part 1 v. 4.1 LDC2010T13 for his work in morphosyntactic analysis of short passives in Standard Arabic.
Michael Mooney: University of Glasgow (United Kingdom): PhD candidate, School of Computing Sciences. Michael is awarded copies of Treebank-2 LDC95T7 and BLLIP 1987-89 WSJ Corpus Release LDC2000T43 for their work in eye-tracking for text-centered modeling.
Abraham Sanders: Rensselaer Polytechnic Institute (USA): PhD candidate, Cognitive Science. Abraham is awarded a copy of Switchboard-1 Release 2 LDC97S62 for his work in spoken dialogue systems.
________________________________
New publications:
KAIROS Phase 2 Quizlet<https://catalog.ldc.upenn.edu/LDC2025T15> was developed by LDC and contains English and Spanish text, video and image data, and annotations used for pre-evaluation research and system development during Phase 2 of the DARPA KAIROS program. KAIROS Quizlets were a series of narrowly defined tasks designed to explore specific evaluation objectives enabling KAIROS system developers to exercise individual system components on a small data set prior to the full program evaluation. This corpus contains the complete set of Quizlet data used in Phase 2 which focused on five real-world complex events within the Disease Outbreak scenario.
Source data was collected from the web; 66 root web pages were collected and processed, yielding 65 text data files, 890 image files and 10 video files. Annotation steps included labeling scenario-relevant events and relations for each document to develop a structured representation of temporally ordered events, relations and arguments; generating a reference knowledge graph; and linking labeled entries to a knowledge base derived from a Wikidata-based ontology.
The DARPA KAIROS (Knowledge-directed Artificial Intelligence Reasoning Over Schemas) program aimed to build technology capable of understanding and reasoning about complex real-world events in order to provide actionable insights to end users. KAIROS systems utilized formal event representations in the form of schema libraries that specified the steps, preconditions and constraints for an open set of complex events; schemas were then used in combination with event extraction to characterize and make predictions about real-world events in a large multilingual, multimedia corpus.
2025 members can access this corpus through their LDC accounts. Non-members may license this data for a fee.
*
BOLT CTS CALLFRIEND CALLHOME Egyptian Arabic Audio<https://catalog.ldc.upenn.edu/LDC2025S09> was developed by LDC and consists of 116 hours of speech from 274 unscripted telephone conversations between native speakers of the Arabic dialect spoken in Egypt. The calls were collected by LDC in the CALLFRIEND and CALLHOME series where participants called family members or close friends and spoke on topics of their choice. Around 33% of the recordings (92 calls) are publicly released for the first time. The remaining 182 recordings were previously published by LDC in various CALLFRIEND, CALLHOME, and HUB5 Arabic datasets.
The DARPA BOLT (Broad Operational Language Translation) program developed machine translation and information retrieval for less formal genres, focusing particularly on user-generated content. LDC supported the BOLT program by collecting informal data sources -- discussion forums, conversational telephone speech, text messaging, and chat -- in Chinese, Egyptian Arabic, and English. The material in this release represents the unannotated Egyptian Arabic source conversational telephone speech. The telephone data was transcribed, translated, and annotated for various tasks in the BOLT program including word alignment, treebanking, and co-reference.
2025 members can access this corpus through their LDC accounts. Non-members may license this data for a fee.
*
BOLT CTS CALLFRIEND CALLHOME Egyptian Arabic Transcripts and Translations<https://catalog.ldc.upenn.edu/LDC2025T14> contains transcripts and corresponding English translations for the conversational telephone speech in BOLT CTS CALLFRIEND CALLHOME Egyptian Arabic Audio<https://catalog.ldc.upenn.edu/LDC2025S09> and was developed by LDC to support the DARPA BOLT program.
Transcribers were required to produce a verbatim transcript of all speech within a file using the CODA<https://aclanthology.org/L12-1328/> orthographic approach; diacritics were not included. Some transcripts contain redactions for potential personally identifying information. All speech data was transcribed and is divided into training, development, and evaluation partitions.
The goal of the BOLT translation task was to translate the Arabic transcripts into fluent English while preserving the meaning present in the original Arabic text. Transcripts in the development and evaluation partitions received first pass and gold standard translations. 99% of the transcripts were translated into English.
2025 members can access this corpus through their LDC accounts. Non-members may license this data for a fee.
To unsubscribe from this newsletter, log in to your LDC account<https://catalog.ldc.upenn.edu/login> and uncheck the box next to "Receive Newsletter" under Account Options or contact LDC for assistance.
Membership Coordinator
Linguistic Data Consortium<ldc.upenn.edu>
University of Pennsylvania
T: +1-215-573-1275
E: ldc(a)ldc.upenn.edu<mailto:ldc@ldc.upenn.edu>
M: 3600 Market St. Suite 810
Philadelphia, PA 19104
Call for Papers - AbjadNLP 2026
The 2nd Workshop on NLP for Languages Using Arabic Script
Rabat, Morocco (in-person)
Co-located with EACL 2026
_https://wp.lancs.ac.uk/abjad/_ [1]_ _
AbjadNLP 2026 invites submissions on all aspects of Natural Language
Processing (NLP) for Arabic-script languages, including Arabic and its
dialects, Perso-Arabic languages, and Ajami traditions across Africa and
Asia. Building on the success of AbjadNLP 2025 at COLING, the 2026
edition will be held in Rabat, Morocco, co-located with EACL.
The workshop provides a platform for research in Arabic NLP -- covering
Modern Standard Arabic, Classical Arabic, and dialectal varieties --
while also supporting work on related languages such as Persian, Urdu,
Pashto, Kurdish, Azeri Turkish, Ottoman Turkish, Sindhi, Uyghur, and
African Ajami languages (e.g. Hausa, Fula, Wolofal, Swahili, Kanuri,
Mandingo, and Tamazight).
We welcome contributions on all NLP topics, including morphological
analysis, parsing, translation, LLM adaptation, resources and corpora,
code-switching, sociolinguistic analysis, and low-resource methods.
Submission Deadlines:
Direct submissions: 8 January 2026
ARR submissions: 10 January 2026
Notification: 24 January 2026
Camera-ready: 3 February 2026
Workshop: 24-29 March 2026 (TBC)
AbjadNLP 2026 follows the EACL 2026 submission and formatting
guidelines. Templates are available at
https://github.com/acl-org/acl-style-files [2].
General Chair: Dr Mo El-Haj (VinUniversity & Lancaster University)
More information: https://wp.lancs.ac.uk/abjad/ [3]
--
Amal Haddad Haddad (She/her)
Ayudante doctora
Facultad de Traducción e Interpretación
Universidad de Granada |https://www.ugr.es/personal/amal-haddad-haddad
Lexicon Research Group |http://lexicon.ugr.es/haddad
Co-Convenor, BAAL SIG 'Humans, Machines,
Language'|https://r.jyu.fi/humala
Event Coordinator, BAAL SIG 'Language, Learning and Teaching
===============
Cláusula de Confidencialidad: "Este mensaje se dirige exclusivamente a
su destinatario y puede contener información privilegiada o
confidencial. Si no es Ud. el destinatario indicado, queda notificado de
que la utilización, divulgación o copia sin autorización está prohibida
en virtud de la legislación vigente. Si ha recibido este mensaje por
error, se ruega lo comunique inmediatamente por esta misma vía y proceda
a su destrucción.
This message is intended exclusively for its addressee and may contain
information that is CONFIDENTIAL and protected by professional
privilege. If you are not the intended recipient you are hereby notified
that any dissemination, copy or disclosure of this communication is
strictly prohibited by law. If this message has been received in error,
please immediately notify us via e-mail and delete it"
===============
Links:
------
[1] https://wp.lancs.ac.uk/abjad/
[2]
https://urldefense.com/v3/__https:/github.com/acl-org/acl-style-files__;!!D…
[3]
https://urldefense.com/v3/__https:/wp.lancs.ac.uk/abjad/__;!!D9dNQwwGXtA!Xa…
Dear all,
Please feel free to share this with your contacts.
I am seeking motivated students for a fully funded 4-year PhD in Social NLP. Social NLP is an emerging interdisciplinary field that combines natural language processing (NLP) and social computing.
The scope of doctoral thesis proposals (but is not restricted to) is based on the following topics:
- Information Disorder on Social Media (Fake news, Cyberbullying, Hate Speech, misinformation, etc.)
- Disaster Management Using Social Media
- Explainable AI in Social NLP
Why the University of Tartu<https://ut.ee/en>?
- The University of Tartu is one of the oldest universities (founded in 1632) in Northern and Eastern Europe. The University of Tartu is known for its high-quality studies and research. The unique intellectual atmosphere and fine living and working environment that characterise Tartu have attracted prominent scientists and experts for almost 390 years.
- As of 2025, the University of Tartu belongs to the top 1% of the world's universities.
- The University of Tartu is the strongest research university in Estonia. We belong to various networks of renowned research universities, such as ENLIGHT, the Coimbra Group, LERU, etc.
- Here, 13 000 students and 3000 staff members study and work in a diverse range of fields connected by the power of curiosity.
- Estonia is famous for its digital society and has the highest number of startups per capita in Europe. Estonia is famous for its digital innovations, and it is home to many startups like Skype, Transferwise, and Bolt to name a few.
Scholarship:
- Scholarship approximately 2300 Euro/month gross amount.
- Covers health insurance
- Additional benefits: Travel grant for attending conferences and workshops, etc.
If you are interested, please drop me an email (somnath.banerjee(a)ut.ee) with the following:
- CV which must include:
- final year undergraduate/graduate project (brief description: max 150 words)
- pursued relevant subjects at undergraduate and/or Master level (if any)
- relevant work experience (if any)
- publications (if any)
- The name and email contacts of academic referees(optional)
Please Note:
* Master-level students, who are in their final semester, can also apply.
* Applications will be considered until the position is filled. However, you are encouraged to apply early as the processing of the applications will start soon after receiving them.
* Do not send academic certificates, motivation letters, letters of recommendation, etc.
*
Upon initial selection based on CVs, additional documents (such as motivation letters, certificates) will be requested for further processing.
For queries: If you need more information, please contact me with the subject line "SocialNLP: PhD position queries".
Kind regards,
Somnath Banerjee
Assistant Professor,
Chair of NLP,
Institute of Computer Science,
University of Tartu,
51009 Tartu, ESTONIA
SIGHUM (LaTeCH-CLfL) 2026
The 10th Joint SIGHUM Workshop on Computational Linguistics
for Cultural Heritage, Social Sciences, Humanities and Literature
to be held at EACL in March 2026 in Rabat, Morocco
as a two-day workshop with one on-site and one online day
First Call for Papers (with apologies for cross-posting)
Organizers: Diego Alves, Yuri Bizzoni, Stefania Degaetano-Ortlieb,
Anna Kazantseva, Janis Pagel, Stan Szpakowicz
SIGHUM (LaTeCH-CLfL) 2026 is the tenth in a series of meetings for NLP researchers who work with data from the broadly understood arts, humanities and social sciences, and for specialists in those disciplines who apply NLP techniques in their work. The workshop continues a long tradition of annual events which also host the SIGHUM business meetings.
Workshop site
https://sighum.wordpress.com/events/sighum-latech-clfl-2026/
Important dates
Submission deadline: January 5th, 2026
Notification of acceptance: February 3rd, 2026
Camera-ready paper due: February 10th, 2026
Description
The community of the broadly understood Digital Humanities (DH) has witnessed remarkable growth and transformation, fueled by the rapid advancements in NLP. There is a steady interest in, and a high demand for, NLP methods of semantic and structural annotation, intelligent linking, discovery, querying, cleaning and visualization of primary and secondary data. Even so, the heterogeneous landscape of the DH with their diverse, often multi-lingual or multi-modal sources can be a challenge for NLP. Consider, for example, the growing interest in historical language data and in under-resourced languages.
There are unique obstacles in developing comprehensive language models in aid of the linguistic diversity in DH. The handling of noisy and non-standard data, and the need for domain adaptation and intensive annotation, continue to be at the forefront of research effort in the community. The literary studies, which have witnessed substantial progress in the application of NLP methods, bring their own similar problems. Navigating forms of creative expression requires more than the typical information-seeking tools. A case in point might be the study of literature of a certain period, author or sub-genre, the recognition of certain literary devices, or the quantitative analysis of poetry.
The emergence of Large Language Models (LLMs) expands the DH toolkit. There is support for automatic text cleaning and annotation, creation of semantic resources, analysis of narrative, genre and literary style, and linking information across sources. LLMs can support historical or low-resource languages, particularly when complemented with domain-specific fine-tuning and careful evaluation. One must note, however, that even with careful adaptation, curation and attention to interpretability, LLM outputs remain prone to errors, biases and lack of transparency; that requires rigorous assessment to ensure their suitability for scholarly research.
There is growing emphasis on the importance of explanation in NLP models. That applied equally to DH, whose various domains enjoy the effect of NLP. Transparency and clarity of the results are critical if one is to accept the processed data, and gain valuable insights. That is why one must carefully consider a balance between raw performance scores and interpretability, in keeping with the specific research objectives.
For many years now, this broad research context has drawn together NLP experts, data specialists and researchers in Digital Humanities who work in and across their domains. Our long-standing series of workshops has shown that cross-disciplinary exchange supports work in the Humanities, Social Sciences and Cultural Heritage communities. It encourages the Computational Linguistics community to build rich, effective tools and, above all, interpretable models.
Topics
Our workshops attract original work on a wide variety of topics, including – but as usual not restricted to – these:
adaptation of NLP tools to Cultural Heritage, Social Sciences, Humanities and literature;
automatic error detection and cleaning of textual data;
complex annotation schemas, tools and interfaces;
creation (fully- or semi-automatic) of semantic resources;
creation and analysis of social networks of literary characters;
discourse and narrative analysis/modelling, notably in literature;
emotion analysis for the humanities and for literature;
generation of literary narrative, dialogue or poetry;
identification and analysis of literary genres;
information/knowledge modelling in the Humanities, Social Sciences and Cultural Heritage;
interpretability of large language models output for DH-related tasks (explainable AI);
linking and retrieving information from different sources, media, and domains;
low-resource and historical language processing;
modelling dialogue literary style for generation;
profiling and authorship attribution;
search for scientific and/or scholarly literature;
work with linguistic variation and non-standard or historical use of language
Information for authors
We invite papers on original, unpublished work in the topic areas of the workshop. We will consider long papers, short papers and system descriptions (demos). We also welcome position papers.
Long papers, presenting completed work, may consist of up to eight (8) pages of content plus additional pages of references (just two if possible -:). The final camera-ready versions of accepted long papers will be given one additional page of content (up to 9 pages), so that reviewers’ comments can be taken into account.
A short paper / demo presenting work in progress or the description of a system may consist of up to four (4) pages of content plus additional pages of references (one if you can). Upon acceptance, short papers will be given five (5) content pages in the proceedings.
A position paper — clearly marked as such — should not exceed eight (8) pages including references.
All submissions are to follow the *ACL paper styles (for LaTeX / Overleaf and MS Word) available at https://github.com/acl-org/acl-style-files <https://github.com/acl-org/acl-style-files>. Papers should be submitted electronically, only in PDF, via the LaTeCH-CLfL 2026 submission website on the SoftConf pages (we will publish the link as soon as we have it).
Reviewing will be double-blind. Please do not include the authors’ names and affiliations, or any references to Web sites, project names, acknowledgements and so on — anything that immediately reveals the authors’ identity. Please keep references to your own work at a reasonable minimum, and do not use anonymous citations.
In accordance with the EACL 2026 policy on multiple submission, we will not consider any paper that is under review in a journal or another conference at the time of submission. During the review period, papers submitted to our workshop cannot also be submitted elsewhere.
Stefania Degaetano-Ortlieb
Associate Professor
Universität des Saarlandes
Language Science and Technology
Campus A2.2, 1.06
66123 Saarbrücken
Tel.: ++49 681 302 70077
E-Mail: s.degaetano(a)mx.uni-saarland.de
www.stefaniadegaetano.com
This is a reminder that the deadline to submit tutorial proposals for
EACL'26 & ACL'26 is coming soon: October 20, 2025!
* Cutting-edge tutorials in CL/NLP: covering recent advances in emerging
areas not previously addressed in tutorials at EACL, NAACL-HLT, ACL, or
EMNLP
* Introductory tutorials in related fields: offering overviews of
disciplines potentially relevant to the CL/NLP community, such as
linguistics, bioinformatics, machine learning, human-computer
interaction, or applications of large language models in non-English
languages.
More information:
https://aclweb.org/portal/content/joint-call-tutorial-proposals-eaclacl-2026
[1]
Links:
------
[1]
https://aclweb.org/portal/content/joint-call-tutorial-proposals-eaclacl-2026