The Department of Software Science, Tallinn University of Technology is seeking for 2 positions on associate professor level, but good candidates at assistant and full professor level are encouraged to contact us as well.
Our department is currently welcoming new people in software engineering and data science fields.
Details of the job and on what it is like to work in Estonia can be found here:
Professor of Data Science<https://candidate.recrur.com/public/jobad/en/d99ed3c8-5>
Professor of Software Engineering<https://candidate.recrur.com/public/jobad/en/fedc5c5d-1>
Our department has strong research groups in programming languages, compositional systems, natural language processing, cyber security and applied AI among others and we are leading many study programs both at graduate and undergraduate level.
--
Marko Kääramees
Head of department of Software Science
Tallinn University of Technology
marko.kaaramees(a)taltech.ee
(Apologies for crossposting)
Dear Colleague
We are pleased to announce that CSEE&T 2026 (The 38th International
Conference on Software Engineering Education and Training) will be held at
The University of Florence, Italy, from July 20 - 22.
The CFP is posted at https://cseet26.techconf.org/download/CFP-CSEE&T-2026.pdf<http://www.cs.ucy.ac.cy/~george/GPLists_2021/lm.php?tk=Y29ycG9yYQkJCWNvcnBv…>
Please refer to the conference website for the most recent updates.
If you have any questions related to paper submission to the main conference,
please send emails to the Program Chairs:
Professor Matthew Barr
Professor Lin Liu, or
Professor Rafal Wlodarski.
For the Academy for Software Engineering Education & Training (ASEE&T) Workshop,
please visit its website or contact Professor Nancy Mead and Professor Hossein Saiedian.
For other inquiries, please get in touch with the CSEET 2026 Secretariat.
We look forward to working with you for a successful conference.
CSEE&T Secretariat
Dear editor,
I am Bin Li, one of the organizers of EvaHan2026. Would you spread this CFP to the corpora list? Thank you so much!
--
Best wishes!
Bin Li
Phone: (86)13813878144
Homepage: http://cognitivebase.com/lib/
School of Chinese Language and Literature,
Nanjing Normal University,China
CFP | EvaHan2026 Ancient Chinese OCR Shared Tasks
EvaHan 2026
https://github.com/GoThereGit/EvaHan
EvaHan 2026 is the Fifth International Evaluation of Ancient Chinese Information Processing, focusing on OCR tasks for multimodal large language models in ancient Chinese.
Co-organized with LT4HALA 2026@LREC 2026, which will be held from May 11 to 16, 2026, in Mallorca, Spain.
EvaHan 2026 is organized by Dongbo Wang, Bin Li, Minuxan Feng, Chao Xu, Weiguang Qu, Liu Liu, Si Shen.
Previous Tasks:
EvaHan 2022
The First Bake-off of Ancient Chinese Automatic Processing was successfully held in Marseille, France, in 2022, with a focus on automatic word segmentation and part-of-speech tagging of ancient Chinese.
EvaHan 2023
The Second Bake-off of Ancient Chinese Automatic Processing was successfully held in Macau, China, in 2023, with a focus on machine translation of ancient Chinese.
EvaHan 2024
The Third Bake-off of Ancient Chinese Automatic Processing was held in Turin, Italy, in 2024, with a focus on automatic sentence segmentation and punctuation of ancient Chinese.
EvaHan 2025
The Fourth Bake-off of Ancient Chinese Automatic Processing was held in New Mexico, USA, in 2025, with a focus on named entity recognition in ancient Chinese.
Important Dates for EvaHan 2026:
Registration deadline: January 30, 2026
Training data release: January 1, 2026
Test data release: February 1, 2026
Running results submission: February 6, 2026
Technical report submission deadline: February 28, 2026
Notification of acceptance: March 1, 2026
Camera-ready papers due: March 10, 2026
Participation
To participate in EvaHan 2026, you must complete the following steps:
Registration:
Submit a registration form to officially register your team for the task. Registration is open from December 1, 2025, to January 30, 2026. Only registered participants will gain access to the training dataset.
Accessing the Training Data:
After completing the registration process, participants will receive instructions for downloading the training dataset, which includes image--text pairs from ancient Chinese texts for OCR.
Submitting Results and Reports:
Participants must use the provided test data to generate results and submit their system outputs and a technical report as per the shared task schedule.
For inquiries or to request the registration form, please contact us at evahan2026(a)gmail.com.
Data
The Evahan 2026 dataset comprises three datasets, covering image-text pairs: plain text images, mixed image-text images, and handwritten images-text. The data underwent initial automatic annotation, followed by meticulous correction and refinement by experts in classical Chinese language and history to ensure the highest quality of the training materials and gold-standard texts.
● Dataset A ( Printed Texts) consists of data selected from the Siku Quanshu (Complete Library of the Four Treasuries), including classics, history, philosophy, and literature, as well as various other ancient books.
● Dataset B (Mixed Layouts) contains mixed image-text data selected from the Siku Quanshu and other ancient books.
● Dataset C (Handwritten Texts) includes handwritten ancient books, primarily the Chinese Buddhist canon, including the Chinese Buddhist canon (TKH) dataset, and the Chinese Buddhist canon (MTH) dataset.
Training Data The training set consists of designated portions of subsets A, B, and C. All training samples are provided in image-text pair format, with text in Traditional Chinese (UTF-8), approximately 5000-10000 image-text pairs per subset. Registered participants will receive the training data via email.
Test Data The test set includes the remaining unseen portions of subsets A, B, and C to ensure comprehensive evaluation of all three challenge types. The data is also provided in image-text pair format, approximately 200-500 image-text pairs per subset. Detailed information and a download link for the test data will be provided to participants before the start of the formal evaluation period.
Task
This section offers a detailed description of the tasks encompassed in EvaHan 2026.
OCR
In many Chinese language processing systems,OCR is a critical task, often performed in parallel with other processing functions. The accuracy and speed of OCR directly determine the overall system's performance and user experience in downstream applications such as document digitization, information extraction, and intelligent retrieval.
Evaluation
Metrics
Each team will only have access to the training data. Later, unlabeled test data will also be released. After the evaluation is complete, the labels for the test data will also be released. Tables 2,3 and 4 provide examples of the scorer output. The evaluation will align the system-generated text with the gold standard. Next, OCR will be evaluated: precision, recall, and F1 score will be calculated. BLEU ROUGE-1, ROUGE-2, and ROUGE-L will also be evaluated, bringing the competition's evaluation to multiple metrics. This evaluation adds layout analysis metrics: mAP and IoU. T he team's final ranking will be based on the overall score. The final ranking of teams will be based on the combined scores.
Two Modalities
Each participant can submit results for both modes. In the closed mode, each team has limited resources. Each team can only use training data and a pre-trained model. This model is a word embedding pre-trained on a large Traditional Chinese corpus. No other resources are allowed in the closed mode.
In the open mode, there are no restrictions on resources, data, or models. Annotated external data, such as processed images or text, may be used. However, each team must disclose all resources, data, and models used in each system in the final report.
How to Participate
Registration time is mentioned above. Participants will be required to submit their runs and to provide a technical report for the task they participated in.
Submitting Runs
Each team can submit runs for two tasks. A run should be produced according to the closed modality. The second run will be produced according to the open modality. The closed run is compulsory, while the open run is optional.
Once the system has produced the results for the task over the test set, participants have to follow these instructions to complete their submission:
The annotated results should be submitted as three plain text files encoded in UTF-8 (four-byte encoding). The specific submission format will be released along with the pre-trained dataset.
Organizers
Dongbo Wang, College of Information Management, Nanjing Agricultural University, China
Bin Li, School of Chinese Language and Literature, Nanjing Normal University, China
Minxuan Feng, School of Chinese Language and Literature, Nanjing Normal University, China
Chao Xu, School of Chinese Language and Literature, Nanjing Normal University, China
Weiguang Qu, School of Computer and Electronic Information /School of Artificial Intelligence, Nanjing Normal University, China
Liu Liu, College of Information Management, Nanjing Agricultural University, China
Si Shen, School of Economics and Management, Nanjing University of Science and Technology, China
Student Members
Dongmei Zhu, College of Information Management, Nanjing Agricultural University, China
Jieqiong Li, College of Information Management, Nanjing Agricultural University, China
Ruifeng Wu,College of Information Management, Nanjing Agricultural University, China
Junyi Yang,College of Information Management, Nanjing Agricultural University, China
Zhixing Xu, School of Chinese Language and Literature, Nanjing Normal University, China
Junjie Li, School of Chinese Language and Literature, Nanjing Normal University, China
Yue Zhu, School of Chinese Language and Literature, Nanjing Normal University, China
Mengting Xu, School of Chinese Language and Literature, Nanjing Normal University, China
Call for Papers
4th Int. Workshop on AI and Semantic Technologies for Scientific, Technical, and Legal Web co-located with The Web Conference 2026
Workshop: April 13 or 14, 2026 - Dubai, UAE
https://semtech4stld.github.io/
-----------------------------------------------------
Important Dates
-----------------------------------------------------
Abstract Submissions: January 5th, 2026
Paper Submissions: January 12th, 2026
Notifications: January 25th, 2026
Camera-Ready Contributions: February 2nd, 2026
Workshop: April 13 or 14, 2026 - Dubai, UAE
All deadlines are 11:59 pm, AoE time (Anywhere on Earth).
------------------------------------------------------
Workshop Aims and Scope
------------------------------------------------------
The SemTech 2026 workshop focuses on methods that combine Semantic Web technologies, Natural Language Processing, Large Language Models (LLMs), and other AI technologies to model knowledge across scientific, technical, and legal domains. The workshop invites research on knowledge graph creation, semantic annotation, LLM–KG hybrid reasoning, and trustworthy AI pipelines that enhance the reliability, interpretability, and reuse of Web data. This is particularly timely as the Web community seeks robust approaches to integrate symbolic and sub-symbolic methods for managing and understanding the growing body of domain-specific knowledge on the Web
-------------------------------------------------------
Workshop Topics
-------------------------------------------------------
We invite contributions on topics related to Semantic Web technologies and deep learning, particularly in the context of scientific, technical, and legal data. Areas of interest include, but are not limited to, the following areas:
Data Collection
- Leveraging LLMs for generating scientific, technical, and legal data.
- New tools and systems for capturing scientific, technical, and legal data, such as scientific articles, patent publications, etc.
- Procedures and tools for storing, sharing, and preserving data on the Web.
- Collecting and sharing data sets such as benchmarks, etc.
- Pipelines and protocols to capture peculiarities from Web data.
- Employing Semantic Web Technologies to represent and preserve sensitive data in terms of ethics, privacy, security, and trust on the Web.
Novel Semantic Technologies for scientific, technical, and legal web:
- Ontologies and annotation schemas to model such data.
- Annotation, linking, and disambiguation of the data.
- Knowledge graph construction.
- LLMs to generate metadata, vocabularies, ontologies, and semantic models for specific data.
Applications for patents, scientific, technical, and legal web:
- Applications based on Generative AI and LLMs.
- Exploiting knowledge graphs for document similarity, question answering, search, etc.
- Semantic content-based retrieval.
- Natural language processing techniques for classification, summarization, etc.
- Exploratory search using semantic technologies on scientific, technical, and legal data.
- Key enabling tools (also based on LLMs) for accessing and using data on the Web.
- Lessons learned and use cases from both academia and industry around semantic models and LLMs for data in specific domains.
-------------------------------------------------------
Submission Details
-------------------------------------------------------
Formatting Requirements. Submissions must be written in English, in double-column format, and must adhere to the ACM template and format (also available in Overleaf). The review process will follow a single-blind protocol.
Key Participation Requirement: At least one author per paper must be registered for the workshop, attend in person, and present their work.
- Full Research Papers (6-8 pages maximum) should be clearly placed with respect to the state of the art and state the contribution of the proposal in the domain of application, even if presenting preliminary results. In particular, research papers should describe the methodology in detail, experiments should be repeatable, and a comparison with the existing approaches in the literature is encouraged.
- Replicability/Reproducibility papers (4 pages) should involve repeating prior experiments using the source code and datasets to analyze existing methods and their limitations. Alternatively, authors may assess the robustness of previous work by applying the original code in new contexts, such as different domains or datasets.
- Short Papers (4 pages) should describe significant novel work in progress. Compared to full papers, their contribution may be narrower in scope, be applied to a narrower set of application domains, or have weaker empirical support than that expected for a full paper. Submissions likely to generate discussions in new and emerging areas of legal data are encouraged.
Submissions should not exceed the indicated number of pages, including any diagrams and references.
PROCEEDINGS PUBLICATION Due to conference policy, papers accepted by the workshop will be included in the Companion Proceedings of the Web Conference 2026, which are archived in the ACM Digital Library, subject to meeting the ACM open-access, formatting guidelines, and camera-ready timeline as provided and observed by the ACM Web Conference. See the section Important update on ACM's new open access publishing model for 2026 ACM Conferences! on the conference website.
---------------------------------------------------------
Workshop Chairs
---------------------------------------------------------
Rima Dessi´
Higher Colleges of Technology, United Arab Emirates (UAE)
Hidir Aras
FIZ Karlsruhe - Leibniz Institute for Information Infrastructure, Germany
Jeenu Joy
FIZ Karlsruhe - Leibniz Institute for Information Infrastructure, Germany
Danilo Dessi´
Department of Computer Science, College of Computing and Informatics, University of Sharjah, Sharjah, UAE
Francesco Osborne
The Open University, Milton Keynes, United Kingdom
-----------------------------------------------------------
Contacts
-----------------------------------------------------------
For general inquiries on the workshop, please send an email to ddessi(a)sharjah.ac.ae
*To be held at EACL 2026 (March 24-29 in Rabat, Morocco)*
*Workshop description*
The 8th SIGTYP Workshop aims to provide a forum for bridging linguistic
typology, multilingual NLP, and adjacent areas to develop truly
multilingual NLP methods. The workshop raises awareness of linguistic
typology and its potential to broaden the global reach of multilingual NLP
and introduces computational approaches to typology. We welcome open
problems and discussion, inviting contributions from researchers in
multilingual/cross-lingual NLP and leading scholars in linguistic typology.
In 2026, we place a special emphasis on the utility of LLMs for typological
research.
*SIGTYP is the first dedicated venue for typology-related research and its
integration in multilingual NLP. Appropriate topics include (but are not
limited to):*
- *Integration of typological features in language transfer and joint
multilingual learning. *Beyond techniques such as “selective sharing,”
what other ways can we encode heterogeneous external knowledge in ML
algorithms?
- *Development of unified taxonomy and resources. *Building universal
databases/models to support the understanding and processing of diverse
languages.
- *Automatic inference of typological features. *Pros/cons of existing
techniques (e.g., heuristics from morphosyntactic annotation, propagation
from related languages, supervised Bayesian/neural models) and emerging
approaches.
- *Typology and interpretability. *Using typological knowledge to
interpret hidden representations of multilingual models, guide multilingual
data generation/selection, and annotate texts.
- *Improvement and completion of typological databases. *Combining
linguistic expertise with data-driven methods to advance knowledge of
cross-linguistic variation and universals.
- *Linguistic diversity and universals; cross-lingual annotation. *Which
phenomena/categories should be considered universal? How should they be
annotated?
- *Using LLMs for typological studies. *Can LLMs help formulate/test
typological hypotheses? Can they make valid cross-linguistic
generalisations?
-
- *Additional topics include* constructed language generation, universals
in diachronic language change, information-theoretic approaches to
typology, and automated approaches to etymology.
*Important Dates (23:59 AoE)*
- *Direct submission deadline: December 26, 2025*
- *Pre-reviewed (ARR) submission deadline: January 2, 2026*
- *Notification of acceptance: January 23, 2026*
- *Camera-ready deadline: February 3, 2026*
- *Workshop date: During EACL 2026 (March 24–29, 2026; exact day TBA)*
*Submissions*
We invite extended *abstract submissions (non-archival) *and *general paper
submissions (archival)*. The accepted submissions will be presented at the
workshop, providing new insights and ideas. Extended abstracts should
describe already published work or work in progress and should *not exceed
two (2) pages*. This way, we will not discourage researchers from
preferring main conference proceedings, while ensuring that engaging and
thought-provoking research is presented at the workshop. For general
(archival) submissions, we accept both long and short papers. Short papers
should* not exceed four (4) pages, long papers should not exceed eight (8)
pages.* Unlimited additional pages are allowed for the references section
in all submission types.
*Submissions should be anonymous, without authors or an acknowledgement
section; self-citations should appear in third person.*
*Format: *
Submissions must follow the ACL 2025 stylesheet (
https://github.com/acl-org/acl-style-files), and both long and short paper
submissions must follow the two-column format of ACL proceedings. All
submissions must be in PDF format.
Submission Link:
https://openreview.net/group?id=eacl.org/EACL/2026/Workshop/SIGTYP
*SIGTYP 2026: *https://sigtyp.github.io/
*Organizing Committee*
Priya Rani, Michael Hahn, Andreas Shcherbakov, Oleg Serikov, Alexey
Sorokin, Ryan Cotterell and Kat Vylomova
*Anti-harassment policy*
The workshop follows the ACL anti-harassment policy:
https://www.aclweb.org/adminwiki/index.php?title=Anti-Harassment_Policy.
*Contact*
For any inquiries regarding the workshop, please send an email to the
Organising Committee at sigtyp(a)gmail.com
Regards,
Priya.
*Apologies for cross-posting*
*LLMs4SSH: Shaping Multilingual, Multimodal AI for the Social Sciences and
Humanities*
Workshop at LREC 2026 – 11 May 2026
📍 Location: LREC 2026, Mallorca, Spain
🗓 Date: 11 May 2026
🌐 Website: https://sites.google.com/view/llms4ssh-lrec2026/home
<https://sites.google.com/view/llms4ssh-lrec2026/home>
------------------------------
Description
Large Language Models (LLMs) are rapidly advancing in terms of
multilinguality, multimodality, reasoning, and agentic capabilities. While
these developments have transformed language and data processing, their
potential remains insufficiently aligned with the specific research needs,
methodologies, and epistemic values of the Social Sciences and Humanities
(SSH).
The LLMs4SSH workshop aims to bridge the gap between Language Technologies
(LT) and SSH, bringing together researchers from AI, NLP, Digital
Humanities, linguistics, social sciences, cultural heritage, and related
fields. The workshop will provide a forum to explore how multilingual,
multimodal, and reasoning-oriented LLMs can support SSH research tasks,
while critically examining their methodological, ethical, and societal
implications.
------------------------------
Topics
We invite submissions on topics including, but not limited to:
Multilinguality and Cultural Diversity
-
Multilingual LLMs for low-resource and underrepresented languages
-
Cross-lingual transfer and cultural adaptation
-
Bias, fairness, and inclusivity
-
Translation, code-switching, and cross-cultural communication
Multimodality in SSH Research
-
Integration of text, image, audio, and speech modalities
-
LLMs for multimodal archives, museum collections, and heritage data
-
Generative and interpretive multimodal applications in humanities
research
Reasoning and Agentic LLMs
-
Reasoning and interpretive capacities for SSH tasks (e.g. argumentation,
causality, hermeneutics)
-
Agentic LLMs as autonomous or collaborative research assistants
-
Tool use, planning, and hypothesis generation
-
Epistemological and ethical implications of reasoning and agency
Methodologies and Evaluation
-
SSH-oriented datasets, benchmarks, and evaluation metrics
-
Explainability, transparency, and reproducibility
-
Human–AI co-creation and qualitative evaluation methods
-
Comparative studies across languages, domains, and modalities
Applications in SSH Domains
-
Digital humanities and cultural analytics
-
Political science, sociology, and media studies
-
History, philosophy, law, psychology, education, and ethics
FAIR, Ethics, and Governance
-
Responsible and participatory design of LLMs
-
Ethical challenges in data use, authorship, and accountability
-
Policy frameworks for open and inclusive AI infrastructures
Interdisciplinary Collaboration and Infrastructure
-
Co-design of tools and workflows between LT and SSH
-
Integration of LLMs into research infrastructures and curricula
-
Open science initiatives and community building
------------------------------
Submission Types
We welcome:
-
Short research papers
-
Position papers
-
Case studies
-
Posters and demos
------------------------------
Submission Information
Submissions must be made electronically through the START Conference
Manager (Softconf) system:
🔗 Submission site: https://softconf.com/lrec2026/LLMs4SSH
<https://softconf.com/lrec2026/LLMs4SSH>
All submissions should follow the official LREC 2026 workshop
guidelines regarding
format, length, and anonymity. Accepted papers will be presented at the
workshop, and details regarding proceedings publication will be announced
on the workshop website.
------------------------------
Important Dates
-
Submission Deadline: 23 February 2026
-
Review Period: 2–23 March 2026
-
Notification of Acceptance: 30 March 2026
-
Workshop Date: 11 May 2026
------------------------------
Workshop Format
The full-day workshop will include:
-
An invited keynote from the Language Technologies community
-
Oral paper sessions on foundations, methodologies, and applications
-
Poster and demo sessions
-
Breakout groups and roundtable discussions
-
A closing panel to summarize insights and outline future research
directions
Hybrid participation options will be available to foster inclusivity.
------------------------------
Organizing committee
Jeremy Barnes (HiTZ Center, EHU)
Elena Battaner (URJC)
Joanna Blochowiak (University of Zurich)
Cristina Grisot (University of Zurich)
Arturo Montejo-Ráez (University of Jaén)
Nikola Ljubešić (Jožef Stefan Institute)
Maciej Piasecki (Wrocław University of Science and Technology)
German Rigau (HiTZ Center, EHU)
Marko Tadić (University of Zagreb)
Friedel Wolff (SADiLaR, North-West University)
------------------------------
Diversity and Inclusion
The workshop actively supports:
-
Underrepresented languages and research communities
-
Geographic and gender diversity
-
Interdisciplinary collaboration across AI, LT, and SSH
[image: Universidad de Jaén] <https://www.ujaen.es/> Arturo Montejo Ráez
Profesor Titular de Universidad | Associated Professor (Tenured)
amontejo(a)ujaen.es
Universidad de Jaén
Departamento de Informática, A3-114
Las Lagunillas s/n, 23071 - Jaén (Spain)
+34 953 212 882
<https://www.ujaen.es/servicios/sinformatica/sites/servicio_sinformatica/fil…>
ORCID: http://orcid.org/0000-0002-8643-2714
Researcher ID: D-3387-2009
SINAI Research Group <https://sinai.ujaen.es>
[image: Universidad de Jaén] <https://www.ujaen.es/> *Antes de imprimir
este mensaje, piense si es necesario. Proteger el medio ambiente es cosa de
todos.*
*** CLÁUSULA DE CONFIDENCIALIDAD ***
Este mensaje se dirige exclusivamente a su destinatario y puede contener
información privilegiada o confidencial. Si no es Ud. el destinatario
indicado, queda notificado de que la utilización, divulgación o copia sin
autorización está prohibida en virtud de la legislación vigente. Si ha
recibido este mensaje por error, se ruega lo comunique inmediatamente por
esta misma vía y proceda a su destrucción.
This message is intended exclusively for its recipient and may contain
information that is CONFIDENTIAL. If you are not the intended recipient you
are hereby notified that any dissemination, copy or disclosure of this
communication is strictly prohibited by law. If this message has been
received by mistake, please let us know immediately via e-mail and delete
it.
*LLMs4SSH: Shaping Multilingual, Multimodal AI for the Social Sciences and
Humanities*
Workshop at LREC 2026 – 11 May 2026
📍 Location: LREC 2026, Mallorca, Spain
🗓 Date: 11 May 2026
🌐 Website: https://sites.google.com/view/llms4ssh-lrec2026/home
------------------------------
Description
Large Language Models (LLMs) are rapidly advancing in terms of
multilinguality, multimodality, reasoning, and agentic capabilities. While
these developments have transformed language and data processing, their
potential remains insufficiently aligned with the specific research needs,
methodologies, and epistemic values of the Social Sciences and Humanities
(SSH).
The LLMs4SSH workshop aims to bridge the gap between Language Technologies
(LT) and SSH, bringing together researchers from AI, NLP, Digital
Humanities, linguistics, social sciences, cultural heritage, and related
fields. The workshop will provide a forum to explore how multilingual,
multimodal, and reasoning-oriented LLMs can support SSH research tasks,
while critically examining their methodological, ethical, and societal
implications.
------------------------------
Topics
We invite submissions on topics including, but not limited to:
Multilinguality and Cultural Diversity
-
Multilingual LLMs for low-resource and underrepresented languages
-
Cross-lingual transfer and cultural adaptation
-
Bias, fairness, and inclusivity
-
Translation, code-switching, and cross-cultural communication
Multimodality in SSH Research
-
Integration of text, image, audio, and speech modalities
-
LLMs for multimodal archives, museum collections, and heritage data
-
Generative and interpretive multimodal applications in humanities
research
Reasoning and Agentic LLMs
-
Reasoning and interpretive capacities for SSH tasks (e.g. argumentation,
causality, hermeneutics)
-
Agentic LLMs as autonomous or collaborative research assistants
-
Tool use, planning, and hypothesis generation
-
Epistemological and ethical implications of reasoning and agency
Methodologies and Evaluation
-
SSH-oriented datasets, benchmarks, and evaluation metrics
-
Explainability, transparency, and reproducibility
-
Human–AI co-creation and qualitative evaluation methods
-
Comparative studies across languages, domains, and modalities
Applications in SSH Domains
-
Digital humanities and cultural analytics
-
Political science, sociology, and media studies
-
History, philosophy, law, psychology, education, and ethics
FAIR, Ethics, and Governance
-
Responsible and participatory design of LLMs
-
Ethical challenges in data use, authorship, and accountability
-
Policy frameworks for open and inclusive AI infrastructures
Interdisciplinary Collaboration and Infrastructure
-
Co-design of tools and workflows between LT and SSH
-
Integration of LLMs into research infrastructures and curricula
-
Open science initiatives and community building
------------------------------
Submission Types
We welcome:
-
Short research papers
-
Position papers
-
Case studies
-
Posters and demos
------------------------------
Submission Information
Submissions must be made electronically through the START Conference
Manager (Softconf) system:
🔗 Submission site: https://softconf.com/lrec2026/LLMs4SSH
All submissions should follow the official LREC 2026 workshop guidelines
regarding format, length, and anonymity. Accepted papers will be presented
at the workshop, and details regarding proceedings publication will be
announced on the workshop website.
------------------------------
Important Dates
-
Submission Deadline: 23 February 2026
-
Review Period: 2–23 March 2026
-
Notification of Acceptance: 30 March 2026
-
Workshop Date: 11 May 2026
------------------------------
Workshop Format
The full-day workshop will include:
-
An invited keynote from the Language Technologies community
-
Oral paper sessions on foundations, methodologies, and applications
-
Poster and demo sessions
-
Breakout groups and roundtable discussions
-
A closing panel to summarize insights and outline future research
directions
Hybrid participation options will be available to foster inclusivity.
------------------------------
Organizing committee
Jeremy Barnes (HiTZ Center, EHU)
Elena Battaner (URJC)
Joanna Blochowiak (University of Zurich)
Cristina Grisot (University of Zurich)
Arturo Montejo-Ráez (University of Jaén)
Nikola Ljubešić (Jožef Stefan Institute)
Maciej Piasecki (Wrocław University of Science and Technology)
German Rigau (HiTZ Center, EHU)
Marko Tadić (University of Zagreb)
Friedel Wolff (SADiLaR, North-West University)
------------------------------
Diversity and Inclusion
The workshop actively supports:
-
Underrepresented languages and research communities
-
Geographic and gender diversity
-
Interdisciplinary collaboration across AI, LT, and SSH
[image: Universidad de Jaén] <https://www.ujaen.es/> Arturo Montejo Ráez
Profesor Titular de Universidad | Associated Professor (Tenured)
amontejo(a)ujaen.es
Universidad de Jaén
Departamento de Informática, A3-114
Las Lagunillas s/n, 23071 - Jaén (Spain)
+34 953 212 882
<https://www.ujaen.es/servicios/sinformatica/sites/servicio_sinformatica/fil…>
ORCID: http://orcid.org/0000-0002-8643-2714
Researcher ID: D-3387-2009
SINAI Research Group <https://sinai.ujaen.es>
[image: Universidad de Jaén] <https://www.ujaen.es/> *Antes de imprimir
este mensaje, piense si es necesario. Proteger el medio ambiente es cosa de
todos.*
*** CLÁUSULA DE CONFIDENCIALIDAD ***
Este mensaje se dirige exclusivamente a su destinatario y puede contener
información privilegiada o confidencial. Si no es Ud. el destinatario
indicado, queda notificado de que la utilización, divulgación o copia sin
autorización está prohibida en virtud de la legislación vigente. Si ha
recibido este mensaje por error, se ruega lo comunique inmediatamente por
esta misma vía y proceda a su destrucción.
This message is intended exclusively for its recipient and may contain
information that is CONFIDENTIAL. If you are not the intended recipient you
are hereby notified that any dissemination, copy or disclosure of this
communication is strictly prohibited by law. If this message has been
received by mistake, please let us know immediately via e-mail and delete
it.
Apologies for cross-posting
---------------------------------------------------------------------------
*The Ninth Workshop on Technologies for Machine Translation of Low-Resource
Languages (LoResMT 2026)*
*https://www.loresmt.org/ <https://www.loresmt.org/>*
*@ EACL 2026 (March 24-29, 2026)*
*Rabat, Morocco*
*SUBMISSION*
ARR submission link:
https://openreview.net/group?id=eacl.org/EACL/2026/Workshop/LoResMT
*TIMELINE*
- Submission deadline: December 19, 2025 (Anywhere on Earth)
- Pre-reviewed (ARR) submission deadline: January 2, 2026
- Notification of acceptance: January 23, 2026
- Camera-ready paper due: February 3, 2026 (Anywhere on Earth)
- Pre-recorded video due (hard deadline): February 24, 2026
- Workshop dates at EACL 2026: TBD
- EACL 202 Main Conference: March 24-29, 2026
*SCOPE*
Based on the success of past low-resource machine translation (MT)
workshops at AMTA 2018, MT Summit 2019, AACL-IJCNLP 2020, AMTA 2021, COLING
2022, EACL 2023, ACL 2024, NAACL 2025, we introduce LoResMT 2026 workshop
at EACL 2025. The workshop provides a discussion panel for researchers
working on MT systems/methods for low-resource and under-represented
languages in general. We would like to help review/overview the state of MT
for low-resource languages and define the most important directions.
Fundamental work on low-resource languages in MT and NLP is still crucial
and unavoidable. We also solicit papers dedicated to supplementary natural
language processing (NLP) tools that are used in any language and
especially in low-resource languages. Overview papers of these NLP tools
are very welcome. It will be beneficial if the evaluations of these tools
in research papers include their impact on the quality of MT output.
*TOPICS*
We are highly interested in (1) original research papers, (2)
review/opinion papers, and (3) online systems on the topics below; however,
we welcome all novel ideas that cover research on low-resource languages.
- Neural machine translation for low-resource languages
- Work that presents online systems for practical use by native speakers
- Word tokenizers/de-tokenizers for specific languages
- Word/morpheme segmenters for specific languages
- Alignment/Re-ordering tools for specific language pairs
- Use of morphology analyzers and/or morpheme segmenters in MT
- Multilingual/cross-lingual NLP tools for MT
- Corpora creation and curation technologies for low-resource languages
- COVID-related corpora, their translations and corresponding NLP/MT systems
- Review of available parallel corpora for low-resource languages
- Research and review papers of MT methods for low-resource languages
- MT systems/methods (e.g. rule-based, SMT, NMT) for low-resource languages
- Pivot MT for low-resource languages
- Zero-shot MT for low-resource languages
- Fast building of MT systems for low-resource languages
- Re-usability of existing MT systems for low-resource languages
- Machine translation for language preservation
*SUBMISSION INFORMATION*
We are soliciting two types of submissions: (1) research, review, and
position papers and (2) system demonstration papers. For research, review
and position papers, the length of each paper should be at least four (4)
and not exceed eight (8) pages, plus unlimited pages for references. For
system demonstration papers, the limit is four (4) pages. Submissions
should be formatted according to the official ACL style templates
(Overleaf). Please refer to the EACL submission guidelines for further
information <https://2026.eacl.org/calls/papers/>. Accepted papers will be
published online in the EACL 2026 proceedings and will be presented at the
conference.
Submissions must be anonymized and should be done using the provided
submission system. Scientific papers that have been or will be submitted to
other venues must be declared as such and must be withdrawn from the other
venues if accepted and published at LoResMT. The review will be
double-blind. Authors of an accepted paper should present their paper in
person at EACL 2026. Papers should be submitted in PDF to the LoResMT Open
Review.
We would like to encourage authors to cite papers written in ANY language
that are related to the topics, as long as both original bibliographic
items and their corresponding English translations are provided.
Registration is handled by the main conference (
https://2026.eacl.org/registration).
*Shared Tasks: *
We are going to have a machine translation competition for several low
resource Turkic languages, namely, Chuvash, Bashkir, Tatar and Kazakh. The
pairs you can evaluate your models are: Chuvash-English, Chuvash-Russian,
Bashkir-English, Bashkir-Russian, Tatar-English, Kazakh-Russian.
Unfortunately, we cannot provide you with training data for these language
pairs, but nevertheless you can use any data you find useful or collect by
yourself. We expect you to share your findings as a report. Best reports
will be included in the LoResMT proceedings. for futher information please
visit to LoResMT website.
*ORGANIZING COMMITTEE (LISTED ALPHABETICALLY)*
Atul Kr. Ojha
Chao-Hong Liu
Ekaterina Vylomova
Flammie Pirinen
Jonathan Washington
Nathaniel Oco
Xiaobing Zhao
*CONTACT*
Please email loresmt(a)googlegroups.com if you have any
questions/comments/suggestions.