[english version below]
****************************
Atelier 4 AS
Atelier sur les Avancées en AMR et en Analyse Sémantiques
4AS@TALN2025 - Marseille - 30 juin 2025
****************************
https://team.inria.fr/semagramme/fr/atelier-4-as/
****************************
-> Nouvelle date de soumission 27 avril 2025 (anywhere on earth)
Beaucoup de chercheurs et chercheuses ont rêvé de créer une intelligence artificielle générale et pour cela ont imaginé d’exploiter la sémantique du langage (parsing et génération). Aujourd’hui nous nous rapprochons de cette possibilité quand nous voyons la capacité des modèles à appréhender des problèmes complexes. Mais, aucun de ces systèmes n’exploite véritablement l’analyse sémantique qui reste un terrain de recherche fertile du point de vue des formalismes, des architectures/modèles et des publications. Bien qu’imparfait, un formalisme tentant de faire le lien entre ces différentes problématiques, les AMR, a concentré beaucoup d'efforts.
Cet atelier vise à réunir les différentes équipes qui s’intéressent à la sémantique du langage avec trois objectifs.
• Faire le point sur les différentes approches d’exploitation de la sémantique du langage, ses différents formalismes (DRS, AMR, Yarn, …) et leurs usages (modèles d’IA plus frugale, répondre à des problèmes où la sémantique a des avantages clairs par rapport des LLMs, ...). Un focus sera fait sur l’AMR, ses forces, faiblesses, ses successeurs potentiels et évolutions à venir.
• Identifier et positionner les différentes ressources utiles à l'analyse sémantique. Si le développement d'un Propbank en français n'apparaît pas comme une stratégie gagnante, porter le développement d'un alignement des mots du français vers Propbank, dans la lignée de VerbNet et Semlink, serait intéressant. La question des corpus annotés est aussi largement ouverte.
• Echanger les différents points de vue sur l’intérêt de poursuivre cette recherche face aux capacités des très grands modèles de langue (LLM), la meilleure façon de structurer cette recherche (projets collaboratifs par exemple) et de communiquer auprès d’un vaste public.
--------------------
Thèmes
--------------------
L'atelier sollicite des communications qui abordent un ou plusieurs des thèmes suivants :
• Interface syntaxe-sémantique ;
• Les ressources pour la sémantique ;
• Expansion ou couplage de ressources sémantiques avec des LLM ;
• Conception et annotation des représentations sémantiques ;
• Comparaison des frameworks de représentation sémantique ;
• Génération automatique de textes à partir de représentations de sens ;
• Forces et faiblesses des représentations sémantiques existantes ;
• Utilisation des représentations sémantiques dans des applications réelles ;
• Application des représentations sémantiques et multilinguisme ;
• Multimodalité dans les représentations du sens ;
• La relation entre les représentations symboliques du sens et les représentations sémantiques distribuées ;
• Propriétés formelles des représentations de sens.
--------------------
Soumission
--------------------
La longueur attendue des soumissions est de 4 pages, augmentée d'une page pour les versions camera ready.
Les soumissions doivent être rédigées selon la feuille de style ci-dessous et être soumises sous forme de fichiers PDF via le système EasyChair.
Les traductions de soumissions précédemment acceptées dans des conférences de plus grande envergure sont acceptées.
Le système de soumission est une track 4AS sur le site EasyChair de la conférence principale : https://easychair.org/my/conference?conf=coriataln2025
Les feuilles de style sont communes à TALN, CORIA, RECITAL et RJCRI, voir sur le site web
--------------------
Dates
--------------------
• Date limite de soumission : 22 avril 2025 (anywhere on earth)
-> Nouvelle date de soumission 27 avril 2025 (anywhere on earth)
• Notification : 7 Mai 2025 (anywhere on earth)
• Camera Ready version : 14 Mai 2025 (anywhere on earth)
En intégrant les contraintes de calendrier, aucune extension ne sera possible.
--------------------
Organisateurices
--------------------
• Maxime Amblard (Loria, Université de Lorraine)
• Maria Boritchev (Telecom Paris Tech)
• Bruno Guillaume (Inria)
• Johannes Heinecke (Orange)
• Frédéric Herledan (Orange)
***********************************************************************************************************************************
Workshop 4 AS
Workshop on Advances in AMR and Semantic Analysis
4AS@TALN2025 - Marseille - June 30th, 2025
****************************
https://team.inria.fr/semagramme/fr/atelier-4-as/
****************************
-> New deadline: April 27th 2025 (anywhere on earth)
Many researchers have dreamed of creating general artificial intelligence, and to do so have imagined exploiting the semantics of language (parsing and generation). Today, we’re getting closer to this possibility when we see the ability of models to grasp complex problems. But none of these systems really exploits semantic analysis, which remains a fertile field of research in terms of formalisms, architectures/models and publications. Although imperfect, a formalism that attempts to bridge these different issues, AMR, has been the focus of much effort.
This workshop aims to bring together the various teams working on language semantics, with three objectives.
• Take stock of the different approaches to exploiting language semantics, its various formalisms (DRS, AMR, Yarn, …) and their uses (more frugal AI models, answering problems where semantics has clear advantages over LLMs, …). The focus will be on AMR, its strengths, weaknesses, potential successors and future developments.
• Identify and position the various resources useful for semantic analysis. While developing a French Propbank does not appear to be a winning strategy, developing an alignment of French words to Propbank, along the lines of VerbNet and Semlink, would be interesting. The question of annotated corpora is also wide open.
• Exchange different points of view on the interest of pursuing this research in the face of the capabilities of very large language models (LLMs), the best way of structuring this research (collaborative projects, for example) and communicating to a wide audience.
--------------------
Themes
--------------------
The workshop invites papers that address one or more of the following themes:
• Syntax-semantics interface;
• Resources for semantics;
• Expanding or coupling semantic resources with LLMs;
• Designing and annotating semantic representations;
• Comparison of semantic representation frameworks;
• Automatic text generation from meaning representations;
• Strengths and weaknesses of existing semantic representations;
• Use of semantic representations in real-life applications;
• Application of semantic representations and multilingualism;
• Multimodality in meaning representations;
• The relationship between symbolic representations of meaning and distributed semantic representations;
• Formal properties of meaning representations.
--------------------
Submission
--------------------
The expected length of submissions is 4 pages, plus one additional page for camera-ready versions.
Submissions must follow the style sheet below and be submitted as PDF files via the EasyChair system.
Translations of papers previously accepted at larger conferences are welcome.
The submission system is a 4AS track on the EasyChair site of the main conference: https://easychair.org/my/conference?conf=coriataln2025
Style sheets are common to TALN, CORIA, RECITAL and RJCRI.
An Overleaf template is available (Feuilles de style CORIA-TALN 2025); see the website.
--------------------
Dates
--------------------
• Submission deadline: April 22nd, 2025
-> New deadline: April 27th, 2025 (anywhere on earth)
• Notification: May 7th, 2025
• Camera-ready version: May 14th, 2025
Given the time constraints, no extension will be possible.
--------------------
Organizers
--------------------
• Maxime Amblard (Loria, Université de Lorraine)
• Maria Boritchev (Telecom Paris Tech)
• Bruno Guillaume (Inria)
• Johannes Heinecke (Orange)
• Frédéric Herledan (Orange)
----------------------
Maxime Amblard
Université de Lorraine
https://members.loria.fr/mamblard
http://espoir-ul.fr
EcoDL 2025: The 1st Workshop on Digital Libraries and AI-based Information Systems for Ecological Research and Practice in conjunction with TPDL 2025
EcoDL 2025 aims to explore the integration of AI, digital libraries, and FAIR data principles in ecological research to improve knowledge synthesis and predictive modeling. Ecology's complexity and data heterogeneity present challenges in generalization, requiring advanced computational tools for structured knowledge representation, search, and decision support. We invite researchers from ecology, AI, and digital information systems to discuss AI-driven data synthesis, semantic search, causal inference, and machine learning applications in biodiversity and conservation. Through interdisciplinary contributions, EcoDL 2025 seeks to foster innovation in ecological informatics, supporting open science and advancing digital methods for ecological research and environmental sustainability.
**************************************************************
Workshop website: https://sites.google.com/view/ecodl2025/
Paper Submission Deadline: 16th May 2025 (AoE)
***************************************************************
Topics of interest
------------
The EcoDL 2025 workshop welcomes submissions on, but not limited to, the following topics:
· Knowledge graphs and structured ecological data representation
o Biodiversity knowledge graphs
o Linked open data for integrating scattered ecological knowledge sources
o Ontologies for data interoperability in ecology: Standardizing environmental terms and concepts
o Semantic annotation and classification of ecological data
o AI-driven taxonomy generation for ecological datasets
· Advanced search and retrieval for ecological and environmental data
o Neural search for literature and reports: Improving retrieval of species, habitats, and ecosystem information
o Improving retrieval of study questions, research hypotheses and applied methods
o LLMs for information extraction: Capturing species interactions, climate impacts, and conservation policies
o Retrieval-Augmented Generation (RAG) for ecological research: Hybrid AI systems for answering complex scientific questions
o Multimodal search for biodiversity and environmental studies: Combining text, image, and geospatial data retrieval
o Automated knowledge discovery from climate and biodiversity repositories
· FAIR data principles in ecological research
o Data interoperability
o Open science infrastructure for ecological and environmental data
o Ontologies for data interoperability in ecology: Standardizing environmental terms and concepts
o FAIR data and software
o Data lifecycle management (Create, Store, Share, Reuse)
o Nanopublications
o Mapping-based Knowledge Graph Construction
· AI for assisting ecological research
o AI-based literature review
o AI-driven synthesis of ecological knowledge: taking complexity and context-dependence into account
o Monitoring biases in study systems, study regions and methods in ecological research
o Tracking Misinformation in Climate Science Using NLP: Identifying and mitigating the spread of false environmental claims
· Digital libraries and ecological informatics
o Methods for digitizing and analyzing historical ecological archives
o Indigenous knowledge and digital archives for sustainability
o AI-powered environmental storytelling and digital heritage
o Human-nature interactions in digital libraries
o Digitization and NLP for analyzing historical climate data
· Methods for integrating heterogeneous ecological datasets
o Integrating remote sensing data with ecological repositories
o Multimodal search for biodiversity studies
· Applications of AI in ecosystem restoration, conservation planning and decision-making
o AI-powered decision support systems for restoration and conservation
o Lay summaries based on ecological evidence
o Impact assessment of conservation policies via digital libraries
· Reflections on knowledge synthesis in ecology and on the contributions of AI
o Evaluating the role of AI in ecological research
o Challenges and limitations of AI-driven ecological modeling
o The impact of automated systems on scientific knowledge creation
o Ethical considerations in AI-assisted ecological analysis
o Future directions for AI in knowledge synthesis for ecology
Submission guidelines
----------------
The EcoDL workshop solicits submissions in any of the following three formats:
§ Long Papers: Up to 15 LNCS style pages, including references.
§ Short Papers: Up to 10 LNCS style pages, including references.
§ Abstracts: Up to 2 LNCS style pages, including references.
All accepted long and short workshop papers will be published in the proceedings of the Springer series Communications in Computer and Information Science (CCIS). For detailed formatting instructions, please refer to the following link: <https://www.springer.com/gp/computer-science/lncs/conference-proceedings-gu…>. Authors of accepted abstracts will be invited to give poster presentations, to foster discussion and networking at the workshop, but abstracts will not be included in the proceedings.
Important dates
-----------
Paper Submission: 16th May 2025 (AOE)
Acceptance Notification: 20th June 2025 (AOE)
Camera-ready Version: 10th July 2025 (AOE)
Workshop: 23rd September 2025 in Tampere, Finland
The EcoDL 2025 Workshop is co-located with the 29th International Conference on Theory and Practice of Digital Libraries (TPDL 2025), https://tpdl2025.github.io/, held 23rd to 26th September 2025.
EcoDL 2025 Organising Committee
----------------
Jennifer D'Souza, TIB Leibniz Information Centre for Science and Technology, Hannover, Germany
Birgitta König-Ries, University of Jena, Germany
Tina Heger, Leibniz Institute of Freshwater Ecology and Inland Fisheries (IGB), Berlin, Germany
Marie Kaiser, Bielefeld University, Germany
The list of workshops and tutorials at TPDL this year can be found at https://tpdl2025.github.io/Program/workshops_tutorials.html
Call for expressions of interest: DISRPT 2025 - Shared Task on Discourse Relation Parsing and Treebanking
In conjunction with CODI-CRAC & EMNLP 2025 - Suzhou, China, Nov. 5-9.
This year, we are organizing the fourth edition of the DISRPT shared task on discourse processing across formalisms, for a variety of languages and genres, with three subtasks:
* Task 1: discourse segmentation
* Task 2: connective identification
* Task 3: relation classification
We will provide training, development and test datasets from all available languages in RST / eRST, SDRT, PDTB, ISO 24617, and discourse dependencies, using a uniform format. Because different corpora, languages, and frameworks use different guidelines, the shared task will promote the design of flexible methods for dealing with various guidelines, and will help to push forward the discussion of converging standards for discourse units. For datasets which have treebanks, we will evaluate segmentation in two different scenarios: with and without gold syntax. An automatically parsed version is provided for all corpora without a gold parse.
This year, the shared task will feature:
* the inclusion of more frameworks, with datasets from: RST / eRST, SDRT, PDTB, ISO 24617, and discourse dependencies
* the inclusion of new corpora and new languages, some of them kept a surprise!
* a unified set of labels for the discourse relations, to make evaluation across datasets easier
* a new constraint: only one multilingual model should be submitted per task, and it should be small! This will make our replication work easier but, more importantly, it will make such a model simpler to use and will test the robustness of your solution!
At this time we are calling for expressions of interest to participate. Registered participants will be added to our mailing list and receive updates as soon as data is made available. Please join us on: disrpt2025_participants(a)googlegroups.com
**Important dates:**
* May 15 2025 – sample data release
* June 16 2025 – training data release
* July 14 2025 – test data release
* August 1 2025 – system + paper submissions due
* September 12 2025 – notification of acceptance
* September 19 2025 – camera ready papers
* November 8-9 2025 – CODI at EMNLP
**Information:**
Contact the organizers: disrpt_chairs@googlegroups.com
Official website: https://sites.google.com/view/disrpt2025/
Google group for participants, please join us on: disrpt2025_participants(a)googlegroups.com
**Organization:**
Peter Bourgonje (Universität Potsdam, Germany)
Chloé Braud (CNRS - IRIT, University of Toulouse, France)
Chuyuan Li (University of British Columbia, Canada)
Janet Yang Liu (LMU Munich, Germany)
Philippe Muller (CNRS - University of Toulouse, France)
Amir Zeldes (Georgetown University, Washington DC, USA)
Dear colleagues,
A fully funded three-year PhD position is open at POLITO and EURECOM.
Do not hesitate to share with interested candidates around you.
=====================================
Web:
https://www.polito.it/sites/default/files/2025-03/borsa_scudo_25762_Eurecom…
Title: Enhancing Educational Storytelling with Human-Centered AI in the
LLM Era
Summary: The PhD aims to develop novel methods and techniques for
allowing end users to create interactive educational narratives from
structured resources such as knowledge graphs. The research envisions
combining generative models with Retrieval-Augmented Generation (RAG)
and end-user personalization strategies, moving beyond simple
binary-choice formats and thus enabling more engaging, custom-tailored,
and culturally adaptive storytelling.
Location:
- 18 months at Politecnico di Torino (Department of Control and
Computer Engineering)
- 18 months at EURECOM (Data Science Department)
Supervisors : Luigi de Russis, Raphael Troncy, Pasquale Lisena
Application deadline: April 28, 2025
Starting date: Fall 2025
To apply: send your CV + Motivation letter + Transcripts of your MSc +
your master thesis + 2 references to <raphael.troncy(a)eurecom.fr> and
<luigi.derussis(a)polito.it> by Monday 28 April 2025!
--
Raphaël Troncy
EURECOM, Campus SophiaTech
Data Science Department
450 route des Chappes, 06410 Biot, France.
e-mail: raphael.troncy(a)eurecom.fr & raphael.troncy(a)gmail.com
Tel: +33 (0)4 - 9300 8242
Fax: +33 (0)4 - 9000 8200
Web: http://www.eurecom.fr/~troncy/
(Apologies for cross-posting)
*SEM2025: The 14th Joint Conference on Lexical and Computational Semantics, Suzhou, China. (Co-located with EMNLP)
https://starsem2025.github.io/
Second Call for Papers
*SEM brings together researchers interested in the semantics of natural languages and its computational modelling. The conference embraces a wide range of approaches including data-driven, neural, probabilistic and symbolic; practical applications as well as theoretical contributions are welcome. The long-term goal of *SEM is to provide a forum for NLP researchers working on any aspect of natural language semantics.
*SEM invites submissions related to the computational modelling of natural language semantics (understood broadly) and its application. Relevant areas include (but are not limited to) theoretical aspects of computational semantics, empirical and data-driven approaches, resources, evaluation and applications/tools.
*SEM encourages authors to consider ethical aspects of their work, and to address and discuss ethical questions and implications relevant to their research. *SEM also values reproducibility and particularly welcomes submissions that adhere to the reproducibility guidelines specified here: <https://folk.idi.ntnu.no/odderik/reproducibility_guidelines.pdf>.
Submission Instructions
Submissions must describe unpublished work and be written in English. We solicit both long and short papers. Long papers describe original research and may consist of up to eight (8) pages of content, plus unlimited pages for references. Appendices are allowed after the references, but the paper should be self-contained and reviewers will not be required to check the appendices, if any. Final versions of long papers will be given one additional page of content (up to 9 pages) so that reviewers' comments can be taken into account. Short papers describe original focused research and may consist of up to four (4) pages, plus unlimited pages for references. Upon acceptance, short papers will be given five (5) content pages in the proceedings. Authors are encouraged to use this additional page to address reviewers' comments in their final versions.
Limitations and Ethics Statement sections are allowed and encouraged, but are not mandatory. These sections should be placed after the conclusion and will not count towards the overall page limit.
Submissions should follow the ARR formatting requirements: <https://github.com/acl-org/acl-style-files>.
Submission routes and deadlines
*SEM solicits both direct submissions and ACL Rolling Review (ARR) commitments. The deadline for direct submissions is May 30, 2025, and these submissions will be reviewed by the *SEM2025 program committee. ACL Rolling Review (ARR) submissions can be committed to *SEM up to August 22, 2025 (authors of ARR-reviewed papers need to include their OpenReview link with reviews in the submission form). Both types of submissions are made through OpenReview.
Direct submission link:
https://openreview.net/group?id=aclweb.org/StarSEM/2025/Conference
Multiple submission policy: *SEM does not prohibit the submission of work that is under consideration for another venue at the same time as the *SEM review period. However, authors of such papers will be asked to declare this at submission time.
Important Dates
(All deadlines are 11:59pm UTC-12h, AoE)
Direct submission deadline (long & short papers): May 30, 2025
ARR-reviewed submission deadline (long & short papers): August 22, 2025
Notification of acceptance: September 5, 2025
Camera-ready deadline: September 26, 2025
Conference date: TBA (co-located with EMNLP 2025)
Following ACL and ARR policies <https://www.aclweb.org/portal/content/report-acl-committee-anonymity-policy>, there is no anonymity period requirement.
Kemal Kurniawan | Research Fellow | (he/him) PhD
School of Computing and Information Systems | Faculty of Engineering and IT
Level 4, Melbourne Connect, 700 Swanston St
The University of Melbourne, Victoria 3010 Australia
E: kurniawan.k(a)unimelb.edu.au
THIRD CALL FOR PARTICIPATION - IberLEF 2025 - PRESTA: Questions and Answers
about Tables in Spanish
*Web*: https://www.codabench.org/competitions/5538/
We are pleased to announce the first IberLEF task on Question Answering on
Tabular Data: PRESTA.
The PRESTA shared task consists of Question Answering over Tabular Data
making use of the DataBenchSPA benchmark. DataBenchSPA is a benchmark
composed of real-world table datasets from different domains, with large
numbers of rows and columns and a wide variety of data types, which makes
it possible to assess distinct sorts of questions related to each data type.
We propose a task to encourage participants to develop a system that
answers the questions of the kind present in DataBenchSPA over day-to-day
datasets, where the answer is either a number, a categorical value, a
boolean value or lists of several types. DataBenchSPA can be used as a
training and validation set, while we will release another test set
explicitly compiled for the task competition.
The system developed by the participants will be provided with a series of
(dataset, question) pairs and will need to provide an answer, which will
then be compared with a gold standard.
The answer may be obtained through a variety of methods. In our paper [1]
we illustrate two different approaches: In-Context Learning and Code
Generation. You may use either of these or come up with your own approach.
There will be two subtasks:
Subtask I : DataBenchSPA QA
Participants will be provided with a dataset (of any size) and a question
over it. The question should be answered using the data from the dataset
only.
Subtask II: DataBenchSPA Lite QA
The task is essentially the same as the previous subtask, but involves
using the sampled version of each dataset with a maximum of 20 rows per
dataset. The question should be answered using the data from the sampled
dataset only. For the test set, we will similarly provide a reduced version
of each dataset for this subtask. This subtask is especially relevant when
testing models with a smaller context window.
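As a rough illustration of the Lite setting, the sketch below caps a table at 20 rows and answers a toy boolean question from the reduced data only. It is a minimal sketch under stated assumptions: the helper names (sample_rows, any_above), the uniform random sampling and the fixed seed are hypothetical choices made for illustration, and the procedure used to build the official Lite datasets is not described in this call.

```python
import random
from typing import Dict, List, Union

Cell = Union[str, int, float, bool]
Row = Dict[str, Cell]

MAX_LITE_ROWS = 20  # row cap stated for Subtask II


def sample_rows(rows: List[Row], max_rows: int = MAX_LITE_ROWS, seed: int = 0) -> List[Row]:
    """Return at most `max_rows` rows, keeping the original row order.

    Assumption: uniform sampling without replacement; the official
    sampling procedure is not specified in the call.
    """
    if len(rows) <= max_rows:
        return list(rows)
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    kept = sorted(rng.sample(range(len(rows)), max_rows))
    return [rows[i] for i in kept]


def any_above(rows: List[Row], column: str, threshold: float) -> bool:
    """Toy boolean question: does any row have `column` above `threshold`?"""
    return any(float(row[column]) > threshold for row in rows)


# Hypothetical table with 100 rows and two columns.
table = [{"id": i, "price": float(i)} for i in range(100)]
lite = sample_rows(table)

print(len(lite))                        # 20
print(any_above(table, "price", 98.0))  # True (the row with price 99.0)
print(any_above(lite, "price", 98.0))   # depends on which rows were sampled
```

The last line shows why the two subtasks can yield different gold answers for the same question: an answer computed on the sampled table reflects only the rows that were kept.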
Important Dates
Release of training data: 18 March 2025
Release of test data - competition starts: 30 April 2025
Submission of the results - competition ends: 12 May 2025
Submission of the description paper: 30 May 2025
Task Organizers
Jorge Osés Grijalba - Graphext
L. Alfonso Ureña-López - University of Jaén
Eugenio Martínez Cámara - University of Jaén
Jose Camacho-Collados - Cardiff University
Codabench: https://www.codabench.org/competitions/5538/
--
Apologies for cross-posting.
--
Have you recently completed or expect very soon an MSc or equivalent degree
in computer science, artificial intelligence, computational linguistics,
engineering, or a related area? Are you interested in carrying out research
on automatic translation during the next few years? Are you excited to
spend a part of your life in a pleasant city in the heart of the Italian
Alps?
WE ARE LOOKING FOR YOU!!!
The Machine Translation <https://mt.fbk.eu/> (MT) group at Fondazione Bruno
Kessler <https://www.fbk.eu/en/> (Trento, Italy) in conjunction with the ICT
International Doctorate School of the University of Trento
<https://iecs.unitn.it/> is pleased to announce the availability of the
following fully-funded PhD position:
TITLE: Automatic translation with large multimodal models
DESCRIPTION:
The rise of large, multimodal foundation models has driven remarkable
progress in natural language processing. In text and speech translation,
their growing adoption comes with quality improvements while also opening
critical research directions for successful deployment and widespread
access. Among the current hot research topics, three are particularly
relevant to this PhD position: resource efficiency, model alignment, and
model accessibility. (i) Resource efficiency aims to reduce computational
demands through model compression techniques that shrink large
general-purpose models, optimizing them for specific hardware (e.g., mobile
devices), translation tasks, domains, or language settings. (ii) Model
alignment ensures outputs are trustworthy, fair, and human-centered by
integrating cultural, sociodemographic, and human factors into model design
and evaluation. This includes bias-aware solutions that promote diversity
and inclusivity, as well as improved evaluation methods that better capture
real-world user needs. (iii) Model accessibility enhances inclusivity for
individuals with visual, hearing, or cognitive impairments by integrating
multimodal solutions such as sign language, speech-to-text simplification,
and description-augmented subtitling, expanding the capabilities of large
models. This PhD position is open to candidates with a strong interest in
advancing the state of the art in any of these areas.
CONTACTS: negri(a)fbk.eu, bentivo(a)fbk.eu
COMPLETE DETAILS AVAILABLE AT:
https://iecs.unitn.it/education/admission/call-for-application
IMPORTANT DATES:
The application deadline is May 12th, 2025, 4:00 PM (CEST)
Prospective candidates are strongly encouraged to contact us in advance for
preliminary interviews. Given the short time remaining before the
application deadline, precedence for interviews will be given to
short-listed candidates who send us a complete CV via email
(negri(a)fbk.eu, bentivo(a)fbk.eu) by May 5, 2025.
Candidate profile
The ideal candidate must have recently completed or expect very soon an MSc
or equivalent degree in computer science, artificial intelligence,
computational linguistics, engineering, or a closely related area. In
addition, the applicant should:
- Have an interest in Machine and Speech Translation
- Have experience in deep learning and machine learning, in general
- Have good programming skills in Python and experience in PyTorch
- Enjoy working with real-world problems and large data sets
- Have good knowledge of written and spoken English
- Enjoy working in a closely collaborating team
Working Environment
The doctoral student will be employed at the MT Unit <https://mt.fbk.eu/>
at Fondazione Bruno Kessler <https://www.fbk.eu/en/>, Trento, Italy. The
group (about 10 people including staff and students) has a long tradition
in research on machine and speech translation and is currently involved in
several projects. Former students are now employed at leading IT
companies around the world.
Benefits
Fondazione Bruno Kessler offers an attractive benefits package, including a
flexible work week, full reimbursement for conferences and summer schools,
an excellent team of supervisors and mentors, help with housing, full
health insurance, the possibility of Italian courses, and sporting
facilities.
Further Information
Should you need further information about the position, please contact
Matteo Negri (negri(a)fbk.eu) and Luisa Bentivogli (bentivo(a)fbk.eu).
Best Regards,
Matteo Negri
**************************************************
*** Join us this coming weekend for the
*** 2025 NARNiHS Research Incubator!
**************************************************
==> 25-27 April 2025 <==
Consult the program for the richest Incubator line-up ever: https://narnihs.org/?page_id=3075
Fourteen (14 !!!) exciting international projects in Historical Sociolinguistics and an expert roundtable!
The event is fully online and free for NARNiHS members. Not yet a NARNiHS member? Membership is free: https://narnihs.org/?page_id=2
Information concerning access to the online venue will be distributed through the NARNiHS members' listserv a few days before the event.
We look forward to seeing you there!
The 2025 NARNiHS Research Incubator organizing committee
The 23rd International Workshop on Treebanks and Linguistic Theories (TLT 2025) will bring together developers and users of linguistically annotated natural language corpora. The workshop is part of SyntaxFest 2025 and will be hosted by University of Ljubljana in Slovenia on August 26-29, 2025.
Link to TLT 2025: https://www.korpuslab.uni-hamburg.de/en/tlt2025.html
Link to SyntaxFest 2025: https://syntaxfest.github.io/
-----------------------------
INVITED TALK
-----------------------------
Amir Zeldes (Georgetown University)
-----------------------------
SUBMISSION INFORMATION
-----------------------------
TLT addresses all aspects of treebank design, development, and use. As ‘treebanks’ we consider any pairing of natural language data (spoken, signed, or written) with annotations of linguistic structure at various levels of analysis, including, e.g., morpho-phonology, syntax, semantics, and discourse. Annotations can take any form (including trees or general graphs), but they should be encoded in a way that enables computational processing. Reflections on the design of linguistic annotations, methodology studies, resource announcements or updates, annotation or conversion tool development, or reports on treebank usage including probing the leakage of treebanks into large language models are but some examples of the types of papers we anticipate for TLT.
SyntaxFest joint submission link: https://openreview.net/group?id=SyntaxFest/2025
-----------------------------
IMPORTANT DATES
-----------------------------
* April 22, 2025: Extended paper submission deadline
* June 2, 2025: Notification of acceptance
* June 16, 2025: Camera-ready papers due
* August 26-29, 2025: SyntaxFest conference (about two workshop days for TLT; attendees are encouraged but not obliged to participate in the whole SyntaxFest.)
All deadlines are 11:59 pm Anywhere on Earth (UTC -12h).
-----------------------------
TLT2025 WORKSHOP CHAIRS
-----------------------------
* Sarah Jablotschkin, University of Hamburg
* Sandra Kübler, Indiana University
* Heike Zinsmeister, University of Hamburg
Contact: tlt2025.gw(a)uni-hamburg.de
Website: https://www.korpuslab.uni-hamburg.de/en/tlt2025.html
--------------------- Challenge Links:
Challenge Homepage: https://brandonio-c.github.io/ClinIQLink-2025/
Challenge Sample Dataset: https://github.com/Brandonio-c/ClinIQLink_Sample-dataset
CodaBench ClinIQLink Docker Setup (GitHub): https://github.com/Brandonio-c/ClinIQLink_CodaBench_docker-setup
ClinIQLink @ BIONLP @ ACL 2025 - Important updates!
We are pleased to share a set of important updates regarding the ClinIQLink Challenge, focused on evaluating factuality in clinical question answering:
1. Extended Submission Deadline
The system submission deadline has been extended to 05 May 2025 at 23:59 AOE (Anywhere on Earth). This allows additional time for teams to finalize and submit their models.
2. Rolling Evaluation Process
Submissions are being evaluated on a first-come, first-served basis. We encourage early submission to receive results in a timely manner.
3. Baseline Benchmark Release
This Friday, we will release performance benchmarks for a selection of open-source large language models evaluated on the ClinIQLink dataset. These results will serve as reference points for participating teams.
4. Updated Submission Instructions
Submission guidelines have been clarified to support reproducibility and containerized evaluation. The revised instructions are available at:
https://github.com/Brandonio-c/ClinIQLink_CodaBench_docker-setup/blob/main/…
5. Large Model Submissions Still Accepted
Teams submitting large models (e.g., models between 10GB and 750GB) may request access.
For full challenge details, please visit the challenge homepage:
https://cliniqlink.org/
If you have any questions, feel free to reach out to me at brandon.colelough(a)nih.gov.
We look forward to your participation.
Best regards,
Brandon Colelough
brandon.colelough(a)nih.gov