The Research unit ATILF (Computer Processing and Analysis of the French Language) offers a postdoctoral position in computational linguistics.
Topic: multiword expressions in large language models Location: ATILF, Nancy, France (Univ. Lorraine and CNRS) Starting date: September 2025 Duration: 12 months (possibility to extend the duration for one more year) Supervisors: Mathieu Constant (Univ. Lorraine, France) and Patrick Watrin (UC Louvain, Belgium) Salary: depends on experience and salary grids (from 3000 to 4200 euros before tax) Application deadline: April 22, 2025
Subject. The term « multiword expression » (MWE) refers to a combination of multiple lexical items that displays irregular composition possibly on different linguistic levels (morphology, syntax, semantics, …). They include a large variety of phenomena such as idioms (run around in circles), support verb constructions (take a walk), nominal compounds (dry run), complex function units (in spite of). They have been the subject of extensive research work in the NLP community over the last 50 years.
The goal of this post-doc position is to investigate to what extent large language models encode multiword expressions and their various levels of idiomaticity and fixedness. In particular, the hired post-doc will develop methods to extract linguistic features about multiword expressions in context from large language models. The methods will be experimented on French and will be used to provide aids for French L2 learners when reading MWE occurrences in authentic texts.
Context. The position is part of the STAR-FLE project (STrategic Adaptations for better Reading and Text Comprehension in FFL, https://www.starfle.fr/en, 2024-2027) funded by the French National Research Agency (ANR). The project aims to propose innovative digital solutions in the area of Natural Language Processing (NLP) that may improve text comprehension for French L2 learners and assist teachers in managing multiple levels of learners. In particular, it will propose context-based aids for understanding lexical issues as well as MWEs found in authentic texts. The hired researcher will be fully integrated in the project team.
Requirements. Applicants should hold a PhD thesis n natural language processing, in computational linguistics, in computer science, or in applied mathematics, . The hired post-doc researcher should have the following skills:
* expertise in deep learning for NLP and notably large language models * excellent programming skills * Good linguistic skills * good knowledge of French would be a plus * team spirit
Application. The applicants should submit a coverage letter, a CV including their publications, a list of references for recommandation, on the following official web site: https://emploi.cnrs.fr/Offres/CDD/UMR7118-SABMAR-022/Default.aspx?lang=EN. The applications should be sent not later than April 22, 2025.
For more information, contact Mathieu Constant (Mathieu.Constant@univ-lorraine.fr)