The research unit ATILF (Computer Processing and Analysis of the French 
Language) is offering a postdoctoral position in computational linguistics.

Topic: multiword expressions in large language models
Location: ATILF, Nancy, France (Univ. Lorraine and CNRS)
Starting date: September 2025
Duration: 12 months (with a possible extension of one more year)
Supervisors: Mathieu Constant (Univ. Lorraine, France) and Patrick Watrin (UC 
Louvain, Belgium)
Salary: depends on experience and salary grids (from 3000 to 4200 euros before 
tax)
Application deadline: June 1st, 2025


Subject. The term « multiword expression » (MWE) refers to a combination of 
multiple lexical items that displays irregular composition, possibly on 
several linguistic levels (morphology, syntax, semantics, …). MWEs cover a 
large variety of phenomena such as idioms (run around in circles), support verb 
constructions (take a walk), nominal compounds (dry run), and complex function 
units (in spite of). They have been the subject of extensive research in 
the NLP community over the last 50 years.

The goal of this post-doc position is to investigate to what extent large 
language models encode multiword expressions and their various levels of 
idiomaticity and fixedness. In particular, the hired post-doc will develop 
methods to extract linguistic features about multiword expressions in context 
from large language models.
The methods will be tested on French and will be used to provide aids for 
French L2 learners who encounter MWEs while reading authentic texts.

Context. The position is part of the STAR-FLE project (STrategic Adaptations 
for better Reading and Text Comprehension in FFL, https://www.starfle.fr/en, 
2024-2027) funded by the French National Research 
Agency (ANR). The project aims to propose innovative digital solutions in the 
area of Natural Language Processing (NLP) that may improve text comprehension 
for French L2 learners and assist teachers in managing learners at different 
levels. In particular, it will propose context-based aids for understanding 
lexical issues as well as MWEs found in authentic texts. The hired researcher 
will be fully integrated in the project team.

Requirements. Applicants should hold a PhD in natural language processing, 
computational linguistics, computer science, or applied mathematics.
The hired post-doc researcher should have the following skills:

 *   expertise in deep learning for NLP and notably large language models
 *   excellent programming skills
 *   good linguistic skills
 *   good knowledge of French would be a plus
 *   team spirit

Application. Applicants should submit a cover letter, a CV including 
their publications, and a list of references for recommendation on the 
following official website: 
https://emploi.cnrs.fr/Offres/CDD/UMR7118-SABMAR-022/Default.aspx?lang=EN 
Applications should be submitted no later than June 1st, 2025.

For more information, contact Mathieu Constant ([email protected]).