General information
Offer title : Post-doctoral position in NLP in CHIST-ERA FAIRClinical project (M/F) (H/F)
Reference : UMR9015-NONNAD-004
Number of position : 1
Workplace : GIF SUR YVETTE
Date of publication : 12 May 2025
Type of Contract : Researcher in FTC
Contract Period : 12 months
Expected date of employment : 1 September 2025
Proportion of work : Full Time
Remuneration : monthly gross salary: between €3081.33 and €3519.85 (depends on the candidate's experience)
Desired level of education : Doctorate
Experience required : 1 to 4 years
Section(s) CN : 07 - Information sciences: processing, integrated hardware-software systems, robots, commands, images, content, interactions, signals and languages
Missions
The postdoctoral position is in the field of natural language processing and the researcher will join the European FAIRClinical project funded by CHIST-ERA. The postdoctoral researcher will develop machine learning approaches for information extraction from medical and clinical research articles and their supplementary materials.
Activities
- To identify the data sources needed for the extraction and normalization of the entities.
- To develop text mining pipelines for the extraction and normalization from full texts and supplementary materials.
- To evaluate the text mining methods.
- To participate in the team's publication and communication activities.
Skills
- PhD in computer science, computational linguistics or alike
- Skills in supervised and semi-supervised machine learning, including deep learning
- Experience with natural language processing
- Good command of English, both spoken and written
- Capacity to work independently and as a team member
- Ability to prioritize tasks and take initiative
Work Context
This position is part of the FAIRClinical project funded by CHIST-ERA, in which the objective is to enhance the FAIR-ness of all supplementary data files and significantly improve the reuse of unstructured clinical case report forms (CRFs). Supplementary data are commonly attached to a scientific publication, either directly in biomedical libraries such as PubMed Central, or via generalist deposition platforms such as Zenodo. CRFs collect the patient data in clinical research studies and trials, and represent an information-rich subset of clinical research literature and unstructured clinical study supplementary data. This project proposes to specifically enrich the contents—and therefore the interoperability, findability and reusability—of all supplementary data by delivering more normalized contents.
The position is located in a sector under the protection of scientific and technical potential (PPST), and therefore requires, in accordance with the regulations, that your arrival is authorized by the competent authority of the MESR.
Constraints and risks
Risks related to using a display screen.