Intitulé de l'offre : Machine learning for omics data M/F (H/F)
Acronyme : Omik-IA
Référence : CPJ-2023-012
Nombre de Postes : 1
Site(s) concerné(s) : Montpellier
Région(s) académique(s) : Occitanie
Etablissement(s) partenaire(s) envisagé(s) : Université de Montpellier
Code(s) établissement(s) : UMR5004, UMR5506, UMR5535, UMR9002
Date de publication : jeudi 16 mars 2023
Type de contrat : Chaire de professeur Junior
Durée du contrat : 5 ans
Date d'embauche prévue : 1 juillet 2023
Quotité de travail : Temps complet
Rémunération : Annual salary from 54 600 Euros to 57 800 Euros depending on professional experience
Thématique scientifique : Interface between biology and artificial intelligence
Mots clés : Sequencing data, genome, epigenome, artificial intelligence, machine learning
Section(s) CN : Information sciences: bases of information technology, calculations, algorithms, representations, uses
Information sciences: processing, integrated hardware-software systems, robots, commands, images, content, interactions, signals and languages
Organisation, expression and evolution of genomes Bioinformatics and systems biology
Data and biological systems modelling and analysis: computer, mathematical and physical approaches
Titulaire d’un doctorat ou diplôme équivalent ou justifiant de titres et travaux scientifiques jugés équivalents par l’instance compétente de l’établissement. Il n’y a aucune condition d’âge ou de nationalité pour candidater. Tous les emplois CNRS sont accessibles aux personnes en situation de handicap en bénéficiant d’aménagement d’épreuves rendus nécessaires par la nature du handicap
The CNRS is renowned for its multidisciplinary approach, which allows it to draw on the expertise of the nation's most esteemed researchers in all branches of modern experimental science. This JPC project is an example of the CNRS' commitment to interdisciplinary work, combining biology, computer science and mathematics in order to create new machine learning methods for exploring genomes and epigenomes. This JPC is part of a larger series spanning from 2023 to 2027, wherein the CNRS and its partners endeavor to strengthen their capacity for handling heterogeneous and multi-scale data in the realm of "omics". This JPC should be seen as a milestone of the CNRS's efforts to promote interdisciplinary projects.
Stratégie du laboratoire d'accueil
The research units involved have a solid expertise either genome mining or in machine learning. The aim of the JPC is to encourage these laboratories to work together and thus improve our know-how in the processing of this class of data. It will therefore have a structuring role. In addition, the implementation of this JPC will aim to strengthen local research in this field.
The CNRS welcomes and recruits a large number of international researchers. More than 30% of those recruited come from abroad. This recruitment will have the same ambition.
Répertoire national des structures de recherche (RNSR) du laboratoire d'accueil
199111950H, 201722404H, 199111979P, 195817959H
Résumé du projet scientifique
High-throughput sequencing technologies have provided unprecedented access to the genetic material of many model organisms, while also enabling the genome-wide study of their transcription activity, regulatory sites, epigenetic modifications, and 3D interactions - datasets collectively known as 'omics'. However, these omic datasets are large, heterogeneous, and multi-scale, rendering the traditional analysis techniques insufficient. To meet this quantitative challenge, modern computer science and mathematics must be combined with machine learning techniques, which are well-suited for large-scale datasets but still under-utilized for multi-scale data integration. For this, centers of excellence in omics data and machine learning must join forces to innovate and develop novel methods for the analysis of next-generation sequencing data.
Résumé du projet d'enseignement
The aim is to teach an expert public, for whom mathematics and biology already represent a choice of curriculum (Master 1 or 2 level). In such a context, the teaching project will be related to the mathematical and computational tools and methods needed for the implementation of machine learning techniques while allowing the students to handle reasoning techniques and the biological problems.
- Total financé (dont package ANR) : 200 k€
- Total du projet : 200 k€
The dissemination of the results will be done through world-class scientific productions: publications, patents, software... In addition, the results will be communicated to various targets such as scientific communities, media, decision makers, general public, schools, etc., with an adapted calendar. Specific tools may be developed such as websites, newsletters, meetings, international symposia, summer schools and conferences.
The CNRS is developing a strong policy in favor of open science. Open science consists of making research results "as accessible as possible and closed as necessary". As such, the CNRS aims to make 100% of the texts of publications resulting from the work of its laboratories accessible , in particular through deposit in HAL. The data produced must also be made available and reusable, except for specific restrictions. In addition, the guiding principles of individual evaluation have been revised in accordance with the DORA declaration, to be more qualitative and to take into account all facets of the researcher's profession.
Science et société
The relationship between science and society is now recognized as a full dimension of scientific activity. The project will develop this dimension in synergy with all the partners. The resulting research work will contribute to informing public decision-making. Participatory science initiatives may be initiated with actors from the project’s socio-economic and cultural eco-system .
The activity will be evaluated in particular on the basis of scientific production (publications, software, patents, etc.), on institutional and private partnerships formalized by contracts, on international presence, on the promotion of work to multidisciplinary scientific communities, on innovation and its transfer to society and on scientific dissemination to non-specialist audiences.
Modalités d'organisation des auditions
Seul(e)s seront convoqué(e)s aux auditions les candidat(e)s sélectionné(e)s sur dossier par la commission de sélection