Offre d'emploi

Principal Data Scientist - Knowledge graphs and causal inference

Principal Data Scientist - Knowledge graphs and causal inference

Postuler à l'offre

Date limite de candidature
Date limite pour postuler : 31.05.22


Corps de texte

About the Company:

Sanofi is a global life sciences company committed to improving access to healthcare and supporting the people we serve throughout the continuum of care. From prevention to treatment, Sanofi transforms scientific innovation into healthcare solutions, in human vaccines, rare diseases, multiple sclerosis, oncology, immunology, infectious diseases, diabetes and cardiovascular solutions and consumer healthcare. More than 110,000 people in over 100 countries at Sanofi are dedicated to make a difference on patients’ daily life, wherever they live and enable them to enjoy a healthier life. As a company with a global vision of drug development and a highly-regarded corporate culture, Sanofi is recognized as one of the best pharmaceutical companies in the world and is pioneering the application of Artificial Intelligence (AI) in the R&D organization including drug discovery, chemical manufacturing and control, translational research, clinical development, and regulatory document management and submission. Details of the organization and the company’s mission and goals can be found on our website (


Artificial Intelligence (AI) and Machine Learning (ML) algorithms can significantly speed up drug discovery and shorten drug development and identification of patients for clinical trials thereby creating better medicines that save lives. AI and Deep Analytics (AIDA) is a critical group in Digital and Data Science (DDS) organization at Sanofi R&D focused on applications of AI/ML and Deep Learning (DL) in drug design, multi-omics diseases modeling, drug development, and analysis of outcomes of clinical trials.  

Our existing research and development areas include Omics Data Science applied to single-cell RNA sequences, multi-omics data integration, and real word data (RWD); Biologics Drug Design; Natural Language Processing (NLP); Deep Learning-based Imaging and bioimaging for digital pathology and Spatial Biology; digital signal processing (DSP) and machine learning applied to digital health and patient-generated data from wearables.    

Scientists in our team come from diverse backgrounds in computational sciences and engineering with deep expertise in AI/ML, deep learning, biostatistics and algorithms.  

We are seeking a Principal Data Scientist to join the AI and Deep Analytics (AIDA), Omics Data Science (ODS) team. ODS closely interacts with Precision Oncology, Precision Immunology and Translational Sciences at Sanofi R&D.

The successful candidate will have extensive experience in omics data analysis, biological network inference and knowledge graphs development with published studies aimed at deciphering complex biological mechanisms beyond the deterministic paradigm. The candidate should also have excellent oral and written communication skills, the ability to learn and acquire new techniques and methodologies as well as a strong tropism for teamwork.

The candidate is expected to execute analytical strategies for patient deep phenotyping using multi-omics data analysis and integration methods for new indication identification, disease endotype characterization and drug repurposing.

The candidate will directly report to the Global Head of AI and Deep Analytics at Sanofi R&D.

The responsibilities of the principal data scientist in AI and Deep Analytics will include:    

  • Analyzing multi-omics data (including spatial and single cell data) and inferring biological networks.
  • Building knowledge graphs from internal and external data sources.    
  • Close interactions with other data scientists as well as scientists in immunology, oncology, and translational sciences, in an international context (US, Europe, China).
  • Update and report relevant results to interdisciplinary project teams and stakeholders
  • Maintain a keen awareness of recent developments in data science and bioinformatics and state-of-the-art of AI/ML/DL algorithms and research results
  • Active engagement in evaluation and coordination of both academic and startup collaborations

Qualifications & Requirements:

  • A PhD degree in Bioinformatics, Biostatistics, Biophysics, Computational Biology, Computer Science, and Engineering Sciences
  • +7 years of industry experience with a strong record of accomplishments and project experience in applications of AI/ML in biological systems  
  • Strong familiarity with core concepts in spatial omics data analysis and network inference.
  • Experience with building and using knowledge graphs
  • Familiarity with data visualization and dimensionality reduction algorithms
  • Proficiency in Python or R,
  • Ability to develop, benchmark and apply predictive algorithms to generate hypotheses
  • A change agent with a combination of business, science & technology, and diplomatic skills

At Sanofi R&D North America, we deliver meaningful solutions for patients. We transform science into breakthrough, best-in-class and first-in-class medicines and vaccines. We believe in creating a diverse and inclusive workforce – and workplace – which brings together the collective brainpower of over 2,000 colleagues and provides you with an exciting place to grow and develop. We set the bar high, and we deliver. Join us and together we will build on our trusted legacy of breakthroughs for society.

Sanofi Inc. and its U.S. affiliates are Equal Opportunity and Affirmative Action employers committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race; color; creed; religion; national origin; age; ancestry; nationality; marital, domestic partnership or civil union status; sex, gender, gender identity or expression; affectional or sexual orientation; disability; veteran or military status or liability for military status; domestic violence victim status; atypical cellular or blood trait; genetic information (including the refusal to submit to genetic testing) or any other characteristic protected by law.