Research software Engineer

7 days ago


Hinxton, United Kingdom Society of Research Software Engineering Full time £48,613

Open Targets (OT) is a unique public-private partnership working to deliver experimental data and informatics platforms that enable researchers to make more informed decisions about target selection for drug discovery. OT is a shared initiative between the European Bioinformatics Institute (EMBL-EBI), a global leader in the management, integration and analysis of public domain life science data; world-leading pharmaceutical companies GSK, Sanofi, Bristol Myers Squibb, Pfizer and Genentech; and the Wellcome Sanger Institute.

Generative AI has revolutionised the way we interact with knowledge. To benefit from the advances in LLM technology inside Open Targets, we are extending the capabilities of our platform towards LLM integration using open-source frameworks. The project aims to improve on extraction, representation, and usage of scientific knowledge, and present this knowledge to platform users in a user-friendly way. The central aims of the role we describe below will be 1) connecting the collected knowledge to an interface that can be queried using natural language, and 2) ensuring that the behaviours of the LLMs are consistent and adequate.

Your role
We are seeking a highly skilled and motivated Research Software Engineer with expertise in Python and Large Language Model (LLM) integration to join the AI knowledge management project for a period of 3 years. This role is central to our mission of advancing open science and robust drug development. You will be a key player in designing and operating software that integrates cutting-edge knowledge graphs, LLMs and ML techniques into our biomedical data processing and drug discovery platforms.

We are open to applicants at various career stages, but we are particularly interested in individuals who are eager to utilise cutting-edge technologies to address complex challenges in biological knowledge extraction. This position would be embedded within the Open Targets project team in the Saez-Rodriguez Group at the European Bioinformatics Institute and benefit from joint supervision with Sebastian Lobentanzer in the Saez-Rodriguez Group at Heidelberg University Hospital (UKHD).

You will work collaboratively across the project group with other experts in ML/AI, NLP, data integration and product delivery across ChEMBL, ePMC, Open Targets and Heidelberg University Hospital on a common goal to integrate cutting-edge technology for knowledge extraction, representation and integration to help drug discovery scientists to generate therapeutic hypotheses.

As a crucial member of the project team, you will design, build, and operate cloud-first open-source software that interfaces with large-scale biomedical data and drug discovery. You will contribute to the development of current and future informatics tools designed to support the identification and prioritisation of drug targets. Leveraging cutting-edge technologies and the expertise of our product owners and industry stakeholders, you will work in a dynamic, multidisciplinary, international environment to tackle a wide range of algorithmic and technical challenges.

As part of a dynamic, collaborative, and international team, you will be responsible for:

  • Developing and maintaining backend (Python) software, integrating databases and LLMs (
  • Investigating and implementing open source LLMs/conversational AI agents for use in web applications (chatbot) and therapeutic hypothesis generation.
  • Accessing bespoke knowledge graphs across OT data for specific use cases and assisting in the integration into the Open Targets Platform ecosystem.
  • Collaborating with the OT industry and academic partners to collect requirements and assess, prioritise, validate and refine the developed methods.
  • Taking ownership of the design and development of new features and pipelines, and independent problem-solving to resolve complex issues.
  • Actively disseminating the outcomes of the project to the scientific community and stakeholders through well-crafted presentations and publications.

You have

  • Advanced degree (MSc, PhD) in computer science, bioinformatics, software development, or a related field.
  • Strong proficiency in Python and experience with Large Language Model integration.
  • Proofed experience in applying modern ML/LLM frameworks and concepts
  • Good understanding of ML principles including embeddings, cross-validation and fine-tuning.
  • Proficiency in common data preprocessing task and normalisation
  • Exposure to source code version control software such as Git and GitHub.
  • Experience in independent problem-solving and examples of resolving complex issues.
  • Fluency in written and spoken English.
  • Ability to effectively communicate ideas or issues and work with team members from multidisciplinary backgrounds.
  • Interest in promoting your work and the ways we have solved complex challenges.

You might also have

  • Experience in MLOps including experiment tracking and model deployment
  • Experience with current LLM frameworks, such as LangChain and open-source LLM deployment (e.g., llama-cpp, ggml, Xorbits Inference).
  • Knowledge of human genetics, genomics, and/or drug discovery – or are interested in learning about these topics.
  • Experience building high-quality software and making frequent deployments as part of a regular software release process.
  • Experience working with knowledge graphs and graph databases.
  • Previous experience working in the research or life science industries.

Apply now Benefits and Contract Information

  • Financial incentives: depending on circumstances, monthly family/marriage allowance of £260, monthly child allowance of £314 per child. Non resident allowance up to £532 per month. Annual salary review, pension scheme, death benefit, long-term care, accident-at-work and unemployment insurances
  • Hybrid working arrangements
  • Private medical insurance for you and your immediate family (including all prescriptions and generous dental & optical cover)
  • Generous time off: 30 days annual leave per year, in addition to eight bank holidays
  • Relocation package including installation grant (as applicable)
  • Campus life: Free shuttle bus to and from work, on-site library, subsidised on-site gym and cafeteria, casual dress code, extensive sports and social club activities (on campus and remotely)
  • Family benefits: On-site nursery, child sick leave, generous parental leave, holiday clubs on campus and monthly family and child allowances
  • Contract duration: This position is a 3 year project contract non-renewable
  • Salary: UK Equivalent £48, Total package will be dependant on family circumstances)
  • International applicants: We recruit internationally and successful candidates are offered visa exemptions. Read more on our page for international applicants.
  • Diversity and inclusion: At EMBL-EBI, we strongly believe that inclusive and diverse teams benefit from higher levels of innovation and creative thought. We encourage applications from women, LGBTQ+ and individuals from all nationalities.
  • Job location: This role is based in Hinxton, near Cambridge, UK. You will be required to relocate if you are based overseas and you will receive a generous relocation package to support you.

To apply, please submit a covering letter and CV via our online system. Applications will close on 20/05/2024.

#J-18808-Ljbffr

  • Hinxton, United Kingdom Society of Research Software Engineering Full time £48,613

    Open Targets (OT) is a unique public-private partnership working to deliver experimental data and informatics platforms that enable researchers to make more informed decisions about target selection for drug discovery. OT is a shared initiative between the European Bioinformatics Institute (EMBL-EBI), a global leader in the management, integration and...


  • Hinxton, United Kingdom Embl-Ebi Full time

    Your role You will join a team of software engineers, responsible for the design, development and operations of high-throughput pipelines necessary to ensure that ENA can support scientists all over the world by being able to ingest, process, persist and distribute high volumes of sequence data. You will work together with ENA’s development and...


  • Hinxton, United Kingdom EMBL Full time

    Your role You will join a team of software engineers, responsible for the design, development and operations of high-throughput pipelines necessary to ensure that ENA can support scientists all over the world by being able to ingest, process, persist and distribute high volumes of sequence data. You will work together with ENA’s development and...


  • Hinxton, United Kingdom European Molecular Biology Laboratory (EMBL) Full time

    About the team/jobOpen Targets (OT) is a unique public-private partnership working to deliver experimental data and informatics platforms that enable researchers to make more informed decisions about target selection for drug discovery. OT is a shared initiative between the European Bioinformatics Institute (EMBL-EBI), a global leader in the management,...


  • Hinxton, United Kingdom European Molecular Biology Laboratory (EMBL) Full time

    About the team/jobOpen Targets (OT) is a unique public-private partnership working to deliver experimental data and informatics platforms that enable researchers to make more informed decisions about target selection for drug discovery. OT is a shared initiative between the European Bioinformatics Institute (EMBL-EBI), a global leader in the management,...


  • Hinxton, United Kingdom European Molecular Biology Laboratory (EMBL) Full time

    About the team/jobOpen Targets (OT) is a unique public-private partnership working to deliver experimental data and informatics platforms that enable researchers to make more informed decisions about target selection for drug discovery. OT is a shared initiative between the European Bioinformatics Institute (EMBL-EBI), a global leader in the management,...


  • Hinxton, United Kingdom European Molecular Biology Laboratory (EMBL) Full time

    About the team/jobOpen Targets (OT) is a unique public-private partnership working to deliver experimental data and informatics platforms that enable researchers to make more informed decisions about target selection for drug discovery. OT is a shared initiative between the European Bioinformatics Institute (EMBL-EBI), a global leader in the management,...


  • Hinxton, Cambridgeshire, United Kingdom EMBL-EBI Full time

    We are looking for a software engineer who will join the Technical Transformation team at EBI that has been tasked with modernising the European Nucleotide Archive (ENA), a petabyte-scale public database of genomic sequences, one of the largest scientific data resources in the world. Your role You will join a team of software engineers, responsible for...


  • Hinxton, United Kingdom Springer Nature Full time

    **Research Programme Officer**: - Employer- European Molecular Biology Laboratory (EMBL)- Location- Hinxton- Salary- Competitive- Closing date- 12 May 2024- Discipline Life Science Job Type Engineer Employment - Hours Full time Duration Fixed term Qualification Masters Sector Academia- You need to sign in or create an account to a job. **About...


  • Hinxton, United Kingdom EMBL-EBI Full time

    The IT & Technical Services department's Operations team is seeking a Senior Site Reliability Engineer to support the growing portfolio of services it provides to EMBl-EBIs service and research teams. The Operations team is responsible for maintaining and developing the InstitutesTransfer Services, the application and monitoring systems for our software...


  • Hinxton, United Kingdom EMBL-EBI Full time

    The IT & Technical Services department's Operations team is seeking a Senior Site Reliability Engineer to support the growing portfolio of services it provides to EMBl-EBIs service and research teams. The Operations team is responsible for maintaining and developing the InstitutesTransfer Services, the application and monitoring systems for our software...


  • Hinxton, United Kingdom EMBL-EBI Full time

    The IT & Technical Services department's Operations team is seeking a Senior Site Reliability Engineer to support the growing portfolio of services it provides to EMBl-EBIs service and research teams. The Operations team is responsible for maintaining and developing the InstitutesTransfer Services, the application and monitoring systems for our software...


  • Hinxton, United Kingdom EMBL-EBI Full time

    The IT & Technical Services department's Operations team is seeking a Senior Site Reliability Engineer to support the growing portfolio of services it provides to EMBl-EBIs service and research teams. The Operations team is responsible for maintaining and developing the InstitutesTransfer Services, the application and monitoring systems for our software...


  • Hinxton, United Kingdom EMBL-EBI Full time

    The IT & Technical Services department's Operations team is seeking a Senior Site Reliability Engineer to support the growing portfolio of services it provides to EMBl-EBIs service and research teams. The Operations team is responsible for maintaining and developing the InstitutesTransfer Services, the application and monitoring systems for our software...

  • Data engineer

    6 days ago


    Hinxton, United Kingdom European Molecular Biology Laboratory (EMBL) Full time

    About the team/jobWe are looking for a Data/software engineer who will join the Technical Transformation team at EBI that has been tasked with modernising the European Nucleotide Archive (ENA), a petabyte-scale public database of genomic sequences, one of the largest scientific data resources in the world.Your roleYou will join a team of software engineers,...

  • Data engineer

    6 days ago


    Hinxton, United Kingdom European Molecular Biology Laboratory (EMBL) Full time

    About the team/job We are looking for a Data/software engineer who will join the Technical Transformation team at EBI that has been tasked with modernising the European Nucleotide Archive (ENA), a petabyte-scale public database of genomic sequences, one of the largest scientific data resources in the world. Your role You will join a team of software...


  • Hinxton, United Kingdom European Molecular Biology Laboratory (EMBL) Full time

    About the team/job We are looking for a full-stack developer to join the Computational Chemical Biology team at the European Bioinformatics Institute (EMBL-EBI) located on the Wellcome Genome Campus near Cambridge in the UK. The role is primarily to develop a series of web-based applications and interfaces for the Chemical Biology's team resources...


  • Hinxton, United Kingdom European Molecular Biology Laboratory (EMBL) Full time

    About the team/job We are looking for a full-stack developer to join the Computational Chemical Biology team at the European Bioinformatics Institute (EMBL-EBI) located on the Wellcome Genome Campus near Cambridge in the UK.The role is primarily to develop a series of web-based applications and interfaces for the Chemical Biology’s team resources (ChEMBL,...


  • Hinxton, United Kingdom European Molecular Biology Laboratory (EMBL) Full time

    About the team/jobWe are looking for a full-stack developer to join the Computational Chemical Biology team at the European Bioinformatics Institute (EMBL-EBI) located on the Wellcome Genome Campus near Cambridge in the UK.The role is primarily to develop a series of web-based applications and interfaces for the Chemical Biology's team resources (ChEMBL,...


  • Hinxton, United Kingdom European Molecular Biology Laboratory (EMBL) Full time

    About the team/jobWe are looking for a full-stack developer to join the Computational Chemical Biology team at the European Bioinformatics Institute (EMBL-EBI) located on the Wellcome Genome Campus near Cambridge in the UK.The role is primarily to develop a series of web-based applications and interfaces for the Chemical Biology's team resources (ChEMBL,...