Research Scientist in AI Safety

3 days ago


London, Greater London, United Kingdom AI Safety Institute Full time
Join Our Team as a Research Scientist in AI Safety

The AI Safety Institute research unit is seeking highly motivated and talented Research Scientists to work on critical areas of AI safety, including risk models, frontier models, and large-scale targeted manipulation and deception.

Key Responsibilities:
  • Conduct research on AI safety and risk models, including auto-replication, iterative self-improvement, and large-scale targeted manipulation and deception.
  • Develop and implement novel methods for mitigating extreme risks from autonomous AI systems.
  • Collaborate with world-class multi-disciplinary teams, including scientists and engineers, to advance AI safety research.
  • Contribute to the development of open-source tooling, such as Inspect, used across all work-streams.

Requirements:
  • PhD in a relevant field, such as computer science, mathematics, or physics.
  • Strong background in deep learning and large language models.
  • Experience working with world-class multi-disciplinary teams.
  • Excellent communication and collaboration skills.

What We Offer:
  • A dynamic and collaborative research environment.
  • Opportunities for professional growth and development.
  • A competitive salary and benefits package.


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Join Our Team as a Research Scientist in AI Safety
    We're a team of scientists, engineers, and domain experts at the AI Safety Institute, focused on mitigating the risks associated with autonomous AI systems. Our mission is to advance the state of the science in risk modeling, incorporating insights from safety-critical and adversarial domains, while...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Join Our Team as a Research Scientist in AI Safety
    We're a team of scientists, engineers, and domain experts at the AI Safety Institute, focused on mitigating the risks associated with autonomous AI systems. Our goal is to advance the state of the science in risk modeling, incorporating insights from safety-critical and adversarial domains, while developing...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Join Our Research Team
    The AI Safety Institute is seeking highly motivated and talented Research Scientists to work on critical projects related to AI safety. Our team is dedicated to advancing the field of AI safety research and developing innovative solutions to mitigate risks associated with autonomous AI systems.
    Key Responsibilities:
      • Conduct research on AI...


  • AI Safety Researcher

    3 weeks ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    Join the AI Safety Institute Team
    AISI is launching a new Mechanistic Interpretability team to research a fundamental question: how can we tell if a model is scheming? This is an ambitious bet to bring interpretability as a field into prime time. We believe that this is a vital challenge that mechanistic interpretability can help solve, ensuring that...

  • Research Scientist

    3 weeks ago


    London, Greater London, United Kingdom RI Research Instruments GmbH Full time

    About the Role
    We are seeking a highly skilled Research Scientist to join our team at RI Research Instruments GmbH. As a Research Scientist, you will play a key role in advancing our research efforts in AI safety and developing beneficial AI systems.
    Key Responsibilities:
      • Conduct research in AI safety and develop new methods for automatically red teaming...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Job Title: Machine Learning Research Scientist
    Join the AI Safety Institute as a Machine Learning Research Scientist and contribute to the development of safety cases for AI systems. As a key member of our research team, you will conduct foundational research to advance the understanding of AI safety and governance.
    About the Role
    We are seeking a highly...


  • Research Scientist

    2 weeks ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    AI Safety Institute: Launching a New Mechanistic Interpretability Team
    AISI is embarking on a groundbreaking project to develop mechanistic interpretability, a crucial challenge in ensuring the safety of AI systems. We seek a team lead, research scientists, and research engineers to join our mission.
    Key Responsibilities:
      • Conduct hands-on mechanistic...

  • Research Scientist

    1 week ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    AI Safety Institute: Mechanistic Interpretability Team Lead
    AISI is launching a pioneering Mechanistic Interpretability team to tackle a fundamental question: how can we tell if a model is scheming? This is an ambitious bet to bring interpretability as a field into prime time. We believe that this is a vital challenge that mechanistic interpretability can...

  • AI Researcher

    5 days ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    Join Our Team as a Risk Modeling Researcher
    We're a leading organization focused on mitigating the risks associated with autonomous AI systems. Our team is dedicated to advancing the state of the science in risk modeling, incorporating insights from other safety-critical and adversarial domains, while developing novel techniques. We're also empirically...

  • AI Researcher

    6 days ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    Join Our Team as a Risk Modeling Researcher
    We're a leading organization focused on mitigating the risks associated with autonomous AI systems. Our team is dedicated to advancing the state of the science in risk modeling, incorporating insights from other safety-critical and adversarial domains, while developing novel techniques. We're also empirically...