AI Safety Researcher

2 days ago


London, Greater London, United Kingdom AI Safety Institute Full time
Join the AI Safety Institute Team

AISI is launching a new Mechanistic Interpretability team to research the fundamental question of how can we tell if a model is scheming? This is an ambitious bet to bring interpretability as a field into prime time. We believe that this is a vital challenge that mechanistic interpretability can help solve, ensuring that dangerous capability evaluations can be reliably determine if models are safe to release even when the models themselves are capable of gaming the evals. We also think it can lead to an entirely new field of alignment evaluations and make substantial contributions to the problem of technical AI safety.

Key Responsibilities
  • Training sparse auto encoders (or fine-tuning open source SAEs)
  • Circuit discovery/analysis
  • Hands-on mechanistic interpretability research experience
  • Experience working within a research team that has delivered multiple exceptional scientific breakthroughs in deep learning (or a related field)
  • Comprehensive understanding of large language models (e.g. GPT-4), including both a broad understanding of the literature and hands-on experience with pre-training or fine tuning LLMs
  • Strong track-record of academic excellence (e.g. multiple spotlight papers at top-tier conferences)
Requirements
  • Improving scientific standards and rigour through mentorship & feedback
  • Experience working with world-class multi-disciplinary teams, including both scientists and engineers (e.g. in a top-3 lab)
What We Offer
  • A range of pension options available
  • Research problem selection


  • London, Greater London, United Kingdom Atla Ai Full time

    About AtlaWe're a London-based start-up dedicated to developing safe and beneficial AI systems that will have a profound impact on humanity's future. Our mission is to create the most capable AI evaluation models, and we're seeking talented individuals to join our growing team.RoleAs an alignment research engineer at Atla, you'll play a crucial role in...


  • London, Greater London, United Kingdom Atla Ai Full time

    About AtlaWe're a London-based start-up dedicated to developing safe and beneficial AI systems that will have a profound impact on humanity's future. Our mission is to create the most capable AI evaluation models, and we're seeking talented individuals to join our growing team.RoleAs an alignment research engineer at Atla, you'll play a crucial role in...

  • Research Scientist

    6 days ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    Join Our Team as a Research EngineerWe are seeking a highly motivated and talented Research Engineer to join our team at the AI Safety Institute. As a Research Engineer, you will play a key role in developing and implementing cutting-edge AI systems that prioritize safety and security.About the RoleAs a Research Engineer, you will be responsible for...

  • Research Scientist

    6 days ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    Join Our Team as a Research EngineerWe are seeking a highly motivated and talented Research Engineer to join our team at the AI Safety Institute. As a Research Engineer, you will play a key role in developing and implementing cutting-edge AI systems that prioritize safety and security.About the RoleAs a Research Engineer, you will be responsible for...

  • AI Research Engineer

    3 weeks ago


    London, Greater London, United Kingdom Atla Ai Full time

    About Atla AIWe are a London-based start-up dedicated to developing safe and beneficial AI systems that will have a profound impact on the future of humanity.Our mission is to engineer AI systems that align with human values and preferences, and we are seeking a highly skilled Research Engineer to join our team.Job SummaryWe are looking for a talented...

  • AI Research Engineer

    3 weeks ago


    London, Greater London, United Kingdom Atla Ai Full time

    About Atla AIWe are a London-based start-up dedicated to developing safe and beneficial AI systems that will have a profound impact on the future of humanity.Our mission is to engineer AI systems that align with human values and preferences, and we are seeking a highly skilled Research Engineer to join our team.Job SummaryWe are looking for a talented...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Join Our Team as a Cybersecurity Research EngineerAt the AI Safety Institute, we are committed to advancing the safety of artificial intelligence systems. As a Cybersecurity Research Engineer, you will play a critical role in our mission to develop and deploy AI systems that are secure, transparent, and accountable.About the RoleWe are seeking a highly...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Join Our Team as a Cybersecurity Research EngineerAt the AI Safety Institute, we are committed to advancing the safety of artificial intelligence systems. As a Cybersecurity Research Engineer, you will play a critical role in our mission to develop and deploy AI systems that are secure, transparent, and accountable.About the RoleWe are seeking a highly...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Job Title: Machine Learning Research ScientistJoin the AI Safety Institute as a Machine Learning Research Scientist and contribute to the development of safety cases for AI systems. As a key member of our research team, you will conduct foundational research to advance the understanding of AI safety and governance.About the RoleWe are seeking a highly...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Job Title: Machine Learning Research ScientistJoin the AI Safety Institute as a Machine Learning Research Scientist and contribute to the development of safety cases for AI systems. As a key member of our research team, you will conduct foundational research to advance the understanding of AI safety and governance.About the RoleWe are seeking a highly...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    About the RoleWe are seeking a highly skilled Machine Learning Research Scientist to join our team at the AI Safety Institute. As a key member of our research unit, you will play a critical role in advancing our understanding of AI safety and governance.Key ResponsibilitiesConduct foundational research to develop safety cases for AI systemsCollaborate with...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    About the RoleWe are seeking a highly skilled Machine Learning Research Scientist to join our team at the AI Safety Institute. As a key member of our research unit, you will play a critical role in advancing our understanding of AI safety and governance.Key ResponsibilitiesConduct foundational research to develop safety cases for AI systemsCollaborate with...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Join Our Team of Cybersecurity ExpertsWe are seeking highly motivated and talented Research Engineers to join our team at the AI Safety Institute. As a Research Engineer, you will work closely with our Cybersecurity and policy specialists to develop and implement cutting-edge AI systems that prioritize safety and security.About the RoleThis is an exciting...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Join Our Team of Cybersecurity ExpertsWe are seeking highly motivated and talented Research Engineers to join our team at the AI Safety Institute. As a Research Engineer, you will work closely with our Cybersecurity and policy specialists to develop and implement cutting-edge AI systems that prioritize safety and security.About the RoleThis is an exciting...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Join Our Team of Cybersecurity ExpertsWe are seeking highly motivated and talented Research Engineers to join our team at the AI Safety Institute. As a Research Engineer, you will work closely with our Cybersecurity and policy specialists to develop and implement cutting-edge AI systems that prioritize safety and security.About the RoleThis is an exciting...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Join Our Team of Cybersecurity ExpertsWe are seeking highly motivated and talented Research Engineers to join our team at the AI Safety Institute. As a Research Engineer, you will work closely with our Cybersecurity and policy specialists to develop and implement cutting-edge AI systems that prioritize safety and security.About the RoleThis is an exciting...


  • London, Greater London, United Kingdom Freelancingforgood Full time

    Research Manager AI SafetyFreelancingforgood is seeking a highly skilled Research Manager AI Safety to join our team. As a Research Manager AI Safety, you will play a crucial role in supporting and guiding AI safety researchers, facilitating projects, and contributing to the overall success of our programme.Key Responsibilities:Support and guide AI safety...


  • London, Greater London, United Kingdom Freelancingforgood Full time

    Research Manager AI SafetyFreelancingforgood is seeking a highly skilled Research Manager AI Safety to join our team. As a Research Manager AI Safety, you will play a crucial role in supporting and guiding AI safety researchers, facilitating projects, and contributing to the overall success of our programme.Key Responsibilities:Support and guide AI safety...


  • London, Greater London, United Kingdom Freelancingforgood Full time

    Research Manager - AI SafetyFreelancingforgood is seeking a highly skilled Research Manager to join our team in London. As a Research Manager, you will play a crucial role in supporting and guiding AI safety researchers, facilitating projects, and contributing to the overall success of our programme.Key Responsibilities:Support and guide AI safety...


  • London, Greater London, United Kingdom Freelancingforgood Full time

    Research Manager - AI SafetyFreelancingforgood is seeking a highly skilled Research Manager to join our team in London. As a Research Manager, you will play a crucial role in supporting and guiding AI safety researchers, facilitating projects, and contributing to the overall success of our programme.Key Responsibilities:Support and guide AI safety...