AI Safety Engineer

2 days ago


London, Greater London, United Kingdom AI Safety Institute Full time

Company Overview: The AI Safety Institute is a leading organization in the field of artificial intelligence safety. Our mission is to ensure that AI systems are developed and used in ways that benefit society.

Salary: £80,000 - £120,000 per annum, depending on experience.

Job Description: We are seeking a highly skilled Research Engineer to join our Post-Training Team. As a member of this team, you will use cutting-edge machine learning techniques to improve model performance in our domains of interest. This includes developing methodologies for in-depth analysis of agent behaviour, implementing new tools for our LLM agents, and creating pipelines for supporting and fine-tuning large open-source models.

Required Skills and Qualifications: To be successful in this role, you will need:

  • A PhD in a technical field such as computer science or mathematics
  • Experience conducting empirical machine learning research, particularly on Large Language Models (LLMs)
  • Experience with machine learning engineering or extensive experience as a software engineer with a strong demonstration of relevant skills/knowledge in machine learning

Benefits: We offer a range of benefits including pension options, generous holiday allowance, and opportunities for professional development.

Others: If you are passionate about AI safety and have the skills and qualifications we are looking for, please apply for this exciting opportunity.


  • AI Safety Engineer

    4 weeks ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    The Post-Training Team at the AI Safety Institute is dedicated to optimizing AI systems for state-of-the-art performance in various risk domains. This involves a combination of scaffolding, prompting, supervised and RL fine-tuning of AI models. Key Responsibilities: Improve model performance using cutting-edge machine learning techniques. Develop methodologies...

  • AI Safety Researcher

    3 weeks ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    About the Role: We are seeking a highly motivated and talented Research Scientist/Engineer to join our Societal Impacts team at the AI Safety Institute. The successful candidate will work with other researchers to design and run studies that answer important questions about the effect of AI on society. The ideal candidate will have a strong background in...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    At the AI Safety Institute, we are dedicated to optimizing AI systems for state-of-the-art performance across various risk domains. Our Post-Training Team works tirelessly to fine-tune and scaffold AI models, ensuring they reach their full potential. About the Role: We are seeking a strong Research Scientist to join our team. As a member of this team, you will...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    About AI Safety Institute: AISI is a leading research institution in the field of artificial intelligence safety. We are dedicated to developing and applying cutting-edge technologies to ensure that AI systems align with human values. We are currently seeking a highly skilled researcher to join our Mechanistic Interpretability team. As a researcher, you will be...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    We are seeking an exceptional Cybersecurity Research Engineer to join our team at the AI Safety Institute. Our goal is to develop first-of-its-kind government-run infrastructure to benchmark the progress of advanced AI capabilities in cyber security. The selected candidate will work closely with a cross-functional team of cybersecurity researchers, machine...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    As advanced AI systems continue to evolve, the potential risks associated with their cyber capabilities pose a significant threat to organizational and individual security. These risks are particularly concerning when combined with other AI risk areas, such as harmful outcomes from biological and chemical capabilities, and autonomous systems. The AI Safety...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    The AI Safety Institute is a pioneering organization at the forefront of developing safety evaluations for next-generation frontier AI systems. Our platform is the backbone of this critical initiative, and we're seeking an experienced Cloud Software Architect to join our Platform Engineering team. This is an exceptional opportunity to drive innovation in an...

  • AI Safety Researcher

    4 weeks ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    About the Role: We are seeking a highly motivated and talented Research Scientist to join our Societal Impacts team at the AI Safety Institute. The successful candidate will work with our team to design and run studies that answer important questions about the effect AI will have on society. Key Responsibilities: Design and run studies to evaluate the impact of...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Estimated Salary: £80,000 - £110,000 per annum. About the Role: We are seeking an exceptional Senior AI Safety Researcher to join our team at the AI Safety Institute. This is a unique opportunity to contribute to the development of safety cases and advance the field of AI governance. Key Responsibilities: Conduct foundational research on safety cases to help...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    We are advancing the state of the science in risk modeling, incorporating insights from safety-critical and adversarial domains, while developing novel techniques. Our research aims to empirically evaluate these risks by building one of the world's largest agentic evaluation suites and pushing forward the science of model evaluations. Job Role: You will work as...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Job Title: Policy Lead. Location: N/A. As a leading expert in the field of human-AI interaction risks, you will lead a multidisciplinary research team at the AI Safety Institute to evaluate and mitigate the behavioral and psychological risks that emerge from AI systems. The position offers a unique opportunity to push forward an emerging field and be...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Job Description: We are seeking a highly skilled Research Scientist to join our team at the AI Safety Institute. This role offers an exciting opportunity to contribute to the development of rigorous scientific techniques for the measurement of frontier AI system capabilities. As a member of our Science of Evaluations team, you will be responsible for conducting...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Role Overview: The AI Safety Institute is seeking a highly skilled Senior AI Safety Researcher to join its Safeguard Analysis Team. The successful candidate will play a key role in researching and developing interventions that secure systems from abuse by bad actors. About the Role: This is a challenging and rewarding opportunity for an experienced researcher to...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    About the Role: We are seeking a highly skilled Research Lead to join our team at the AI Safety Institute. In this role, you will be responsible for advancing the state of science in evaluating societal-level harms caused by advanced AI systems. The Crime and Social Destabilisation workstream is a new initiative that focuses on assessing and mitigating...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Job Title: Cybersecurity Research Scientist. About the Job: The AI Safety Institute is seeking a highly skilled and motivated individual to join our research team as a Cybersecurity Research Scientist. In this role, you will be responsible for evaluating the strength and efficacy of safety and security components of advanced AI systems against diverse threats....


  • London, Greater London, United Kingdom AI Safety Institute Full time

    AI Safety Institute Overview: The Post-Training Team at the AI Safety Institute is dedicated to enhancing the performance of artificial intelligence systems across various risk domains. This is achieved through a combination of scaffolding, prompting, and fine-tuning of AI models. As a member of this team, you will utilize cutting-edge machine learning...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    About AI Safety Institute: The AI Safety Institute is a leading research organization dedicated to developing and applying cutting-edge technologies to ensure that AI systems align with human values. We are currently seeking a highly skilled researcher to join our Mechanistic Interpretability team. As a researcher, you will be responsible for advancing our...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    The Post-Training Team at AI Safety Institute focuses on optimising AI systems to achieve state-of-the-art performance across various risk domains. This is accomplished through scaffolding, prompting, supervised and RL fine-tuning of AI models, which include access to tools for interacting with the underlying operating system. Job Overview: We are seeking...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    AI Safety Institute Job Overview: The Department for Science, Innovation and Technology is seeking a highly skilled Senior AI Research Specialist to join its esteemed team at the forefront of artificial intelligence safety research. This role offers an exceptional opportunity for individuals with expertise in machine learning, large language models, and...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    The AI Safety Institute is seeking an experienced professional to lead our new Psychological and Social Risks workstream. As our Policy Lead, you will be responsible for developing and delivering a cutting-edge research agenda focused on the psychological and behavioural risks of AI systems. Key Responsibilities: Build and lead a talent-dense, multidisciplinary...