Research Director for AI Safety

12 hours ago


London, Greater London, United Kingdom AI Safety Institute Full time

**Job Title:** Research Director for AI Safety

**Location:** London, Greater London, United Kingdom

As a leading expert in the field of human-AI interaction risks, you will lead a multidisciplinary research team at the AI Safety Institute to evaluate and mitigate the behavioral and psychological risks that emerge from AI systems. The position offers an opportunity to push forward an emerging field and be part of a unique, fast-growing organization in AI research and governance.

The ideal candidate has a clear understanding of the emerging risks related to AI-human interaction and a vision for how to conduct impactful research in this area. A strong track record of leading multidisciplinary teams to deliver multiple exceptional scientific breakthroughs or high-quality products is required.

The role includes building and leading a talent-dense, multidisciplinary, and mission-driven team with diverse skill sets relevant for this endeavor. You will develop and deliver a cutting-edge research agenda focused on the psychological and behavioral risks of AI systems, manage a diverse portfolio of research projects, and forge relationships with key partners in industry, academia, and across Government, including the national security community.

**Estimated Salary:** $150,000 - $200,000 per annum


  • AI Safety Researcher

    1 month ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    We're focused on addressing extreme risks from autonomous AI systems that can interact with the real world. To do this, we're advancing the state of the art in risk modeling, incorporating insights from other safety-critical and adversarial domains, and developing novel techniques. We're also empirically evaluating these risks through one of the world's...

  • AI Safety Researcher

    2 weeks ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    About the Role: We are seeking a highly motivated and talented Research Scientist/Engineer to join our Societal Impacts team at the AI Safety Institute. The successful candidate will work with other researchers to design and run studies that answer important questions about the effect of AI on society. The ideal candidate will have a strong background in...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    We are advancing the state of the science in risk modeling, incorporating insights from safety-critical and adversarial domains, while developing novel techniques. Our research aims to empirically evaluate these risks by building one of the world's largest agentic evaluation suites and pushing forward the science of model evaluations. Job Role: You will work as...

  • AI Safety Researcher

    3 weeks ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    About the Role: We are seeking a highly motivated and talented Research Scientist to join our Societal Impacts team at the AI Safety Institute. The successful candidate will work with our team to design and run studies that answer important questions about the effect AI will have on society. Key Responsibilities: Design and run studies to evaluate the impact of...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    At the AI Safety Institute, we are dedicated to optimizing AI systems for state-of-the-art performance across various risk domains. Our Post-Training Team works tirelessly to fine-tune and scaffold AI models, ensuring they reach their full potential. About the Role: We are seeking a strong Research Scientist to join our team. As a member of this team, you will...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Estimated Salary: £80,000 - £110,000 per annum. About the Role: We are seeking an exceptional Senior AI Safety Researcher to join our team at the AI Safety Institute. This is a unique opportunity to contribute to the development of safety cases and advance the field of AI governance. Key Responsibilities: Conduct foundational research on safety cases to help...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    We are seeking an exceptional Cybersecurity Research Engineer to join our team at the AI Safety Institute. Our goal is to develop first-of-its-kind government-run infrastructure to benchmark the progress of advanced AI capabilities in cyber security. The selected candidate will work closely with a cross-functional team of cybersecurity researchers, machine...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Role Overview: The AI Safety Institute is seeking a highly skilled Senior AI Safety Researcher to join its Safeguard Analysis Team. The successful candidate will play a key role in researching and developing interventions that secure systems from abuse by bad actors. About the Role: This is a challenging and rewarding opportunity for an experienced researcher to...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    **About the Role:** We are seeking a highly skilled Research Lead to join our team at the AI Safety Institute. In this role, you will be responsible for advancing the state of science in evaluating societal-level harms caused by advanced AI systems. The Crime and Social Destabilisation workstream is a new initiative that focuses on assessing and mitigating...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    As advanced AI systems continue to evolve, the potential risks associated with their cyber capabilities pose a significant threat to organizational and individual security. These risks are particularly concerning when combined with other AI risk areas, such as harmful outcomes from biological and chemical capabilities, and autonomous systems. The AI Safety...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    AI Safety Institute Job Overview: The Department for Science, Innovation and Technology is seeking a highly skilled Senior AI Research Specialist to join its esteemed team at the forefront of artificial intelligence safety research. This role offers an exceptional opportunity for individuals with expertise in machine learning, large language models, and...

  • AI Safety Engineer

    3 weeks ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    The Post-Training Team at the AI Safety Institute is dedicated to optimizing AI systems for state-of-the-art performance in various risk domains. This involves a combination of scaffolding, prompting, supervised and RL fine-tuning of AI models. Key Responsibilities: Improve model performance using cutting-edge machine learning techniques; develop methodologies...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    The AI Safety Institute is seeking a highly skilled Machine Learning Research Scientist to join our team and contribute to the development of safety cases for AI systems. As a key member of our research team, you will work closely with our Research Director and other experts to advance our understanding of AI safety and governance. Key...

  • Research Scientist

    3 weeks ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    Are you passionate about improving AI systems for safer and more reliable operation? Do you have expertise in machine learning and a strong desire to apply it to real-world problems? We are seeking a talented Research Scientist to join our Post-Training Team at the AI Safety Institute. As a member of this team, you will be responsible for developing and...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Job Description: As a Research Scientist/Engineer at the AI Safety Institute, you will be part of a multidisciplinary team that studies how advanced AI models can impact people and society. Your primary responsibility will be to design and run studies that investigate the effects of AI on society, including its potential to change people's political and social...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    **Job Description:** The AI Safety Institute is launching a new Psychological and Social Risks workstream, focused on understanding and mitigating the risks that arise from repeated or prolonged human-AI interaction. As a leading expert in this field, you will build and lead a multidisciplinary research team to develop behavioral and psychological research...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    The AI Safety Institute is a pioneering organization at the forefront of developing safety evaluations for next-generation frontier AI systems. Our platform is the backbone of this critical initiative, and we're seeking an experienced Cloud Software Architect to join our Platform Engineering team. This is an exceptional opportunity to drive innovation in an...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    The Post-Training Team at the AI Safety Institute focuses on optimising AI systems to achieve state-of-the-art performance across various risk domains. This is accomplished through scaffolding, prompting, supervised and RL fine-tuning of AI models, which include access to tools for interacting with the underlying operating system. Job Overview: We are seeking...


  • London, Greater London, United Kingdom Atla Ai Full time

    Atla Ai: Safeguarding the Future of Humanity. About Us: We're Atla Ai, a pioneering London-based start-up dedicated to engineering safe and beneficial AI systems. Our mission is to drive positive change in the world by developing cutting-edge AI evaluation models. Role Overview: As our alignment research engineer, you'll play a pivotal role in shaping the future...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    AI Safety Institute Overview: The Post-Training Team at the AI Safety Institute is dedicated to enhancing the performance of artificial intelligence systems across various risk domains. This is achieved through a combination of scaffolding, prompting, and fine-tuning of AI models. As a member of this team, you will utilize cutting-edge machine learning...