Empirical AI Safety Research Position

21 hours ago


London, Greater London, United Kingdom Anthropic Full time
Job Title: Empirical AI Safety Research Fellow

Job Summary: We are seeking a highly skilled individual to join our team as an Empirical AI Safety Research Fellow. The successful candidate will work on an empirical project aligned with our research priorities, using external infrastructure such as open-source models and public APIs. You will produce a public output, such as a paper submission, and will have access to substantial support, including mentorship, funding, and compute resources.

Key Responsibilities: As an Empirical AI Safety Research Fellow, you will:
  • Work on an empirical project using external infrastructure, such as open-source models and public APIs.
  • Produce a public output, such as a paper submission.
  • Draw on substantial support, including mentorship, funding, and compute resources.

Why Join Us: As an Empirical AI Safety Research Fellow, you will be part of a team working towards reliable, interpretable, and steerable AI systems. The role pays a weekly stipend of £1,300 and expects a commitment of 40 hours per week. You will receive direct mentorship from Anthropic researchers, connection to the broader AI safety research community, access to benefits, and funding for compute and other research expenses. You will also work in a collaborative environment, sharing your ideas and learning from others.
  • AI Safety Researcher

    1 month ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    We're focused on addressing extreme risks from autonomous AI systems that can interact with the real world. To do this, we're advancing the state of the art in risk modeling, incorporating insights from other safety-critical and adversarial domains, and developing novel techniques. We're also empirically evaluating these risks through one of the world's...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    AI Safety Institute Job Overview: The Department for Science, Innovation and Technology is seeking a highly skilled Senior AI Research Specialist to join its esteemed team at the forefront of artificial intelligence safety research. This role offers an exceptional opportunity for individuals with expertise in machine learning, large language models, and...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    We are advancing the state of the science in risk modeling, incorporating insights from safety-critical and adversarial domains, while developing novel techniques. Our research aims to empirically evaluate these risks by building one of the world's largest agentic evaluation suites and pushing forward the science of model evaluations. Job Role: You will work as...

  • AI Safety Engineer

    4 weeks ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    The Post-Training Team at the AI Safety Institute is dedicated to optimizing AI systems for state-of-the-art performance in various risk domains. This involves a combination of scaffolding, prompting, supervised and RL fine-tuning of AI models. Key Responsibilities: Improve model performance using cutting-edge machine learning techniques; develop methodologies...

  • AI Safety Researcher

    3 weeks ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    About the Role: We are seeking a highly motivated and talented Research Scientist/Engineer to join our Societal Impacts team at the AI Safety Institute. The successful candidate will work with other researchers to design and run studies that answer important questions about the effect of AI on society. The ideal candidate will have a strong background in...

  • AI Safety Engineer

    19 hours ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    Company Overview: The AI Safety Institute is a leading organization in the field of artificial intelligence safety. Our mission is to ensure that AI systems are developed and used in ways that benefit society. Salary: £80,000 - £120,000 per annum, depending on experience. Job Description: We are seeking a highly skilled Research Engineer to join our...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    About AI Safety Institute: AISI is a leading research institution in the field of artificial intelligence safety. We are dedicated to developing and applying cutting-edge technologies to ensure that AI systems align with human values. We are currently seeking a highly skilled researcher to join our Mechanistic Interpretability team. As a researcher, you will be...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Job Title: Policy Lead. Location: N/A. As a leading expert in the field of human-AI interaction risks, you will lead a multidisciplinary research team at the AI Safety Institute to evaluate and mitigate the behavioral and psychological risks that emerge from AI systems. The position offers a unique opportunity to push forward an emerging field and be...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    The Post-Training Team at AI Safety Institute focuses on optimising AI systems to achieve state-of-the-art performance across various risk domains. This is accomplished through scaffolding, prompting, supervised and RL fine-tuning of AI models, which include access to tools for interacting with the underlying operating system. Job Overview: We are seeking...

  • AI Safety Researcher

    3 weeks ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    About the Role: We are seeking a highly motivated and talented Research Scientist to join our Societal Impacts team at the AI Safety Institute. The successful candidate will work with our team to design and run studies that answer important questions about the effect AI will have on society. Key Responsibilities: Design and run studies to evaluate the impact of...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    At the AI Safety Institute, we are dedicated to optimizing AI systems for state-of-the-art performance across various risk domains. Our Post-Training Team works tirelessly to fine-tune and scaffold AI models, ensuring they reach their full potential. About the Role: We are seeking a strong Research Scientist to join our team. As a member of this team, you will...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Estimated Salary: £80,000 - £110,000 per annum. About the Role: We are seeking an exceptional Senior AI Safety Researcher to join our team at the AI Safety Institute. This is a unique opportunity to contribute to the development of safety cases and advance the field of AI governance. Key Responsibilities: Conduct foundational research on safety cases to help...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Job Description: We are seeking a highly skilled Research Scientist to join our team at the AI Safety Institute. This role offers an exciting opportunity to contribute to the development of rigorous scientific techniques for the measurement of frontier AI system capabilities. As a member of our Science of Evaluations team, you will be responsible for conducting...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    We are seeking an exceptional Cybersecurity Research Engineer to join our team at the AI Safety Institute. Our goal is to develop first-of-its-kind government-run infrastructure to benchmark the progress of advanced AI capabilities in cyber security. The selected candidate will work closely with a cross-functional team of cybersecurity researchers, machine...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Role Overview: The AI Safety Institute is seeking a highly skilled Senior AI Safety Researcher to join its Safeguard Analysis Team. The successful candidate will play a key role in researching and developing interventions that secure systems from abuse by bad actors. About the Role: This is a challenging and rewarding opportunity for an experienced researcher to...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    About the Role: We are seeking a highly skilled Research Lead to join our team at the AI Safety Institute. In this role, you will be responsible for advancing the state of science in evaluating societal-level harms caused by advanced AI systems. The Crime and Social Destabilisation workstream is a new initiative that focuses on assessing and mitigating...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    As advanced AI systems continue to evolve, the potential risks associated with their cyber capabilities pose a significant threat to organizational and individual security. These risks are particularly concerning when combined with other AI risk areas, such as harmful outcomes from biological and chemical capabilities, and autonomous systems. The AI Safety...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    AI Safety Institute Overview: The Post-Training Team at the AI Safety Institute is dedicated to enhancing the performance of artificial intelligence systems across various risk domains. This is achieved through a combination of scaffolding, prompting, and fine-tuning of AI models. As a member of this team, you will utilize cutting-edge machine learning...


  • London, Greater London, United Kingdom Atla Ai Full time

    Atla Ai: Safeguarding the Future of Humanity. About Us: We're Atla Ai, a pioneering London-based start-up dedicated to engineering safe and beneficial AI systems. Our mission is to drive positive change in the world by developing cutting-edge AI evaluation models. Role Overview: As our alignment research engineer, you'll play a pivotal role in shaping the future...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    About the Role: We are seeking a Senior Researcher to join our Frontier Evaluations team at the AI Safety Institute. As a member of this team, you will play a key role in developing and applying rigorous scientific techniques for the measurement of frontier AI system capabilities. Your responsibilities will include: Developing methods for inferring model...