Alignment Researcher, AI Safety Specialist

2 weeks ago


London, Greater London, United Kingdom Atla Ai Full time

Atla is dedicated to developing safe, beneficial AI systems with a profound impact on humanity's future. As a leading London-based startup, we're building the most advanced AI evaluation models. We're seeking an exceptional individual to join our team and collaborate with industry leaders to shape the future of AI.

As Atla's alignment researcher, you'll be responsible for developing language models that serve as evaluators and creating safety protocols for LLMs. Your key objectives will include:

  • Developing methods to steer LLMs towards becoming strong evaluators aligned with human preferences using prompting, supervised fine-tuning, and adversarial training.
  • Designing and implementing comprehensive evaluation frameworks, including internal tooling, datasets, and metrics, to assess alignment and safety risks.
  • Collaborating with our research team to define and steer Atla's AI safety research direction, contributing significant findings to top-tier conferences.
  • Working closely with founders, advisors, and top-tier researchers to navigate the complexities of AI alignment.

Qualifications:

  • Evidence of exceptional ML engineering ability, including proficiency in training and evaluating language models across GPUs, preferably in PyTorch.
  • Proven track record in empirical research, including designing and executing experiments, and effectively writing up and communicating findings.
  • Strong software engineering experience with software design/architecture skills, preferably in Python.
  • Aptitude for distilling and applying ideas from complex research papers in our products.


  • London, Greater London, United Kingdom Atla Ai Full time

    About AtlaWe are a London-based start-up building the most capable AI evaluation models. Our mission is to engineer safe, beneficial AI systems that will have a massive positive impact on the future of humanity.RoleAs Atla's alignment research engineer, you'll develop language models as evaluators and use your insights to construct safety guardrails for...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    The AI Safety Institute is seeking a highly motivated and talented Research Scientist to join our team in the area of AI safety research. The successful candidate will work on studying, evaluating, and recommending mitigations for extreme risks from autonomous AI systems.Key ResponsibilitiesConduct research on AI safety and risk mitigationDevelop and...


  • London, Greater London, United Kingdom Atla Ai Full time

    Atla Ai: Safeguarding the Future of HumanityAbout Us:We're Atla Ai, a pioneering London-based start-up dedicated to engineering safe and beneficial AI systems. Our mission is to drive positive change in the world by developing cutting-edge AI evaluation models.Role Overview:As our alignment research engineer, you'll play a pivotal role in shaping the future...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    AI Safety Institute Job OverviewThe Department for Science, Innovation and Technology is seeking a highly skilled Senior AI Research Specialist to join its esteemed team at the forefront of artificial intelligence safety research. This role offers an exceptional opportunity for individuals with expertise in machine learning, large language models, and...

  • AI Safety Researcher

    2 weeks ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    We're focused on addressing extreme risks from autonomous AI systems that can interact with the real world. To do this, we're advancing the state of the art in risk modeling, incorporating insights from other safety-critical and adversarial domains, and developing novel techniques. We're also empirically evaluating these risks through one of the world's...

  • AI Safety Specialist

    1 month ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    Key ResponsibilitiesThe successful candidate will play a crucial role in shaping the AI Safety Institute's approach to AI safety, working closely with the Research Unit to develop and implement safety frameworks and guidelines.Key TasksDevelop and maintain safety frameworks and guidelines for AI development and deploymentCollaborate with the Research Unit to...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    We're pushing the boundaries of AI safety research at the AI Safety Institute. As a research scientist, you'll be part of a dynamic team exploring the risks of autonomous AI systems. Your expertise will help us advance the state of the science in risk modeling, incorporating insights from safety-critical and adversarial domains. You'll work closely with...

  • AI Safety Researcher

    15 hours ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    About the RoleWe are seeking a highly motivated and talented Research Scientist to join our Societal Impacts team at the AI Safety Institute. The successful candidate will work with our team to design and run studies that answer important questions about the effect AI will have on society.Key ResponsibilitiesDesign and run studies to evaluate the impact of...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    At the AI Safety Institute, we are dedicated to optimizing AI systems for state-of-the-art performance across various risk domains. Our Post-Training Team works tirelessly to fine-tune and scaffold AI models, ensuring they reach their full potential.About the RoleWe are seeking a strong Research Scientist to join our team. As a member of this team, you will...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    As advanced AI systems continue to evolve, the potential risks associated with their cyber capabilities pose a significant threat to organizational and individual security. These risks are particularly concerning when combined with other AI risk areas, such as harmful outcomes from biological and chemical capabilities, and autonomous systems.The AI Safety...

  • Research Scientist

    3 weeks ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    Job SummaryWe are seeking a highly skilled Research Scientist to join our Science of Evaluations team at the AI Safety Institute. As a Research Scientist, you will play a key role in conducting applied and foundational research focused on the measurement of frontier AI system capabilities.Key ResponsibilitiesDevelop and apply rigorous scientific techniques...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Estimated Salary: £80,000 - £110,000 per annumAbout the RoleWe are seeking an exceptional Senior AI Safety Researcher to join our team at the AI Safety Institute. This is a unique opportunity to contribute to the development of safety cases and advance the field of AI governance.Key ResponsibilitiesConduct foundational research on safety cases to help...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    We are seeking an exceptional Cybersecurity Research Engineer to join our team at the AI Safety Institute. Our goal is to develop first-of-its-kind government-run infrastructure to benchmark the progress of advanced AI capabilities in cyber security. The selected candidate will work closely with a cross-functional team of cybersecurity researchers, machine...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Role OverviewThe AI Safety Institute is seeking a highly skilled Senior AI Safety Researcher to join its Safeguard Analysis Team. The successful candidate will play a key role in researching and developing interventions that secure systems from abuse by bad actors.About the RoleThis is a challenging and rewarding opportunity for an experienced researcher to...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    We are advancing the state of the science in risk modeling, incorporating insights from safety-critical and adversarial domains, while developing novel techniques. Our research aims to empirically evaluate these risks by building one of the world's largest agentic evaluation suites and pushing forward the science of model evaluations.Job RoleYou will work as...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    About the RoleWe are seeking a highly skilled Workstream Lead to spearhead our Systemic Safety and Responsible Innovation team. As a key member of our leadership team, you will be responsible for developing and delivering a strategy to advance systemic AI safety research, shaping a market-shaping agenda, and building a multidisciplinary team focused on...

  • AI Safety Engineer

    1 day ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    The Post-Training Team at the AI Safety Institute is dedicated to optimizing AI systems for state-of-the-art performance in various risk domains. This involves a combination of scaffolding, prompting, supervised and RL fine-tuning of AI models.Key Responsibilities:Improve model performance using cutting-edge machine learning techniquesDevelop methodologies...

  • AI Researcher

    1 month ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    We're advancing the state of the science in risk modeling, incorporating insights from safety-critical and adversarial domains, while developing novel techniques. Our research focuses on extreme risks from autonomous AI systems, capable of interacting with the real world. As a risk modelling researcher, your work will span the full space of risks from...


  • London, Greater London, United Kingdom Atla Ai Full time

    Unlock the Future of AI with Atla AiAtla Ai is revolutionizing the field of Artificial Intelligence by developing safe and beneficial AI systems. We are a London-based start-up backed by top investors and founders, and we're looking for a talented Research Engineer to join our team.Role and ResponsibilitiesAs a Research Engineer at Atla Ai, you will be...

  • Research Scientist

    16 hours ago


    London, Greater London, United Kingdom AI Safety Institute Full time

    Are you passionate about improving AI systems for safer and more reliable operation? Do you have expertise in machine learning and a strong desire to apply it to real-world problems?We are seeking a talented Research Scientist to join our Post-Training Team at the AI Safety Institute. As a member of this team, you will be responsible for developing and...