Research Scientists and Engineers

2 weeks ago


London, United Kingdom Anthropic Limited Full time

You want to help construct and rapidly iterate on machine learning experiments to help us improve the behavior of powerful AI systems through finetuning. You care about making AI helpful, honest, and harmless, and are interested in shaping model behavior to be more aligned with human values and goals. You could describe yourself as both a scientist and an engineer. As a Research Scientist or Research Engineer on the Finetuning team, you'll contribute to research on improving language models through techniques like constitutional AI. You will have the opportunity to do creative, cutting-edge research on frontier models, and to see your work result in concrete improvements in performance and safety.
We generally expect research scientists to be able to iterate on their own experiments. We also provide opportunities for engineers to pursue their own research projects. Therefore this role can be more research oriented or more engineering oriented, depending on the experience and interests of the candidate.
Note: Currently, the team has a preference for candidates who are able to be based in the Bay Area. However, we remain open to any candidate who can meet the organization's 25% in-person policy.
Representative projects:
Help develop novel finetuning techniques to improve language model behavior and make models more helpful, honest, and harmless
Test out techniques like constitutional AI at scale and measure their impacts on model behavior
Build tooling and infrastructure to enable efficient fine-tuning experiments on large language models
Develop novel prompts and prompting strategies to improve and test model behaviors
Run experiments that feed into key AI research and safety efforts at Anthropic
You may be a good fit if you:
Have significant Python, machine learning, research engineering, or research experience
Prefer fast-moving collaborative projects with concrete goals that involve improving model behaviors
Are results-oriented, with a bias towards flexibility and impact
Pick up slack, even if it goes outside your job description
Care about the impact of AI and of your work
Strong candidates may also:
Haver prior experience with large language model finetuning techniques such as RLHF
Have experience with complex shared codebases and RL infrastructure
Have experience authoring research papers in machine learning, NLP, or AI alignment or similar industry experience
Deadline to apply: None. Applications will be reviewed on a rolling basis.
#J-18808-Ljbffr



  • London, United Kingdom COL Limited Full time

    Please note that due to a high volume of very talented applicants, there may be a lag of several weeks between submission and follow-up from Apollo Research on your application. In the long run, we aim to explain and perhaps reverse engineer arbitrary mechanisms of arbitrary neural networks. However, we expect the day-to-day work of most scientists and...


  • London, United Kingdom COL Limited Full time

    Please note that due to a high volume of very talented applicants, there may be a lag of several weeks between submission and follow-up from Apollo Research on your application. In the long run, we aim to explain and perhaps reverse engineer arbitrary mechanisms of arbitrary neural networks. However, we expect the day-to-day work of most scientists and...


  • London, United Kingdom COL Limited Full time

    Please note that due to a high volume of very talented applicants, there may be a lag of several weeks between submission and follow-up from Apollo Research on your application. In the long run, we aim to explain and perhaps reverse engineer arbitrary mechanisms of arbitrary neural networks. However, we expect the day-to-day work of most scientists and...


  • London, United Kingdom COL Limited Full time

    Please note that due to a high volume of very talented applicants, there may be a lag of several weeks between submission and follow-up from Apollo Research on your application. In the long run, we aim to explain and perhaps reverse engineer arbitrary mechanisms of arbitrary neural networks. However, we expect the day-to-day work of most scientists and...


  • London, United Kingdom Anthropic Limited Full time

    You want to help construct and rapidly iterate on machine learning experiments to help us improve the behavior of powerful AI systems through finetuning. You care about making AI helpful, honest, and harmless, and are interested in shaping model behavior to be more aligned with human values and goals. You could describe yourself as both a scientist and an...


  • London, United Kingdom Anthropic Limited Full time

    You want to help construct and rapidly iterate on machine learning experiments to help us improve the behavior of powerful AI systems through finetuning. You care about making AI helpful, honest, and harmless, and are interested in shaping model behavior to be more aligned with human values and goals. You could describe yourself as both a scientist and an...


  • London, United Kingdom Abs Data Full time

    Research Scientist/Engineer, Empirical Scientific Approaches in Global Business DivisionsInvestment Bank Your role Do you want to apply your expertise in coding to propel hypothesis-driven empirical research? We’re looking for a Research Scientist/Engineer to: build tools and help chart the strategy to bring enterprise statistical computing to...


  • London, United Kingdom Abs Data Full time

    Research Scientist/Engineer, Empirical Scientific Approaches in Global Business DivisionsInvestment Bank Your role Do you want to apply your expertise in coding to propel hypothesis-driven empirical research? We’re looking for a Research Scientist/Engineer to: build tools and help chart the strategy to bring enterprise statistical computing to...


  • London, United Kingdom Abs Data Full time

    Research Scientist/Engineer, Empirical Scientific Approaches in Global Business DivisionsInvestment Bank Your role Do you want to apply your expertise in coding to propel hypothesis-driven empirical research? We’re looking for a Research Scientist/Engineer to: build tools and help chart the strategy to bring enterprise statistical computing to...

  • Research Engineer

    6 days ago


    London, United Kingdom Anthropic Limited Full time

    You want to help construct and rapidly iterate on machine learning experiments to help us improve the behavior of powerful AI systems through finetuning. You care about making AI helpful, honest, and harmless, and are interested in shaping model behavior to be more aligned with human values and goals. You could describe yourself as both a scientist and an...

  • Research Engineer

    4 weeks ago


    London, United Kingdom Anthropic Limited Full time

    You want to help construct and rapidly iterate on machine learning experiments to help us improve the behavior of powerful AI systems through finetuning. You care about making AI helpful, honest, and harmless, and are interested in shaping model behavior to be more aligned with human values and goals. You could describe yourself as both a scientist and an...

  • Research Engineer

    4 weeks ago


    London, United Kingdom Anthropic Limited Full time

    You want to help construct and rapidly iterate on machine learning experiments to help us improve the behavior of powerful AI systems through finetuning. You care about making AI helpful, honest, and harmless, and are interested in shaping model behavior to be more aligned with human values and goals. You could describe yourself as both a scientist and an...

  • Research Engineer

    3 weeks ago


    London, United Kingdom Anthropic Limited Full time

    You want to help construct and rapidly iterate on machine learning experiments to help us improve the behavior of powerful AI systems through finetuning. You care about making AI helpful, honest, and harmless, and are interested in shaping model behavior to be more aligned with human values and goals. You could describe yourself as both a scientist and an...

  • Research Engineer

    2 weeks ago


    London, United Kingdom Anthropic Limited Full time

    You want to help construct and rapidly iterate on machine learning experiments to help us improve the behavior of powerful AI systems through finetuning. You care about making AI helpful, honest, and harmless, and are interested in shaping model behavior to be more aligned with human values and goals. You could describe yourself as both a scientist and an...

  • Research Engineer

    2 weeks ago


    London, United Kingdom Anthropic Limited Full time

    You want to help construct and rapidly iterate on machine learning experiments to help us improve the behavior of powerful AI systems through finetuning. You care about making AI helpful, honest, and harmless, and are interested in shaping model behavior to be more aligned with human values and goals. You could describe yourself as both a scientist and an...


  • London, United Kingdom COL Limited Full time

    Rolling Basis Applications: We are currently reviewing applications on a rolling basis, our active hiring round recently concluded. Please note that due to a high volume of very talented applicants, there may be a lag of several weeks between submission and follow-up from Apollo Research on your application. Thank you for your patience. About The Role: ...


  • London, United Kingdom COL Limited Full time

    Rolling Basis Applications: We are currently reviewing applications on a rolling basis, our active hiring round recently concluded. Please note that due to a high volume of very talented applicants, there may be a lag of several weeks between submission and follow-up from Apollo Research on your application. Thank you for your patience. About The...


  • London, United Kingdom COL Limited Full time

    Rolling Basis Applications: We are currently reviewing applications on a rolling basis, our active hiring round recently concluded. Please note that due to a high volume of very talented applicants, there may be a lag of several weeks between submission and follow-up from Apollo Research on your application. Thank you for your patience. About The...

  • Research Scientist

    3 weeks ago


    London, United Kingdom Notpla Limited Full time

    The Role At Notpla, we create disappearing packaging carefully engineered for a healthy planet. Founded on the belief that nature knows best, we’re an innovative, ideas and action-oriented scale-up developing and manufacturing packaging solutions from seaweed and plants that disappear naturally. We are currently looking to recruit a Research Scientist to...

  • Research Scientist

    3 weeks ago


    London, United Kingdom Notpla Limited Full time

    The Role At Notpla, we create disappearing packaging carefully engineered for a healthy planet. Founded on the belief that nature knows best, we’re an innovative, ideas and action-oriented scale-up developing and manufacturing packaging solutions from seaweed and plants that disappear naturally. We are currently looking to recruit a Research Scientist to...