Reinforcement Learning

2 weeks ago

London Area, United Kingdom Humanoid Full time

Reinforcement Learning (RL) Engineer, ManipulationHumanoid is the first AI and robotics company in the UK, creating the world’s most advanced, reliable, commercially scalable, and safe humanoid robots. Our first humanoid robot HMND 01 is a next-gen labour automation unit, providing highly efficient services across various use cases, starting with industrial applications. Our MissionAt Humanoid we strive to create the world’s leading, commercially scalable, safe, and advanced humanoid robots that seamlessly integrate into daily life and amplify human capacity.VisionIn a world where artificial intelligence opens up new horizons, our faith in its potential unveils a new outlook where, together, humans and machines build a new future filled with knowledge, inspiration, and incredible discoveries. The development of a functional humanoid robot underpins an era of abundance and well-being where poverty will disappear, and people will be able to choose what they want to do. We believe that providing a universal basic income will eventually be a true evolution of our civilization.SolutionAs the demands on our built environment rise, labour shortages loom. With the world’s workforce increasingly moving away from undesirable tasks, the manufacturing, construction, and logistics industries critical to our daily lives are left exposed. By deploying our general-purpose humanoid robots in environments deemed hazardous or monotonous, we envision a future where human well-being is safeguarded while closing the gaps in critical global labour needs.What You’ll DoTrain language-vision conditioned manipulation policies via reinforcement learning (RL) in simulation and in the real world.Construct challenging and diverse suites of manipulation tasks in simulation.Partner with teleoperations to collect trajectories in simulation for behavior cloning.Partner with testing and operations to establish real-world RL training pipelines.Experiment with various ways of bringing policies trained in simulation to the real world..We’re Looking For3+ years building deep‑learning systems (industry or research) with shipped models or published artifacts to show for it.Hands‑on with at least one of: LLMs, VLMs, or image/video generative models — architecture, training, and inference.Experience solving real problems using reinforcement learning with deep neural networks in any domain.Strong Python + PyTorch/JAX; you can profile, debug numerics, and write maintainable research code.You are self-driven, pro-active, communicate efficiently, document experiments clearly and communicate trade‑offs crisply.Nice to haveExperience with simulators for robotics (Isaac Sim, MuJoCo etc.)Experience in RL for robotics.Experience building infrastructure for large-scale RL (e.g. using ray).Publications at ICLR/ICML/NeurIPS or equivalent open‑source contributions.Familiarity with OpenVLA, Physical Intelligence (π) models, or similar open VLA frameworks.What We OfferCompetitive salary plus participation in our Stock Option PlanPaid vacation with adjustments based on your location to comply with local labor laws, and additional paid sick leave daysTravel opportunities to our Vancouver and Boston officesOffice perks: free breakfasts, lunches, snacks, and regular team eventsFreedom to influence the product and own key initiativesCollaboration with top‑tier engineers, researchers, and product experts in AI and roboticsStartup culture prioritising speed, transparency, and minimal bureaucracyHow to ApplyDoes this role sound like the perfect fit for you?Fill in the form and include links or files that showcase the best of what you’ve built and achieved.

Senior Reinforcement Learning expert

1 week ago

London Area, United Kingdom Barrington James Full time

I am hiring Senior Robotic & AI Engineer to drive the development of intelligent controllers for real-world robotic systems. This is a hands-on, highly technical role: you’ll design, build, and maintain advanced learning pipelines that combine imitation learning, reinforcement learning, and language or vision-conditioned models. You will play a pivotal...
Senior Reinforcement Learning expert

4 days ago

London Area, United Kingdom Barrington James Full time

I am hiringSenior Robotic & AI Engineerto drive the development of intelligent controllers for real-world robotic systems. This is a hands-on, highly technical role: you'll design, build, and maintain advanced learning pipelines that combine imitation learning, reinforcement learning, and language or vision-conditioned models. You will play apivotal role in...
Machine Learning Engineer

7 days ago

London, United Kingdom FBI &TMT Full time

I am recruiting on behalf of a leading client in the technology sector who is seeking a highly skilled and experienced Machine Learning Engineer with a strong background in Reinforcement Learning. This role will contribute to the continued development of Arena, the company's web-based platform for reinforcement learning training and RLOps, as well as its...
Machine Learning Engineer

7 days ago

London, United Kingdom FBI &TMT Full time

I am recruiting on behalf of a leading client in the technology sector who is seeking a highly skilled and experienced Machine Learning Engineer with a strong background in Reinforcement Learning. This role will contribute to the continued development of Arena, the company's web-based platform for reinforcement learning training xsabvtc and RLOps, as well as...
Reinforcement learning Engineer

1 week ago

London E, United Kingdom Go Arrow Full time £60,000 - £100,000 per year

We are looking for a Reinforcement Learning (RL) Engineer to join our AI research and development team. In this role, you will design, implement, and optimize reinforcement learning algorithms to solve complex, real-world problems. You'll collaborate closely with data scientists, machine learning engineers, and software developers to bring intelligent...
DevOps Engineer – Reinforcement Learning Platforms

5 days ago

London Area, United Kingdom Gattaca Full time £80,000 - £120,000 per year

DevOps Engineer – Reinforcement Learning PlatformsWe are seeking an experienced DevOps Engineer to help build and scale a web-based platform for reinforcement learning (RL) training and RLOps. You will design, implement, and maintain the cloud infrastructure, CI/CD pipelines, and deployment systems that support large-scale RL workloads.Responsibilities•...
Senior Reinforcement Learning expert

3 weeks ago

london, United Kingdom Barrington James Full time

I am hiring Senior Robotic & AI Engineer to drive the development of intelligent controllers for real-world robotic systems. This is a hands-on, highly technical role: you’ll design, build, and maintain advanced learning pipelines that combine imitation learning, reinforcement learning, and language or vision-conditioned models. You will play a pivotal...
Senior Reinforcement Learning expert

3 days ago

London, United Kingdom Barrington James Full time

I am hiring Senior Robotic & AI Engineer to drive the development of intelligent controllers for real-world robotic systems. This is a hands-on, highly technical role: you’ll design, build, and maintain advanced learning pipelines that combine imitation learning, reinforcement learning, and language or vision-conditioned models. You will play a pivotal...
Machine Learning Engineer

3 days ago

City Of London, United Kingdom AgileRL Ltd Full time

Machine Learning Engineer (Reinforcement Learning) We are seeking a talented and experienced Machine Learning Engineer with a background in Reinforcement Learning to join our team. This engineer will contribute to the further development of Arena, a web-based software platform for reinforcement learning training and RLOps, and our open-source reinforcement...
Senior Reinforcement Learning expert

4 weeks ago

City of London, United Kingdom Barrington James Full time

I am hiring Senior Robotic & AI Engineer to drive the development of intelligent controllers for real-world robotic systems. This is a hands-on, highly technical role: you’ll design, build, and maintain advanced learning pipelines that combine imitation learning, reinforcement learning, and language or vision-conditioned models. You will play a pivotal...

Americas

Europe

Asia / Oceania

Africa

Reinforcement Learning