Site Reliability Engineer
9 hours ago
Role:
Senior SRE
Skills:
Deep Linux, Scripting - Python, DevOps, Kubernetes
Salary:
£500k Plus
Location:
London
The ideal candidate comes from a top-tier tech environment (FAANG, elite trading, hyperscale infra). They have experience building technology
0→1
, owning systems end-to-end, and working close to the metal. They will operate across everything from
bare-metal Linux
to
modern build and observability stacks
.
Overview
Join a core engineering group as Lead Site Reliability Engineer, designing and scaling Linux platforms that underpin ML/AI-driven trading. You will architect and own reliability for massive simulation, HPC, and production workloads—ensuring ultra-reliable, ultra-fast trading systems. This is a hands-on, leadership role focused equally on technical depth, strategic decision-making, and driving platform SRE excellence.
Key Responsibilities
- Lead SRE practices for Linux platforms powering low-latency, high-throughput trading workloads.
- Architect, optimize, and tune Linux for performance, resilience, and minimal latency.
- Drive incident response, root cause analysis, and continuous reliability improvement across production systems.
- Oversee system automation and reproducibility—build, deploy, and fleet-manage bare-metal Linux and containerized stacks.
- Manage and enhance Kubernetes clusters, network configuration, and large-scale orchestration.
- Set observability standards; expand monitoring, alerting, and performance metrics across platforms.
- Analyze networking, kernel-level performance, and distributed systems—solving core challenges in a multi-petabyte, multi-cluster environment.
- Build Python tools for automation, reliability engineering, and performance analysis.
- Mentor and lead a high-performing Linux engineering/SRE team.
What You Will Work On
- Ultra-reliable, high-performance trading infrastructure where every engineering optimization affects performance
- Next-generation simulation and HPC compute pipelines, supporting ML/AI workflows at scale.
- Integration and continuous improvement of internal and open-source tools for automation and reliability.
- Strategic platform direction: shaping foundational systems for critical infrastructure in an elite trading environment.
Team and Culture
- Small, autonomous Linux SRE team with direct ownership and impact.
- Collaborative engagement with quants, researchers, and trading experts to deliver robust platforms.
- A culture built on deep technical ownership, learning, and high standards of performance engineering
Apply now for an informal confidential chat
-
Site Reliability Engineer
2 weeks ago
London Area, United Kingdom K&K Talents Full time £40,000 - £80,000 per yearK&K Talentsis an international recruiting agency that has been providing technical resources in the European region since 1993. This position is with one of our clients inPolandwho is actively hiring candidates to expand their teams.Role: Site Reliability EngineerLocation: London, UK (Onsite)Employment type: Contract IR35Years of experience in Site...
-
Site Reliability Engineer
31 minutes ago
London Area, United Kingdom Xpertise Recruitment Full time £50,000 - £150,000 per yearSite Reliability Engineer (SRE) – AWSLocation:LondonSalary:£100,000 per annum + Bonus + Excellent BenefitsI am looking for an SRE for a large-scale digital organisation in the middle of a major engineering modernisation journey. This is not a BAU support role, this is a chance to help define what "good" looks like as SRE is brought fully in-house for the...
-
Site Reliability Engineer
7 days ago
London Area, United Kingdom RP International Full timeOur client currently seeks a SC Cleared Senior Site Reliability Engineer to join their dynamic team on an initial 6 month contract. This role will be done on a Hybrid based working model (2 days on-site a week in London).Requirements:Extensive experience with Azure, particularly in migrating applications to the Azure cloud platform.Proven track record of...
-
Site Reliability Engineer
7 days ago
London Area, United Kingdom RP International Full timeOur client currently seeks a SC Cleared Senior Site Reliability Engineer to join their dynamic team on an initial 6 month contract. This role will be done on a Hybrid based working model (2 days on-site a week in London). Requirements: Extensive experience with Azure, particularly in migrating applications to the Azure cloud platform. Proven track record of...
-
Site Reliability Engineer
7 days ago
Manchester Area, United Kingdom Anson McCade Full timeAbout the RoleAre you passionate about building resilient systems and eliminating operational toil through automation? We’re looking for a Site Reliability Engineer (SRE) to join our high-impact team and help shape the future of our digital infrastructure.As an SRE, you’ll blend software engineering with systems engineering to ensure the reliability,...
-
Site Reliability Engineer
4 days ago
London Area, United Kingdom RP International Full time £60,000 - £120,000 per yearOur client currently seeks aSC Cleared Senior Site Reliability Engineerto join their dynamic team on an initial 6 month contract. This role will be done on a Hybrid based working model (2 days on-site a week in London).Requirements:Extensive experience with Azure, particularly in migrating applications to the Azure cloud platform.Proven track record of cloud...
-
Site Reliability Engineer
1 week ago
London Area, United Kingdom psd group Full time £60,000 - £100,000 per yearFor one of our clients, a leading provider of innovative software solutions for the financial services industry, we are currently looking for an experienced Cloud Site Reliability Engineer to join their UK entity in London (hybrid).Working with our client means joining one of the most recognised leaders in the fintech sector, renowned for their expertise in...
-
Site Reliability Engineer
2 weeks ago
London Area, United Kingdom Blockchain Full time £80,000 - £100,000 per yearis connecting the world to the future of finance. As the most trusted and fastest-growing global crypto company, it helps millions of people worldwide safely access cryptocurrency. Since its inception in 2011, has earned the trust of over 90 million wallet holders and more than 40 million verified users, facilitating over $1 trillion in crypto...
-
Lead Cloud Site Reliability Engineer
7 hours ago
London Area, United Kingdom LSA Recruit Full time £60,000 - £120,000 per yearJob opportunity forLead Cloud Site Reliability Engineer (SRE)based inLondon, UK - Contract (SC Cleared)Job Description:Job Description –We're looking for aLead Cloud Site Reliability Engineer (SRE)with strong expertise inAzure, Kubernetes, Terraform, and GitHubto lead large-scale projects and mentor a growing team.Key ResponsibilitiesLead SRE activities...
-
Site Reliability Engineer
6 days ago
London, Greater London, United Kingdom eMFusion Global Full time £60,000 - £120,000 per yearJob Opportunity: Freelance Site Reliability Engineer (Outside IR35)£ | Remote (UK-Based) | Occasional travel to Farnborough or HammersmithContract until 2026We're hiring two hands-on Site Reliability Engineers (SREs) to join a fast-moving platform team on a long-term contract. This role is ideal for engineers with strong coding skills who are comfortable...