Site Reliability Engineer

1 day ago


City of Westminster, United Kingdom MBDA S.A.S. Full time

Site Reliability Engineer (SRE) – AWS
Location: London
Salary: £100,000 per annum + Bonus + Excellent Benefits

We are looking for an SRE for a large-scale digital organisation in the middle of a major engineering modernisation journey. This is not a BAU support role; this is a chance to help define what “good” looks like as SRE is brought fully in-house for the first time. You'll work across high-impact platforms (web/mobile, payments, CRM, operations, cloud) and play a key role in shifting the organisation away from ticket-driven support and towards proactive, automated, AWS-first, engineering-led reliability.

Responsibilities
  • Embed SRE principles to improve availability, reliability, performance and incident response.
  • Modernise legacy support by introducing automation, observability, shift-left practices and CI/CD.
  • Work across multiple domains (web/mobile, payments, CRM, cloud infrastructure, airline systems).
  • Partner with vendors and internal engineering teams; influence technical and financial decisions.

Qualifications
  • 5+ years in an SRE or closely related reliability/DevSecOps discipline.
  • Strong knowledge of SRE practices: monitoring, observability, incident response, automation.
  • Hands-on with AWS and infrastructure-as-code (Terraform, Ansible or CloudFormation).
  • Experience with CI/CD pipelines and container platforms (Docker / Kubernetes).
  • Comfortable working with vendors, suppliers and internal product/engineering teams.
  • Ability to communicate clearly and influence engineering culture.

Lead Cloud Site Reliability Engineer – Azure/Kubernetes
Location: Hybrid working in Bristol (eligible for SC/DV clearance)
Salary: £70,000 - £95,000

We are looking for a Lead Cloud Site Reliability Engineer (SRE) with strong expertise in Azure, Kubernetes, Terraform, and GitHub to lead large-scale projects and mentor a growing team.

Key Responsibilities
  • Lead SRE activities for large-scale cloud projects.
  • Define and drive SLOs/SLIs, service health metrics and standards.
  • Troubleshoot, monitor and improve systems using tooling such as Datadog, Splunk, etc.
  • Contribute to IaC, containerisation and cloud-native adoption (AWS, Terraform, Docker/K8s).
  • Mentor and support engineers as SRE ways of working are introduced across the organisation.

Experience / What you bring
  • 5+ years in an SRE or closely related reliability/DevSecOps discipline.
  • Strong knowledge of SRE practices: monitoring, observability, incident response, automation.
  • Hands-on with AWS and infrastructure-as-code (Terraform, Ansible or CloudFormation).
  • Experience with CI/CD pipelines and container platforms (Docker / Kubernetes).
  • Comfortable working with vendors, suppliers and internal product/engineering teams.
  • Excellent communication and influence skills.

How to apply
Interested in learning more? Please get in touch with Benjamin Applewhaite to discuss the role in confidence.



  • City of Westminster, United Kingdom Thomson Reuters Full time

    We are seeking a senior technical expert to lead strategy and drive transformation through the coordination of teams implementing innovative engineering and reliability projects. These initiatives will leverage data‑driven and AI‑enabled tools, as well as other advanced digital technologies, to optimise engineering and reliability processes within the...


  • City of Westminster, United Kingdom Tribal Group Full time

    As a Site Reliability Engineer, you'll design, build, and operate large-scale systems with an emphasis on reliability, efficiency, and automation. You'll work across deployment, monitoring, and incident response to ensure our platforms stay healthy and our customers experience uninterrupted service. Responsibilities Maintaining and improving production...


  • City of Westminster, United Kingdom Macquarie Group Limited Full time

    Site Reliability Engineer. As a Site Reliability Engineer, you join our dynamic Engineering team and help ensure the reliability, scalability, and performance of our data platforms. You monitor, troubleshoot, and optimise systems that support vital data workflows, enabling smooth data movement and accessibility across the organisation. You work closely with...


  • City of London, United Kingdom Amelco Limited Full time

    Role: Site Reliability Engineer. Type: Full-time permanent role. Location: Hybrid / Shoreditch, London, 3 days per week. About Us: Amelco Ltd are a leading gaming and gambling solution software provider with a strong presence in the USA, UK, and Europe. Through partnerships with global gaming companies, we build cutting-edge technical platforms across sportsbooks,...


  • City of Westminster, United Kingdom Macquarie Group Limited Full time

    A leading financial services firm in the UK seeks a Site Reliability Engineer to join their dynamic Engineering team. This role involves ensuring the reliability and performance of data platforms, troubleshooting and optimizing data workflows, and collaborating with cross-functional teams. Ideal candidates will have strong skills in Python, SQL, and cloud...


  • City Of London, United Kingdom Different Technologies Pty Ltd. Full time

    Overview: We are hiring for a next-generation telecoms software company who are seeking a Network Autonomy Engineer to join their expanding team. Primary Function of the Position: Reporting to the Site Reliability Engineer Team Lead, the Site Reliability Engineer will be responsible for ensuring the reliability, scalability and performance of our...


  • City of Westminster, United Kingdom UK Health Security Agency Full time

    Overview The Digital and Directorate has primary responsibility for scientific computing and research computing services and support. Key functions of the Digital Development and Operations unit are to provide and support such platforms required by the staff of UKHSA and provide technical capabilities to enable public health services, both within the...


  • City of Westminster, United Kingdom Tribal Group Full time

    A leading EdTech business is looking for a Site Reliability Engineer to design, build, and operate large-scale systems focusing on reliability and automation. You will maintain production systems, support deployments, enhance automation tools, and analyze system performance metrics. The ideal candidate has strong experience with AWS or Azure, Linux, Apache,...


  • City Of Bristol, United Kingdom TwinStream Full time

    Details: Salary: £70,000 - £95,000 DOE. Location: Hybrid working in Bristol. Security Clearance: Eligible for SC/DV Clearance. About the role: Our cross-domain services are used in high-profile government organisations. The demand for these services continues to grow in both scope and scale. We are seeking an experienced Site Reliability Engineer to help satisfy...


  • City Of London, United Kingdom Opus Recruitment Solutions Full time

    Azure Site Reliability Engineer. 6-month contract. Onsite 2/3 days per week. 650 per day, Inside IR35. Opus RS are looking for a Senior Site Reliability Engineer with deep expertise in Azure cloud migration and a strong DevOps background to join our client's team. What We're Looking For: Previous experience as a Site Reliability Engineer. Strong skills in Terraform,...