Senior Site Reliability Engineering Leader

4 weeks ago


London, Greater London, United Kingdom Reward Gateway Full time
Engineering, London, Full Time, £100,000 - £120,000 / year

As a key member of our team, you will contribute to improving employee engagement and building better, stronger, and more resilient organisations to improve people's daily lives. Our shared mission guides our actions and charts a sustainable path to a better future.

Following the acquisition of Reward Gateway by Edenred, we are moving our existing operational workloads to an SRE approach and are seeking a skilled Head of Site Reliability Engineering to lead this transition. In this role, you will be responsible for establishing and managing our new SRE function, operating and modernizing our existing cloud infrastructure, and partnering with our DevOps team to ensure fast and supportable platform updates.

Key Responsibilities:
  • Establishing and managing our new SRE function
  • Operating and modernizing our existing cloud infrastructure
  • Partnering with our DevOps team to ensure fast and supportable platform updates
  • Maintaining the highest standards for our customer-facing systems
  • Balancing the desire for innovation with stability and delivery for our customers
  • Ensuring our availability and performance are maintained at the highest levels
  • Acting as a key Incident Commander and escalation point
  • Liaising closely with our SecOps teams to ensure timely vulnerability management
  • Implementing world-class observability standards using SLIs, SLOs, and error budgets

Required Skills:
  • Demonstrated leadership and management experience as a Senior Manager or Head of SRE within a global organization
  • Experience with a major cloud provider, preferably AWS
  • Enterprise infrastructure experience in high-availability environments
  • Automation skills through Terraform, Python, Bash, or similar
  • Wide-reaching SRE skills and a deep understanding of SRE practices
  • A strong understanding of SQL, PHP, Kubernetes, and CI/CD
  • Observability product experience (e.g., New Relic, Datadog)

The Interview Process:
  • Screening video interview with the Senior Talent Partner
  • Interview with the Director of Infrastructure and Head of Development
  • Final interview with the Director of Engineering & CTO

At Reward Gateway, we value all cultures, backgrounds, and experiences, as we truly believe that diversity drives innovation. Express yourself, join our community and help us Make the World a Better Place to Work.



  • London, Greater London, United Kingdom Remotestar Full time

    Remotestar is seeking a Senior Site Reliability Engineering Manager to join our client's team in the UK. The client is building a B2B marketplace for diamonds, and we need someone to ensure the reliability, scalability, and performance of our infrastructure and services. The ideal candidate will have a strong track record of building and maintaining highly...


  • London, Greater London, United Kingdom Reward Gateway Full time

    Engineering, London. Earn a salary of £110,000 - £130,000 per year with Reward Gateway. We are seeking an experienced Site Reliability Engineer to lead our team and drive the transformation of our operational workloads to a Site Reliability Engineering (SRE) approach. The successful candidate will be responsible for establishing and managing our new SRE...


  • London, Greater London, United Kingdom Google Full time

    About the Role: At Google, we're looking for a talented Cloud Engineer and Site Reliability Leader to join our team. As a key member of our SRE organization, you'll be responsible for designing, building, and operating large-scale distributed systems that meet the high standards of reliability, scalability, and performance. We're seeking someone with 8+ years...


  • London, Greater London, United Kingdom Apple Inc. Full time

    At Apple Inc., we're looking for a seasoned Site Reliability Engineering (SRE) manager to join our iCloud Services team. About the Role: We're seeking an accomplished builder and leader of teams with a passion for SRE and a track record of delivering operational perfection at scale. As a key member of our SRE leadership team, you will shape the future of how we...


  • London, Greater London, United Kingdom Remotestar Full time

    Remotestar is seeking a Senior Site Reliability Engineering Manager to join our client's team in the UK. The client is a leading B2B marketplace for diamonds, and we're looking for a seasoned expert to lead our infrastructure and services team. The ideal candidate will have a strong track record of building and maintaining highly reliable infrastructure and...


  • London, Greater London, United Kingdom Remotestar Full time

    Remotestar, a cutting-edge B2B diamond marketplace, seeks a Senior Site Reliability Engineering Manager to ensure the reliability, scalability, and performance of our infrastructure and services. As the SRE Manager, you will play a critical role in: Creating and maintaining high-end monitoring and automation tooling. Developing and maintaining tools, scripts,...


  • London, Greater London, United Kingdom Citigroup, Inc. Full time

    Citigroup, Inc. Chief Reliability Engineering Leader. About the Job: We are seeking a highly skilled Chief Reliability Engineering Leader to join our team at Citigroup, Inc. This is a full-time position based on a competitive salary of $200,000 per year. Job Description: The successful candidate will play a crucial role in driving operational excellence,...


  • London, Greater London, United Kingdom Mondrian Alpha Recruitment Solutions Full time

    At Mondrian Alpha Recruitment Solutions, we are seeking a highly skilled Site Reliability Engineer to join our team responsible for engineering and supporting the company's critical infrastructure platforms. This team handles the centralized development infrastructure and works alongside engineering teams across the business to ensure the optimal route of...


  • London, Greater London, United Kingdom Fourier Full time

    Key Responsibilities: As a Site Reliability Engineer at Fourier, you will be responsible for designing and implementing tools to enhance the reliability and resilience of our production systems. This includes investigating failures, improving system performance, and automating manual processes. Required Skills: Excellent Python scripting skills. Experience with...


  • London, Greater London, United Kingdom Apple Inc. Full time

    Site Reliability Engineering Manager, Apple. At Apple, we're not just building products - we're crafting experiences our customers love and depend on. Our Apple Services Engineering (ASE) team builds and supports the systems that make many of these daily experiences possible. If you've used Apple products, you've likely interacted with us. Our iCloud Services...


  • London, Greater London, United Kingdom Apple Inc. Full time

    Unlock the Future of Cloud Services. At Apple Inc., we're not just building products - we're crafting experiences that our customers love and depend on. Our Apple Services Engineering (ASE) team is responsible for the systems that make these daily experiences possible. If you've used Apple products, you've likely interacted with us. Our iCloud Services SRE...


  • London, Greater London, United Kingdom BenevolentAI Full time

    About the Role: BenevolentAI is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a Senior Site Reliability Engineer, you will be responsible for designing and implementing software solutions for cloud infrastructure, improving long-term infrastructure availability and reliability, and monitoring and handling incident response of...


  • London, Greater London, United Kingdom Lorien Full time

    Key Responsibilities: Collaborate with the existing team to deliver a brand-new project. Work on a hybrid model with 1 day a week on-site in London. Develop and maintain reliable and efficient systems. Utilize experience with Java, Python, Splunk, ServiceNow, and MongoDB. Contribute to incident management and application monitoring. Ensure seamless interaction...


  • London, Greater London, United Kingdom Canonical Full time

    Job Summary: Canonical is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our Infrastructure Services team, you will be responsible for designing, implementing, and operating highly available and scalable cloud infrastructure. Key Responsibilities: Drive the development of automation and GitOps practices within the...


  • London, Greater London, United Kingdom Tbwa ChiatDay Inc Full time

    About Stacklok: Stacklok is an innovative software supply chain security startup that empowers developers to make safer open source dependency choices. We're seeking a Senior Site Reliability Engineer (SRE) to support Trusty, our package intelligence service. This role focuses on driving essential initiatives in automation, system monitoring, configuration...


  • London, Greater London, United Kingdom College of Charleston Full time

    Transformative SRE Leadership Opportunity. Are you a seasoned leader with a passion for strategy, leadership, and engineering excellence? Do you want to make a meaningful impact at a global financial institution? We're seeking a talented Site Reliability Engineering Manager to join our Operations and Technology Chief Information Office Business area. About the...


  • London, Greater London, United Kingdom Cisco Full time

    Job Overview: The Cisco Site Reliability Engineering team is responsible for providing tools, services, and infrastructure to monitor and observe the ThousandEyes platform. As a Senior Site Reliability Engineer, you will own our logging pipeline and monitoring stack while working with developers to continuously improve our view of the platform. Key...


  • London, Greater London, United Kingdom Spectrum IT Recruitment Full time

    Job Title: Senior Site Reliability Engineer. We are partnering with a leading company to help them scale their digital marketplace consumer services. This role is crucial in streamlining software delivery pipelines, enhancing reliability, performance, and scalability of systems, and driving continuous improvement across the software lifecycle. You will be...


  • London, Greater London, United Kingdom J Bandy Consulting Full time

    Job Summary: J Bandy Consulting is seeking an experienced Site Reliability Engineer to join our team. The ideal candidate will have a strong background in software engineering and a passion for building scalable and reliable systems. Key Responsibilities: Develop and implement automation tools to improve the efficiency of our systems. Collaborate with...


  • London, Greater London, United Kingdom Google Full time

    Job Overview: This is an exciting opportunity to join our Site Reliability Engineering (SRE) team at Google Cloud, where you will play a critical role in ensuring the reliability and uptime of our services. As a seasoned technical leader, you will be responsible for leading teams, developing and implementing solutions, and providing expert guidance to drive...