Head of Site Reliability Engineering

1 month ago


London, Greater London, United Kingdom Rewardgateway Full time
Job Title: Head of Site Reliability Engineering

At Reward Gateway, we're seeking a highly skilled and experienced Head of Site Reliability Engineering to join our team. As a key member of our engineering organization, you will be responsible for establishing and managing our new SRE function, operating and modernizing our existing cloud infrastructure, and partnering with our DevOps team to ensure fast and supportable platform updates.

Key Responsibilities:
  • Establishing and managing our new SRE function
  • Operating and modernizing our existing cloud infrastructure
  • Partnering with our DevOps team to ensure fast and supportable platform updates
  • Maintaining the highest standards for our customer-facing systems
  • Balancing the desire for innovation with stability and delivery for our customers
  • Ensuring our availability and performance are maintained at the highest levels
  • Acting as a key Incident Commander and escalation point
  • Liaising closely with our SecOps teams to ensure timely vulnerability management
  • Educating teams in SRE practices and maintaining high standards of compliance
  • Implementing world-class observability standards utilizing SLI/SLO/Error Budgets
  • Continually evolving our observability platforms for greater coverage
  • Liaising with Product & Engineering teams for constant evolution of metrics
  • Aligning SRE Sprints & Backlog with our roadmaps to meet business expectations
  • Guiding our teams in a more Agile approach to demand management
  • Actively taking part in our daily stand-ups and keeping our Sprints on track
  • Keeping up-to-date documentation in our JIRA & Confluence tools
  • Owning and maintaining our SRE Incident Management processes
  • Ensuring a focus on cost efficiency for our platforms & services
  • Removing obstacles and fostering team collaboration
  • Communicating with our stakeholders

We value all cultures, backgrounds, and experiences, as we truly believe that diversity drives innovation. Express yourself, join our community, and help us Make the World a Better Place to Work.

From perks to people, our BETTER approach to hiring earns us more trust, happier people, and more world-class talent that help us to make the world a better place to work. Find out more about Reward Gateway's approach to benefits, equality, talent, technology, and empathy, and what you'll get in return for joining our Mission at rg.



  • London, Greater London, United Kingdom Dabster Full time

    Dabster is a leading company in the field of [industry], and we are looking for a talented Site Reliability Engineer Leader to join our team. In this role, you will be responsible for ensuring the reliability and performance of our systems, while also collaborating with cross-functional teams to drive business growth.The ideal candidate will have a strong...


  • London, Greater London, United Kingdom Selby Jennings Full time

    About Selby JenningsWe're a leading global financial services firm where technologists and investment professionals collaborate to drive innovation and operational excellence.About the RoleAs a Site Reliability Engineer, you'll apply your expertise in software and systems engineering to design, build, and maintain our robust infrastructure. You'll reduce...


  • London, Greater London, United Kingdom GoCardless Full time

    The RoleGoCardless is looking for a Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining the infrastructure and systems that support our payment and open banking products.Key ResponsibilitiesDesign and implement scalable and efficient infrastructure solutionsDevelop...


  • London, Greater London, United Kingdom Preqin Full time

    About the Role:Preqin is seeking an experienced Site Reliability Engineer to join our team in London. As a Site Reliability Engineer, you will work across Preqin's full suite of services, supporting our clients around the world.You will be responsible for designing, building, and operating our infrastructure, middleware, and CI/CD systems to ensure our teams...


  • London, Greater London, United Kingdom Highfield Professional Solutions Ltd Full time

    Highfield Professional Solutions Ltd is seeking a Site Reliability Engineer to join our team in Central London. The successful candidate will be responsible for managing and maintaining critical engineering systems within our Data Centre, ensuring that they operate efficiently and effectively. This role offers a competitive salary of up to 48,000 per year,...


  • London, Greater London, United Kingdom Rewardgateway Full time

    Engineering, LondonEarn a salary of £110,000 - £130,000 per year with Reward Gateway.We are seeking an experienced Site Reliability Engineer to lead our team and drive the transformation of our operational workloads to a Service Reliability Engineering (SRE) approach. The successful candidate will be responsible for establishing and managing our new SRE...


  • London, Greater London, United Kingdom J Bandy Consulting Full time

    Job OverviewJ Bandy Consulting seeks a skilled Site Reliability Engineer to ensure the reliability, scalability, and performance of our systems. The successful candidate will be responsible for developing the SRE culture, applying automation, and monitoring application performance.Key ResponsibilitiesDrive the evolution of the DevOps/GitOps toolchain to...


  • London, Greater London, United Kingdom BenevolentAI Full time

    About the Role:BenevolentAI is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a Senior Site Reliability Engineer, you will be responsible for designing and implementing software solutions for cloud infrastructure, improving long-term infrastructure availability and reliability, and monitoring and handling incident response of...


  • London, Greater London, United Kingdom STAND 8 Full time

    Job SummaryWe are seeking an experienced Site Reliability Engineer to join our team at STAND 8. As a Site Reliability Engineer, you will be responsible for maintaining existing systems, working on infrastructure modernization, and supporting the streaming engineering team to ensure smooth operation of linear streaming channels.Key ResponsibilitiesMaintain...


  • London, Greater London, United Kingdom Hamilton Barnes Associates Limited Full time

    Job Title: Site Reliability EngineerHiring Company: Hamilton Barnes Associates LimitedWe are seeking a highly skilled and experienced Site Reliability Engineer to join our team on a 6-month contract basis. The selected candidate will be working with one of the largest technology companies globally, ensuring seamless database environment operations and...


  • London, Greater London, United Kingdom Kroo Bank Ltd Full time

    **Job OverviewWe are seeking an highly motivated Site Reliability Engineer to join our team. The successful candidate will have excellent technical skills and experience in cloud computing, specifically with AWS. The role involves taking ownership of core services, monitoring performance, and improving reliability.


  • London, Greater London, United Kingdom BenevolentAI Full time

    **Job Overview:**We are seeking a highly skilled Senior Site Reliability Engineer to join our team at BenevolentAI. As a key member of our squad, you will play a crucial role in ensuring the reliability and scalability of our cloud infrastructure.The ideal candidate will have a strong background in software development, with experience in implementing cloud...


  • London, Greater London, United Kingdom Preqin Full time

    Role Overview Preqin is seeking an experienced Site Reliability Manager to join our Engineering team. As a Site Reliability Manager, you will play a crucial role in designing, operating, and supporting our infrastructure, middleware, and internal services. Key Responsibilities Design and operate scalable and high-available services, while establishing...


  • London, Greater London, United Kingdom Spectrum IT Recruitment Full time

    Job Title: Senior Site Reliability EngineerWe are partnering with a leading company to help them scale their digital marketplace consumer services.This role is crucial in streamlining software delivery pipelines, enhancing reliability, performance, and scalability of systems, and driving continuous improvement across the software lifecycle.You will be...


  • London, Greater London, United Kingdom Techruiter Full time

    We are a pioneering tech company specialising in cutting-edge Language Models (LLM) and Machine Learning solutions.About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team and ensure the reliability, scalability, and performance of our LLM and Machine Learning infrastructure.As an SRE, you will play a critical role in...


  • London, Greater London, United Kingdom LinuxRecruit Full time

    Join Our TeamAbout Us: LinuxRecruit is a leading organization dedicated to delivering innovative solutions and exceptional user experiences.We are committed to fostering a culture of excellence, collaboration, and continuous learning. By joining our team, you will be part of a dynamic and inclusive organization that values your contributions and supports...


  • London, Greater London, United Kingdom LinuxRecruit Full time

    Company OverviewWe are a leading technology company, celebrated globally for our cutting-edge solutions and unparalleled user base. Our culture of excellence, collaboration, and continuous learning drives us to innovate and push boundaries.Salary: $120,000 - $180,000 per year (dependent on experience)Job Description: As a Site Reliability Engineer, you will...


  • London, Greater London, United Kingdom IO Associates Full time

    Job Title: Site Reliability EngineerIO Associates is seeking a highly skilled Site Reliability Engineer with Active NPPV3 Clearance to work on a short-term project within the Law Enforcement sector.Key Responsibilities:Monitor system performance and security.What We Offer:This is a short-term contract paying up to £500 per day Outside IR35 for an initial 6...


  • London, Greater London, United Kingdom Google Full time

    Job OverviewThis is an exciting opportunity to join our Site Reliability Engineering (SRE) team at Google Cloud, where you will play a critical role in ensuring the reliability and uptime of our services. As a seasoned technical leader, you will be responsible for leading teams, developing and implementing solutions, and providing expert guidance to drive...


  • London, Greater London, United Kingdom AVT Reliability Ltd Full time

    About AVT Reliability LtdWe are a leading company in the field of asset integrity and reliability. Our team is passionate about delivering high-quality services to our clients.Job SummaryThis is an exciting opportunity for a talented engineering graduate to join our Asset Integrity Division as a specialist. You will be responsible for supporting a diverse...