Current jobs related to Site Reliability Engineering Manager - London, Greater London - Apollo solutions


  • London, Greater London, United Kingdom Insight Global Full time

    Site Reliability Engineer OpportunityInsight Global is seeking a skilled Site Reliability Engineer to join their team in West London. As a key member of the streaming engineering group, you will be responsible for ensuring the smooth operation of linear streaming channels.The ideal candidate will have previous experience in Site Reliability Engineering, with...


  • London, Greater London, United Kingdom Insight Global Full time

    Site Reliability Engineer OpportunityInsight Global is seeking a skilled Site Reliability Engineer to join their team in West London. As a key member of the streaming engineering group, you will be responsible for ensuring the smooth operation of linear streaming channels.The ideal candidate will have previous experience in Site Reliability Engineering, with...


  • London, Greater London, United Kingdom Fourier Full time

    Key ResponsibilitiesWe are seeking a highly skilled Site Reliability Engineer to join our team at Fourier. As a member of our Site Reliability Engineering team, you will be responsible for developing tools to enhance and monitor our production systems.Required SkillsExcellent Python scripting skillsExperience with Version Control best practicesGood knowledge...


  • London, Greater London, United Kingdom Fourier Full time

    Key ResponsibilitiesWe are seeking a highly skilled Site Reliability Engineer to join our team at Fourier. As a member of our Site Reliability Engineering team, you will be responsible for developing tools to enhance and monitor our production systems.Required SkillsExcellent Python scripting skillsExperience with Version Control best practicesGood knowledge...


  • London, Greater London, United Kingdom Apollo Solutions Full time

    Job OverviewSite Reliability Engineering ManagerApollo Solutions is seeking a seasoned Site Reliability Engineering Manager to lead our team in ensuring the reliability and efficiency of our cloud-based services. As a key member of our Platform Engineering team, you will be responsible for driving the development and implementation of our cloud...


  • London, Greater London, United Kingdom Apple Full time

    About the RoleWe're seeking a seasoned Site Reliability Engineering Manager to lead our team of engineers responsible for the reliability and performance of our on-prem and cloud-based services. As a key member of our Apple Services Engineering team, you will be responsible for managing staging and production environments, promoting observability of systems,...


  • London, Greater London, United Kingdom MarkLogic Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at CMK Resources. As a Site Reliability Engineer, you will be responsible for ensuring the stability and performance of our infrastructure, with a focus on Portworx and Kubernetes environments.Key Responsibilities:Upgrade and manage Portworx...


  • London, Greater London, United Kingdom MarkLogic Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at CMK Resources. As a Site Reliability Engineer, you will be responsible for ensuring the stability and performance of our infrastructure, with a focus on Portworx and Kubernetes environments.Key Responsibilities:Upgrade and manage Portworx...


  • London, Greater London, United Kingdom MarkLogic Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at CMK Resources. As a Site Reliability Engineer, you will be responsible for ensuring the stability and performance of our infrastructure, with a focus on cloud platforms, virtualization tools, and automation tools.Key Responsibilities:Upgrade and...


  • London, Greater London, United Kingdom Fourier Full time

    Key ResponsibilitiesAs a Site Reliability Engineer at Fourier, you will be responsible for designing and implementing tools to enhance the reliability and resilience of our production systems. This includes investigating failures, improving system performance, and automating manual processes.Required SkillsExcellent Python scripting skillsExperience with...


  • London, Greater London, United Kingdom Fourier Full time

    Key ResponsibilitiesWe are seeking a skilled Site Reliability Engineer to join our team at Fourier. As a member of our Site Reliability Engineering team, you will be responsible for developing tools for surveillance and enhancement of our production systems.Key responsibilities include increasing system resilience, investigating failure, and improving...


  • London, Greater London, United Kingdom J Bandy Consulting Full time

    Job Title: Site Reliability EngineerAt J Bandy Consulting, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems.Key Responsibilities:Develop and maintain a culture of reliability and performance within...


  • London, Greater London, United Kingdom J Bandy Consulting Full time

    Job Title: Site Reliability EngineerAt J Bandy Consulting, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems.Key Responsibilities:Develop and maintain a culture of reliability and performance within...


  • London, Greater London, United Kingdom iO Associates Full time

    Job Opportunity: Site Reliability EngineeriO Associates is seeking a skilled Site Reliability Engineer to join our team for a short-term project within the Law Enforcement sector.Key Responsibilities:Monitor system performance and security to ensure optimal functionality.Collaborate with our team to identify and resolve technical issues.This role offers a...


  • London, Greater London, United Kingdom iO Associates Full time

    Job Opportunity: Site Reliability EngineeriO Associates is seeking a skilled Site Reliability Engineer to join our team for a short-term project within the Law Enforcement sector.Key Responsibilities:Monitor system performance and security to ensure optimal functionality.Collaborate with our team to identify and resolve technical issues.This role offers a...


  • London, Greater London, United Kingdom LinuxRecruit Full time

    Unlock Your Potential as a Site Reliability EngineerAre you a seasoned Site Reliability Engineer looking to take your skills to the next level? Do you thrive in fast-paced environments and have a keen eye for detail? We're seeking a talented individual to join our team and contribute to the development and maintenance of our cutting-edge platform.As a Site...


  • London, Greater London, United Kingdom J Bandy Consulting Full time

    At J Bandy Consulting, we are seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our Site Reliability Engineering team, you will be responsible for ensuring the reliability, scalability, and performance of our systems.The ideal candidate will have a strong background in SRE best practices, with expertise in Git and...


  • London, Greater London, United Kingdom Arrows Full time

    About the RoleArrows is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for ensuring the smooth operation of our linear streaming channels. This involves working closely with our streaming engineers to maintain and improve our existing system, as well as developing a new...


  • London, Greater London, United Kingdom Arrows Full time

    About the RoleArrows is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for ensuring the smooth operation of our linear streaming channels. This involves working closely with our streaming engineers to maintain and improve our existing system, as well as developing a new...


  • London, Greater London, United Kingdom ESL FACEIT Group Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at ESL FACEIT Group. As a key member of our infrastructure team, you will be responsible for designing, analyzing, and troubleshooting large-scale distributed systems.As a Site Reliability Engineer, you will work closely with our software engineering teams to deploy and...

Site Reliability Engineering Manager

2 months ago


London, Greater London, United Kingdom Apollo solutions Full time
Site Reliability Platform Engineering Manager

Apollo Solutions is seeking a highly skilled Site Reliability Platform Engineering Manager to lead our team in delivering exceptional service reliability and efficiency.

Key Responsibilities:

  • Lead a team of L1/L2 engineers to improve incident resolution, blameless post-mortems, and problem records.
  • Ensure service tickets and incidents are resolved within SLA and effectively passed on to product teams.
  • Drive cloud compliance framework controls, including annual DR and recovery testing, capacity management, and more.
  • Continuously improve service ticket and incident resolution rates.
  • Identify top reasons for service requests and incidents, and address root causes to reduce tickets.
  • Provide thought leadership in operational areas like change and release management, capacity management, and backup and recovery.
  • Ensure the team is correctly skilled for their roles and identify candidates for transition from Ops to SRE.

Requirements:

  • Solid understanding of SRE principles and experience working with Azure and GCP.
  • Experience with CI/CD and infrastructure as code tools like Terraform, GitHub, Azure DevOps, and more.
  • Experience leading an SRE or Operations team and negotiating skills to influence technical decisions.
  • Good understanding of public cloud security and experience leading teams in a large, complex industry.
  • Azure or GCP Certifications are desirable.
  • Experience handling risks and controls across technical platforms.

What We Offer:

  • Up to 15% pension contribution
  • 30% bonus
  • Hybrid working pattern
  • Private Healthcare
  • Access to Share Schemes

Apollo Solutions is a dynamic company shaping the future of Technology. If you're passionate about Platform Engineering/Site Reliability, please send your CV for a confidential discussion. Note: No Sponsorship is offered.