Site Reliability Engineering Manager

4 weeks ago


London, Greater London, United Kingdom Apollo Solutions Full time

{"h1": "Site Reliability Engineering Manager", "p": "At Apollo Solutions, we are seeking a highly skilled Site Reliability Engineering Manager to lead our team in ensuring services are operational while supporting program timelines and business outcomes.

Responsibilities:

* Lead the L1/L2 team to improve the cycle time and efficiency of incident & service request resolution, including blameless post-mortems and problem records.
* Ensure service tickets and incidents are resolved within SLA and effectively passed on to product teams for L3/L4 support.
* Drive cloud compliance framework controls such as Annual DR and recovery testing, capacity management, etc.
* Improve the percentage of service tickets and incidents resolved by the team without escalation.
* Identify top reasons for service requests and incidents and address root causes to reduce ticket volume quarterly.
* Provide thought leadership in operational areas such as change and release management, capacity management, backup, and recovery.
* Ensure team members are appropriately skilled and identify candidates for transition from Ops roles to SRE.

Requirements:

* Solid understanding of the SRE role and principles.
* Experience with Azure and GCP, Kubernetes, container registries, and networking.
* Experience with CI/CD and infrastructure as code tools such as Terraform, GitHub, Azure DevOps, Jenkins, Chef.
* Experience leading an SRE or Operations team.
* Negotiation skills to influence technical and leadership decisions.
* Good understanding of public cloud security.
* Experience in a large, complex, highly regulated industry.
* Previous experience leading a team responsible for the public cloud estate.
* Azure or GCP Certifications are desirable.
* Experience handling risks and controls across technical platforms.
* Desire to learn and cross-skill.

What We Offer:

* Up to 15% pension contribution.
* 30% bonus.
* Hybrid working pattern.
* Private Healthcare.
* Access to Share Schemes.

If you are passionate about Platform Engineering/Site Reliability and want to be part of a dynamic team shaping the future of Technology, please send your CV for a confidential discussion.

Note: No Sponsorship is offered.

#J-18808-Ljbffr"}



  • London, Greater London, United Kingdom Insight Global Full time

    Site Reliability Engineer OpportunityInsight Global is seeking a skilled Site Reliability Engineer to join their team in West London. As a key member of the streaming engineering group, you will be responsible for ensuring the smooth operation of linear streaming channels.The ideal candidate will have previous experience in Site Reliability Engineering, with...


  • London, Greater London, United Kingdom Insight Global Full time

    Site Reliability Engineer OpportunityInsight Global is seeking a skilled Site Reliability Engineer to join their team in West London. As a key member of the streaming engineering group, you will be responsible for ensuring the smooth operation of linear streaming channels.The ideal candidate will have previous experience in Site Reliability Engineering, with...


  • London, Greater London, United Kingdom Lorien Full time

    Site Reliability Engineer / DevOps EngineerWe are seeking a skilled Site Reliability Engineer / DevOps Engineer to join our team at Lorien, a leading consultancy. The ideal candidate will have experience with Salesforce automation and a strong background in SRE / Site Reliability Engineering.Key Responsibilities:Design and implement scalable and efficient...


  • London, Greater London, United Kingdom Lorien Full time

    Site Reliability Engineer / DevOps EngineerWe are seeking a skilled Site Reliability Engineer / DevOps Engineer to join our team at Lorien, a leading consultancy. The ideal candidate will have experience with Salesforce automation and a strong background in SRE / Site Reliability Engineering.Key Responsibilities:Design and implement scalable and efficient...


  • London, Greater London, United Kingdom Apple Inc. Full time

    Site Reliability Engineering ManagerAre you passionate about building scalable and reliable systems? As a Site Reliability Engineering Manager at Apple Inc., you will lead a team of engineers responsible for designing, developing, and deploying high-performance systems that handle billions of queries every day.About the RoleWe are seeking a seasoned leader...


  • London, Greater London, United Kingdom Fourier Full time

    Key ResponsibilitiesWe are seeking a highly skilled Site Reliability Engineer to join our team at Fourier. As a member of our Site Reliability Engineering team, you will be responsible for developing tools to enhance and monitor our production systems.Required SkillsExcellent Python scripting skillsExperience with Version Control best practicesGood knowledge...


  • London, Greater London, United Kingdom Fourier Full time

    Key ResponsibilitiesWe are seeking a highly skilled Site Reliability Engineer to join our team at Fourier. As a member of our Site Reliability Engineering team, you will be responsible for developing tools to enhance and monitor our production systems.Required SkillsExcellent Python scripting skillsExperience with Version Control best practicesGood knowledge...


  • London, Greater London, United Kingdom Fourier Full time

    Key ResponsibilitiesWe are seeking a highly skilled Site Reliability Engineer to join our team at Fourier. As a member of our Site Reliability Engineering team, you will be responsible for developing tools to enhance and monitor our production systems.Required SkillsExcellent Python scripting skillsExperience with Version Control best practicesGood knowledge...


  • London, Greater London, United Kingdom Fourier Full time

    Key ResponsibilitiesWe are seeking a highly skilled Site Reliability Engineer to join our team at Fourier. As a member of our Site Reliability Engineering team, you will be responsible for developing tools to enhance and monitor our production systems.Required SkillsExcellent Python scripting skillsExperience with Version Control best practicesGood knowledge...


  • London, Greater London, United Kingdom Apple Inc. Full time

    Site Reliability Engineering Manager, Apple Services EngineeringAre you passionate about building scalable and reliable systems? As a Site Reliability Engineering Manager at Apple, you will lead a team of engineers in designing, developing, and deploying high-performance systems that handle billions of queries every day.About the RoleWe are looking for a...


  • London, Greater London, United Kingdom Apple Inc. Full time

    Site Reliability Engineering Manager, Apple Services EngineeringAre you passionate about building scalable and reliable systems? As a Site Reliability Engineering Manager at Apple, you will lead a team of engineers in designing, developing, and deploying high-performance systems that handle billions of queries every day.About the RoleWe are looking for a...


  • London, Greater London, United Kingdom Apollo Solutions Full time

    Job OverviewSite Reliability Engineering ManagerApollo Solutions is seeking a seasoned Site Reliability Engineering Manager to lead our team in ensuring the reliability and efficiency of our cloud-based services. As a key member of our Platform Engineering team, you will be responsible for driving the development and implementation of our cloud...


  • London, Greater London, United Kingdom Apple Full time

    About the RoleWe're seeking a seasoned Site Reliability Engineering Manager to lead our team of engineers responsible for the reliability and performance of our on-prem and cloud-based services. As a key member of our Apple Services Engineering team, you will be responsible for managing staging and production environments, promoting observability of systems,...


  • London, Greater London, United Kingdom LinuxRecruit Full time

    Unlock Your Potential as a Site Reliability EngineerWe are seeking a seasoned Site Reliability Engineer to join our team and contribute to the development and maintenance of our cutting-edge platform.As a Site Reliability Engineer, you will be responsible for designing, developing, and maintaining systems and applications using Golang. You will work closely...


  • London, Greater London, United Kingdom Alexander Ash Consulting Full time

    Unlock System Reliability and EfficiencyWe are seeking a skilled professional with expertise in Salesforce automation, Copado, Site Reliability Engineering, and Release Management. Key technical proficiency includes automation scripting, Azure DevOps, and experience with DevSecOps practices such as SAST/DAST.Key Responsibilities:Collaborate with Engineering...


  • London, Greater London, United Kingdom Alexander Ash Consulting Full time

    Unlock System Reliability and EfficiencyWe are seeking a skilled professional with expertise in Salesforce automation, Copado, Site Reliability Engineering, and Release Management. Key technical proficiency includes automation scripting, Azure DevOps, and experience with DevSecOps practices such as SAST/DAST.Key Responsibilities:Collaborate with Engineering...


  • London, Greater London, United Kingdom FactSet Full time

    Site Reliability EngineerAt FactSet, we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our infrastructure and applications.ResponsibilitiesCollaborate with cross-functional teams to design, implement, and maintain...


  • London, Greater London, United Kingdom MarkLogic Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at CMK Resources. As a Site Reliability Engineer, you will be responsible for ensuring the stability and performance of our infrastructure, with a focus on Portworx and Kubernetes environments.Key Responsibilities:Upgrade and manage Portworx...


  • London, Greater London, United Kingdom MarkLogic Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at CMK Resources. As a Site Reliability Engineer, you will be responsible for ensuring the stability and performance of our infrastructure, with a focus on Portworx and Kubernetes environments.Key Responsibilities:Upgrade and manage Portworx...


  • London, Greater London, United Kingdom MarkLogic Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at CMK Resources. As a Site Reliability Engineer, you will be responsible for ensuring the stability and performance of our infrastructure, with a focus on cloud platforms, virtualization tools, and automation tools.Key Responsibilities:Upgrade and...