Site Reliability Engineering Manager

7 days ago


London, Greater London, United Kingdom Google Full time

About the Role

As a Site Reliability Engineering Manager at Google, you will be responsible for leading a team of Software/Systems Engineers on projects that impact users globally. Your primary focus will be on ensuring the uptime and availability of key services, while also building automation to prevent problem recurrence.

You will be directly responsible for the performance and scalability of Google's services, and will design, write, and deliver software to improve their efficiency. This role requires strong technical leadership skills, as well as the ability to mentor and develop teams to achieve their full potential.

Key Responsibilities
  • Lead a team of Software/Systems Engineers on projects that impact users globally.
  • Own the availability and performance of key services, and build automation to prevent problem recurrence.
  • Design, write, and deliver software to improve the availability, scalability, latency, and efficiency of Google's services.
  • Manage on-call rotations across continents, using a follow-the-sun model.
  • Lead by example, mentor the team, and establish credibility through quality technical execution.

About the Team

The Site Reliability Engineering team at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. We ensure that Google's services have reliability, uptime appropriate to users' needs, and a fast rate of improvement.

We are a diverse team of individuals with a wide range of backgrounds, experiences, and perspectives. We encourage collaboration, intellectual curiosity, problem-solving, and openness, and strive to create an environment that provides the support and mentorship needed to learn and grow.

What We're Looking For
  • 8 years of experience with data structures or algorithms.
  • 5 years of experience with software development in one or more programming languages.
  • 3 years of experience with people management, and with designing, analyzing, and troubleshooting distributed systems.
  • Experience working in computing, distributed systems, storage, or networking.
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
  • Ability to debug and optimize code, and to automate routine tasks.
  • Excellent problem-solving approach, coupled with effective verbal and written communication skills.


  • London, Greater London, United Kingdom Opus Recruitment Solutions Full time

    Site Reliability Engineer | Remote | Competitive Salary. Cloud Computing | DevOps | Google Cloud Platform | Amazon Web Services | Kubernetes | Infrastructure | SRE | ELK Stack. We are collaborating with a dynamic online retail company seeking to enhance their technical team by adding a Site Reliability Engineer. This role focuses on managing the reliability and...


  • London, Greater London, United Kingdom Apple Inc. Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a key member of our team, you will play a critical role in supporting and scaling cloud services for thousands of development and operations engineers. Key Responsibilities: Cloud Service Maintenance: Automate deployment and orchestration of...


  • London, Greater London, United Kingdom Apple Inc. Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a key member of our team, you will play a critical role in supporting and scaling cloud services for thousands of development and operations engineers. Key Responsibilities: Automate Deployment and Orchestration: Automate the deployment and...


  • London, Greater London, United Kingdom Mondrian Alpha Full time

    About Mondrian Alpha: Mondrian Alpha is a renowned hedge fund with a global presence, seeking a seasoned Site Reliability Engineer to join their London team. Job Summary: We are looking for a highly skilled Site Reliability Engineer to play a pivotal role in maintaining the technology infrastructure that drives our operations, directly contributing to our...


  • London, Greater London, United Kingdom Apple Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our services. Key Responsibilities: Design, implement, and maintain large-scale distributed systems and...


  • London, Greater London, United Kingdom Experian Full time

    Job Opportunity for a Skilled Site Reliability Engineer. We are seeking a highly skilled and driven Site Reliability Engineer to join our dedicated team at Experian Data Quality in London, with a flexible working arrangement. As a key member reporting to the QA Director, you will be responsible for ensuring the dependability, efficiency, and scalability of our...


  • London, Greater London, United Kingdom Trust In SODA Full time

    Job Overview. Position: Site Reliability Engineering Manager. Industry: InsurTech. Location: Remote. Salary: £75,000 - £85,000. Benefits: Bonus, Equity Options, Comprehensive Health Coverage, Learning & Development Fund, 25 Days Annual Leave, Flexibility for International Work. Are you eager to join a fast-growing InsurTech firm that is transforming the Premium...


  • London, Greater London, United Kingdom Citi Full time

    Job Summary: Citi is seeking a highly skilled Site Reliability Engineering Manager to lead our SRE team in delivering high-quality software solutions that meet the needs of our customers. As a key member of our engineering organization, you will be responsible for improving the productivity of engineers, creating effective reporting mechanisms, and ensuring...


  • London, Greater London, United Kingdom Canonical Full time

    Site Reliability Engineering Manager. We are seeking an experienced professional to oversee a dedicated team within a prominent software organization, where you will drive excellence in a vibrant and rapidly evolving atmosphere. Key Responsibilities: Lead and mentor a group of operations engineers; engage with internal stakeholders and clients; establish and enforce...


  • London, Greater London, United Kingdom Canonical Full time

    Site Reliability Engineering Manager. We are seeking an experienced leader to oversee a talented team in a vibrant and challenging environment within a prominent software organization. Key Responsibilities: Lead and mentor a group of operations engineers; work collaboratively with various internal teams and clients; establish and refine engineering and operational...


  • London, Greater London, United Kingdom WeAreTechWomen Full time

    About the Role: We are seeking a skilled Site Reliability Engineering Specialist to join our team at WeAreTechWomen. As a Site Reliability Engineer, you will play a critical role in ensuring the resilience and reliability of our firm's most critical platform services. Key Responsibilities: Collaborate with our businesses to build and run resilient and reliable...


  • London, Greater London, United Kingdom WeAreTechWomen Full time

    About the Role: We are seeking a skilled Site Reliability Engineering Specialist to join our team at WeAreTechWomen. As a Site Reliability Engineer, you will be responsible for ensuring the resilience and reliability of our firm's critical platform services. Key Responsibilities: Collaborate with our businesses to build and run resilient and reliable production...


  • London, Greater London, United Kingdom Canonical Full time

    Site Reliability Engineering Manager. We are seeking an experienced leader to oversee a dedicated team within a prominent software organization, operating in a vibrant and rapidly evolving landscape. Key Responsibilities: Direct and mentor a group of operations specialists; engage with internal stakeholders and clients; establish and refine engineering and...


  • London, Greater London, United Kingdom FactSet Research Systems Full time

    Job Description: We are seeking a highly skilled and motivated Site Reliability Engineer to join our growing team at FactSet Research Systems. As an SRE, you will play a critical role in ensuring the reliability, scalability, and performance of our software systems and infrastructure. Key Responsibilities: Collaborate with cross-functional teams to define,...