Site Reliability Engineering Manager

7 days ago


London, Greater London, United Kingdom Google Full time
About the Role

As a Site Reliability Engineering Manager at Google, you will be responsible for leading a team of Software/Systems Engineers on projects that impact users globally. Your primary focus will be on ensuring the uptime and availability of key services, while also building automation to prevent problem recurrence.

You will be directly responsible for the performance and scalability of Google's services, and will design, write, and deliver software to improve their efficiency. This role requires strong technical leadership skills, as well as the ability to mentor and develop teams to achieve their full potential.

Key Responsibilities
  • Lead a team of Software/Systems Engineers on projects that impact users globally.
  • Own the availability and performance of key services, and build automation to prevent problem recurrence.
  • Design, write, and deliver software to improve the availability, scalability, latency, and efficiency of Google's services.
  • Manage on-call rotations across continents, using a follow-the-sun model.
  • Lead by example, mentor the team, and establish credibility through quality technical execution.
About the Team

The Site Reliability Engineering team at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. We ensure that Google's services have reliability, uptime appropriate to users' needs, and a fast rate of improvement.

We are a diverse team of individuals with a wide range of backgrounds, experiences, and perspectives. We encourage collaboration, intellectual curiosity, problem-solving, and openness, and strive to create an environment that provides the support and mentorship needed to learn and grow.

What We're Looking For
  • 8 years of experience with data structures or algorithms.
  • 5 years of experience with software development in one or more programming languages.
  • 3 years of experience with people management, designing, analyzing, and troubleshooting distributed systems.
  • Experience working in computing, distributed systems, storage, or networking.
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
  • Ability to debug, optimize code, and to automate routine tasks.
  • Excellent problem-solving approach, coupled with effective verbal and written communication skills.


  • London, Greater London, United Kingdom Google Inc. Full time

    {"h1": "Site Reliability Engineering Manager", "p": "At Google, we're building a team of talented Site Reliability Engineers to help us scale our services and ensure they're always available to our users. As a Site Reliability Engineering Manager, you'll be responsible for leading a team of engineers to design, build, and operate large-scale distributed...


  • London, Greater London, United Kingdom Google Inc. Full time

    {"h1": "Site Reliability Engineering Manager", "p": "At Google, we're building a team of talented Site Reliability Engineers to help us scale our services and ensure they're always available to our users. As a Site Reliability Engineering Manager, you'll be responsible for leading a team of engineers to design, build, and operate large-scale distributed...


  • London, Greater London, United Kingdom Opus Recruitment Solutions Full time

    Site Reliability Engineer | Remote | Competitive SalaryCloud Computing | DevOps | Google Cloud Platform | Amazon Web Services | Kubernetes | Infrastructure | SRE | ELK StackWe are collaborating with a dynamic online retail company seeking to enhance their technical team by adding a Site Reliability Engineer. This role focuses on managing the reliability and...


  • London, Greater London, United Kingdom J Bandy Consulting Full time

    {"h1": "Site Reliability Engineer", "p": "At J Bandy Consulting, we're seeking a skilled Site Reliability Engineer to join our team of experienced engineers. As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability and performance of our cloud-agnostic, micro-service network management platform.Your primary responsibilities...


  • London, Greater London, United Kingdom J Bandy Consulting Full time

    {"h1": "Site Reliability Engineer", "p": "At J Bandy Consulting, we're seeking a skilled Site Reliability Engineer to join our team of experienced engineers. As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability and performance of our cloud-agnostic, micro-service network management platform.Your primary responsibilities...


  • London, Greater London, United Kingdom Apple Inc. Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a key member of our team, you will play a critical role in supporting and scaling cloud services for thousands of development and operations engineers.Key ResponsibilitiesCloud Service Maintenance: Automate deployment and orchestration of...


  • London, Greater London, United Kingdom Apple Inc. Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a key member of our team, you will play a critical role in supporting and scaling cloud services for thousands of development and operations engineers.Key ResponsibilitiesAutomate Deployment and Orchestration: Automate the deployment and...


  • London, Greater London, United Kingdom Apple Inc. Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a key member of our team, you will play a critical role in supporting and scaling cloud services for thousands of development and operations engineers.Key ResponsibilitiesAutomate Deployment and Orchestration: Automate the deployment and...


  • London, Greater London, United Kingdom FactSet Full time

    Site Reliability EngineerAt FactSet, we're seeking a skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you'll play a critical role in ensuring the reliability and performance of our systems.ResponsibilitiesCollaborate with cross-functional teams to design, implement, and maintain highly available and scalable...


  • London, Greater London, United Kingdom FactSet Full time

    Site Reliability EngineerAt FactSet, we're seeking a skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you'll play a critical role in ensuring the reliability and performance of our systems.ResponsibilitiesCollaborate with cross-functional teams to design, implement, and maintain highly available and scalable...


  • London, Greater London, United Kingdom Mondrian Alpha Full time

    About Mondrian AlphaMondrian Alpha is a renowned hedge fund with a global presence, seeking a seasoned Site Reliability Engineer to join their London team.Job SummaryWe are looking for a highly skilled Site Reliability Engineer to play a pivotal role in maintaining the technology infrastructure that drives our operations. As part of this team, you will be...


  • London, Greater London, United Kingdom Mondrian Alpha Full time

    About Mondrian AlphaMondrian Alpha is a renowned hedge fund with a global presence, seeking a seasoned Site Reliability Engineer to join their London team.Job SummaryWe are looking for a highly skilled Site Reliability Engineer to play a pivotal role in maintaining the technology infrastructure that drives our operations. As part of this team, you will be...


  • London, Greater London, United Kingdom Mondrian Alpha Full time

    About Mondrian AlphaMondrian Alpha is a renowned hedge fund with a global presence, seeking a seasoned Site Reliability Engineer to join their London team.Job SummaryWe are looking for a highly skilled Site Reliability Engineer to play a pivotal role in maintaining the technology infrastructure that drives our operations, directly contributing to our...


  • London, Greater London, United Kingdom Mondrian Alpha Full time

    About Mondrian AlphaMondrian Alpha is a renowned hedge fund with a global presence, seeking a seasoned Site Reliability Engineer to join their London team.Job SummaryWe are looking for a highly skilled Site Reliability Engineer to play a pivotal role in maintaining the technology infrastructure that drives our operations, directly contributing to our...


  • London, Greater London, United Kingdom Fourier Full time

    Key ResponsibilitiesWe are seeking a skilled Site Reliability Engineer to join our team at Fourier. As a member of our Site Reliability Engineering team, you will be responsible for developing tools to enhance and monitor our production systems.Key SkillsExcellent Python scripting skillsExperience with Version Control best practicesGood knowledge of...


  • London, Greater London, United Kingdom Fourier Full time

    Key ResponsibilitiesWe are seeking a skilled Site Reliability Engineer to join our team at Fourier. As a member of our Site Reliability Engineering team, you will be responsible for developing tools to enhance and monitor our production systems.Key SkillsExcellent Python scripting skillsExperience with Version Control best practicesGood knowledge of...


  • London, Greater London, United Kingdom Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our services.Key ResponsibilitiesDesign, implement, and maintain large-scale distributed systems and...


  • London, Greater London, United Kingdom Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our services.Key ResponsibilitiesDesign, implement, and maintain large-scale distributed systems and...


  • London, Greater London, United Kingdom Experian Full time

    Job Opportunity for a Skilled Site Reliability EngineerWe are seeking a highly skilled and driven Site Reliability Engineer to join our dedicated team at Experian Data Quality in London, with a flexible working arrangement.As a key member reporting to the QA Director, you will be responsible for ensuring the dependability, efficiency, and scalability of our...


  • London, Greater London, United Kingdom Apollo Solutions Full time

    Job Title: Site Reliability Engineering ManagerCompany: Apollo SolutionsLocation: Hybrid - 2 days per week onsiteSalary: Up to £120kBenefits: Excellent Benefits + 30% Bonus + Stock OptionsJob Summary:Apollo Solutions is seeking a highly skilled Site Reliability Engineering Manager to lead our team in ensuring the reliability and efficiency of our...