Highly Skilled Site Reliability Engineering Leader

1 week ago


London, Greater London, United Kingdom Apple Inc. Full time

At Apple Inc., we're looking for a seasoned Site Reliability Engineering (SRE) manager to join our iCloud Services team.

About the Role

We're seeking an accomplished builder and leader of teams with a passion for SRE and a track record of delivering operational perfection at scale. As a key member of our SRE leadership team, you will shape the future of how we build and run our services on a global scale.

Responsibilities

  • Lead high-performing SRE teams responsible for ensuring the reliability and performance of our on-prem and cloud-based services.
  • Grow and develop engineers on your team, fostering a culture of innovation and excellence.
  • Develop and implement strategies to maximize availability in staging and production environments.
  • Promote observability and monitoring best practices across our systems.
  • Advocate for industry-leading reliability engineering practices.

Requirements

  • Proven experience leading large-scale distributed systems, including ML infrastructure and services like LLMs, Generative AI, and transformers.
  • Demonstrated success as a technical leader, ideally in SRE or Production Engineering.
  • Strong knowledge of core operating system principles, networking fundamentals, and systems management.
  • Familiarity with SRE principles, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts.

Preferred Qualifications

  • Experience in hiring and developing engineers.
  • Professional experience in an engineering leadership position.

Compensation and Benefits

We offer a competitive salary of approximately $175,000 per year, plus a comprehensive benefits package, including health insurance, retirement savings plan, and paid time off. Located in London, England, United Kingdom, this role offers the opportunity to work on cutting-edge technology and collaborate with a talented team of professionals.



  • London, Greater London, United Kingdom Oxford Knight Full time

    Site Reliability Engineer OpportunityOxford Knight is seeking a highly skilled Site Reliability Engineer to join our team and contribute to the development of innovative trading solutions. As a Site Reliability Engineer, you will play a crucial role in ensuring the smooth operation of our applications, providing early support for apps while being developed,...


  • London, Greater London, United Kingdom Google Full time

    About the RoleAt Google, we're looking for a talented Cloud Engineer and Site Reliability Leader to join our team. As a key member of our SRE organization, you'll be responsible for designing, building, and operating large-scale distributed systems that meet the high standards of reliability, scalability, and performance.We're seeking someone with 8+ years...


  • London, Greater London, United Kingdom Citigroup, Inc. Full time

    Citigroup, Inc. Chief Reliability Engineering LeaderAbout the Job:We are seeking a highly skilled Chief Reliability Engineering Leader to join our team at Citigroup, Inc. This is a full-time position based on a competitive salary of $200,000 per year.Job Description:The successful candidate will play a crucial role in driving operational excellence,...


  • London, Greater London, United Kingdom Rewardgateway Full time

    Engineering, LondonEarn a salary of £110,000 - £130,000 per year with Reward Gateway.We are seeking an experienced Site Reliability Engineer to lead our team and drive the transformation of our operational workloads to a Service Reliability Engineering (SRE) approach. The successful candidate will be responsible for establishing and managing our new SRE...


  • London, Greater London, United Kingdom Apple Inc. Full time

    Unlock the Future of Cloud ServicesAt Apple Inc., we're not just building products - we're crafting experiences that our customers love and depend on. Our Apple Services Engineering (ASE) team is responsible for the systems that make these daily experiences possible. If you've used Apple products, you've likely interacted with us. Our iCloud Services SRE...


  • London, Greater London, United Kingdom Randstad Staffing Full time

    Job Description:A highly skilled System Reliability Engineer with expertise in Java is required to join our team. This exciting role will see you play a critical part in ensuring the reliability, availability, and performance of applications or systems built using Java technologies.Key Responsibilities:Application Performance Monitoring & Optimization: Use...


  • London, Greater London, United Kingdom Fourier Full time

    Key ResponsibilitiesAs a Site Reliability Engineer at Fourier, you will be responsible for designing and implementing tools to enhance the reliability and resilience of our production systems. This includes investigating failures, improving system performance, and automating manual processes.Required SkillsExcellent Python scripting skillsExperience with...


  • London, Greater London, United Kingdom Techruiter Full time

    We are Techruiter, a pioneering technology company specializing in cutting-edge Language Models (LLM) and Machine Learning solutions. Our team is seeking a highly skilled Site Reliability Engineer to ensure the reliability, scalability, and performance of our LLM and Machine Learning infrastructure.About This RoleIn this key position, you will play a...


  • London, Greater London, United Kingdom Apple Inc. Full time

    Site Reliability Engineering Manager, AppleAt Apple, we're not just building products - we're crafting experiences our customers love and depend on. Our Apple Services Engineering (ASE) team builds and supports the systems that make many of these daily experiences possible. If you've used Apple products, you've likely interacted with us. Our iCloud Services...


  • London, Greater London, United Kingdom Mondrian Alpha Recruitment Solutions Full time

    At Mondrian Alpha Recruitment Solutions, we are seeking a highly skilled Site Reliability Engineer to join our team responsible for engineering and supporting the company's critical infrastructure platforms.This team handles the centralized development infrastructure and works alongside engineering teams across the business to ensure the optimal route of...


  • London, Greater London, United Kingdom Remotestar Full time

    Remotestar is seeking a Senior Site Reliability Engineering Manager to join our client's team in the UK. The client is building a B2B marketplace for diamonds, and we need someone to ensure the reliability, scalability, and performance of our infrastructure and services.The ideal candidate will have a strong track record of building and maintaining highly...


  • London, Greater London, United Kingdom Trade Nation Full time

    Site Reliability Engineer Job DescriptionAt Trade Nation, we're seeking a highly skilled Site Reliability Engineer to join our dynamic team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining scalable and reliable systems that ensure high availability and performance.Key ResponsibilitiesDesign and Implement...


  • London, Greater London, United Kingdom J Bandy Consulting Full time

    Job SummaryJ Bandy Consulting is seeking an experienced Site Reliability Engineer to join our team. The ideal candidate will have a strong background in software engineering and a passion for building scalable and reliable systems.Key ResponsibilitiesDevelop and implement automation tools to improve the efficiency of our systemsCollaborate with...


  • London, Greater London, United Kingdom College of Charleston Full time

    Transformative SRE Leadership OpportunityAre you a seasoned leader with a passion for strategy, leadership, and engineering excellence? Do you want to make a meaningful impact at a global financial institution? We're seeking a talented Site Reliability Engineering Manager to join our Operations and Technology Chief Information Office Business area.About the...


  • London, Greater London, United Kingdom Fourier Full time

    At Fourier, we are looking for a highly skilled DevOps engineer to join our Site Reliability Engineering team.The successful candidate will be responsible for developing tools and solutions that enhance the reliability and resilience of our production systems.We offer a competitive salary range of $120,000 - $180,000 per annum, depending on experience, to...


  • London, Greater London, United Kingdom Selby Jennings Full time

    About Selby JenningsWe're a leading global financial services firm where technologists and investment professionals collaborate to drive innovation and operational excellence.About the RoleAs a Site Reliability Engineer, you'll apply your expertise in software and systems engineering to design, build, and maintain our robust infrastructure. You'll reduce...


  • London, Greater London, United Kingdom BenevolentAI Full time

    About the Role:BenevolentAI is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a Senior Site Reliability Engineer, you will be responsible for designing and implementing software solutions for cloud infrastructure, improving long-term infrastructure availability and reliability, and monitoring and handling incident response of...


  • London, Greater London, United Kingdom GoCardless Full time

    The RoleGoCardless is looking for a Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining the infrastructure and systems that support our payment and open banking products.Key ResponsibilitiesDesign and implement scalable and efficient infrastructure solutionsDevelop...


  • London, Greater London, United Kingdom Hamilton Barnes Associates Limited Full time

    Job Title: Site Reliability EngineerHiring Company: Hamilton Barnes Associates LimitedWe are seeking a highly skilled and experienced Site Reliability Engineer to join our team on a 6-month contract basis. The selected candidate will be working with one of the largest technology companies globally, ensuring seamless database environment operations and...


  • London, Greater London, United Kingdom Remotestar Full time

    Remotestar is seeking a Senior Site Reliability Engineering Manager to join our client's team in the UK. The client is a leading B2B marketplace for diamonds, and we're looking for a seasoned expert to lead our infrastructure and services team.The ideal candidate will have a strong track record of building and maintaining highly reliable infrastructure and...