Site Reliability Engineering manager

2 weeks ago


London, United Kingdom Apple Inc. Full time

Site Reliability Engineering (SRE) Manager, iCloud
People at Apple don’t just build products — they craft experiences our customers love and depend on. Apple Services Engineering (ASE) builds and supports the systems that make many of these daily experiences possible. If you’ve used Apple products, you’ve likely interacted with us. iCloud Services SRE teams are responsible for the systems and services that directly support those customers and their experiences. We focus on availability and automation of key services that run iCloud every minute of every day all around the world.
Key Qualifications
Experience with large scale distributed systems, especially ML infrastructure and services including LLMs, Generative AI, and transformers
Demonstrable success leading engineering teams - ideally SRE or Production Engineering
Knowledge of core operating system principles, networking fundamentals, and systems management
Understanding of SRE principals, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts
Experience with hiring and leading engineers
Professional experience in an engineering leadership position
Description We're looking for a hardworking and passionate person to join this amazing team. You will be an accomplished builder and leader of teams looking to tackle your next challenge. You know SRE and you know what it will take to run services at Apple scale with a high degree of operational perfection. This role will position you to help shape the future of how we build and run our services on a global scale. You will have the technical skills to go deep and retain the ability to focus on higher-level business and product goals. We hire high quality leaders and engineers with a diverse set of experiences and skill sets for positions on Apple. Our customers count on us to provide extraordinary availability, scalability, and security for services. If you’d like to positively influence millions of customers’ experience of Apple this is the job for you.As a Site Reliability Engineering Manager, responsibilities include:Lead SRE teams responsible for reliability and performance of on-prem and cloud-based servicesLeading and growing the engineers on your teamManage staging and production environments with goal of maximizing availabilityPromote observability of systems for monitoring, alerting, and metrics reportingAdvocate best practices of reliability engineering
Education & Experience
Bachelors or Masters degree in computer science or equivalent field.
#J-18808-Ljbffr



  • London, United Kingdom TEKsystems Full time

    Site Reliability Engineer / SRE Description: My global client is looking for a Site Reliability Engineer / SRE to join their growing team who must have strong experience working within the financial services industry on large complex projects. To be successful in this Site Reliability / SRE project you will need expert experience within: AWS ...

  • Engineering Manager

    2 weeks ago


    London, United Kingdom Nominet Full time

    Engineering Manager Site Reliability Engineering Contract Type: Permanent Location: Hybrid (minimum 20% on-site in our London Shoreditch office) Were proud to be an Equal Opportunity and Affirmative Action Employer, and were committed to building an inclusive, diverse community that celebrates and welcomes everyone. If there are any adjustments...


  • London, United Kingdom N Consulting Ltd Full time

    Job title: Site Reliability EngineerWork Mode: 3 days office MandatoryLocation: 5 Broadgate, London EC2M 2QS, United KingdomContract Duration: 12 monthsWe’re looking for a Site Reliability Engineer to:· determine the reliability of our digital products, technology services, and the infrastructure that underpins them· minimize the risk and impact of...


  • London, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer Check all associated application documentation thoroughly before clicking on the apply button at the bottom of this description.I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability,...


  • London, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users. The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and...


  • London, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and currently...


  • London, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer Check all associated application documentation thoroughly before clicking on the apply button at the bottom of this description.I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability,...


  • London, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users. The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and...


  • London, United Kingdom Understanding Recruitment Full time

    Job Description Site Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users. The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and...


  • London, United Kingdom Understanding Recruitment Full time

    Job DescriptionSite Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance,...


  • London, United Kingdom Experian Full time

    Job Description Work that matters – what you’ll be doing We’re looking for a Site Reliability Engineer to join our Experian Data Quality team where you will be working on cutting edge products within our Aperture suite (Data Studio and Data Governance). This role has aspects of both reliability engineering (SRE) and test engineering (SDET)....


  • London, United Kingdom McGregor Boyall Full time

    **Permanent role** **£70k - £120k per annum (+ package)** **SPONSORSHIP - AVAILABLE** **Location - Central London (hybrid working model)** **The Company** A Fortune 500 company based in Central London. **The Role** As a**Site Reliability Engineer**you will collaborate with product development teams. You will be instrumental providing engineering...


  • London Area, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and currently...


  • London Area, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and currently...


  • London Area, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users. The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and...


  • London Area, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and currently...


  • London, United Kingdom eFinancialCareers Full time

    Join us as a Senior Site Reliability Engineer - We'll look to you to establish and run a SRE function to help design, build, deliver and run highly reliable, scalable and secure software systems - This is a great opportunity to hone your existing engineering skills and advance your career in this critical role **What you'll do** As a Senior Site Reliability...


  • London, United Kingdom Alexander Ash Consulting Full time

    DevOps/Site Reliability Engineer - Global Quantitative Investment Management Permanent/Contract - Global Offices - Competitive   We are seeking a highly skilled and motivated Site Reliability Engineers (SRE) and DevOps Engineers to join a leading quantitative research and technology firm specializing in leveraging innovative data science and cutting-edge...


  • London, United Kingdom Neo4j Inc Full time

    The Role: The Site Reliability Engineering team’s mission is to improve the overall reliability of Neo4j’s DBaaS product: Neo4j Aura. Our product operates at scale and spans all 3 major cloud providers, with hundreds of Kubernetes clusters running in production. Until recently, the SRE function at Neo4j Aura achieved this by filling the shoes of a...


  • London, United Kingdom Cameron Connect Ltd Full time

    Join Our Clients Dynamic Mortgages Team at the Heart of Technological Innovation! Are you an experienced Java or C# engineer with a passion for building and maintaining reliable, high-performing systems? Do you thrive in roles where you can make a significant impact on the availability, performance, and efficiency of critical services? These opportunities...