Site Reliability Engineer

3 weeks ago


London, United Kingdom Apple Inc. Full time
Site Reliability Engineer (SRE) - Object Storage

London, England, United Kingdom

Software and Services

People at Apple don’t just build products — they craft the kind of experience that have revolutionized entire industries. The diverse collection of our people and their ideas inspire innovation in everything we do. Imagine what you could do here Join Apple, and help us leave the world better than we found it.

Apple Cloud infrastructure is BIG. The storage SRE teams of Apple Cloud are building and running the next generation distributed storage systems to support Apple’s most critical services. Operating at our scale, across multiple geographically dispersed data centers, and servicing users with vast data needs presents unique challenges. As a Storage SRE at Apple, you'll need to solve these problems using your deep understanding of storage, data analysis, programming, teamwork, and expertise in Linux system internals.

Description

We are looking for seasoned software and systems engineers to join the Object Storage SRE team at Apple. The role involves a tremendous amount of individual responsibility and influence over the direction of the platform, shaping its use by many critical Apple Cloud services for years to come. You are solution-oriented and have a passion for software delivered as a service to improve reuse, efficiency, and simplicity. Your work will affect hundreds of millions of users and be essential to the success of some of the most visible current and future Apple features.

The role involves understanding the team's priorities; taking ownership of projects or deliverables; designing solutions and building buy-in for those designs; and successful delivery of those designs in order to meet the project goal. The role involves giving technical feedback to colleagues to assist them in the delivery of their designs, features, and projects, as well as driving technical standards across the two-site team in collaboration with other senior members of the team. The team has an on-call rota including weekends and the successful candidate should expect to handle alerts and other escalations in order to maintain a high level of availability and functionality for our provided services.

The team is divided into two shards in the UK and US and cross-timezone meetings are a core feature of how our team collaborates, reaches agreements, and executes to deliver projects. At Apple Cloud, we run a mix of open source, vendor licensed, and internally developed tools to perform functions such as system configuration management, provisioning, software development & deployment, logging, and monitoring. You'll learn these tools and have opportunities to improve them. We think critically and strive to balance the best solution with the need to get things done for each engineering challenge we face. Good ideas are heard and results are rewarded. The candidate may be expected to travel to other Apple locations from time to time e.g. the USA.

Minimum Qualifications

Key Qualifications

  • Experience in building, operating, and scaling distributed storage systems in a private, public, or hybrid cloud environment.
  • The ability to design, author, understand, and release code in languages like Go (preferred), Java, Python, or Rust.
  • Good understanding of block, object, and file storage solutions in Linux (such as LVM, XFS, ext4, S3, Ceph, Gluster, NFS).
  • Understanding of Linux internals, standard networking protocols, and distributed systems.
  • Experience with provisioning, data migration, backup & recovery, at-scale testing, disaster recovery, and capacity planning.

Preferred Qualifications

Education & Experience

Bachelor's degree in Computer Science or related field, or equivalent employment.

Additional Requirements

  • Acute drive to automate manual operations and to improve them through repeated iteration.
  • Awareness of best practices for deployment of storage systems - implication of physical and virtual deployment models to change management, failure domains, hardware lifecycle management, etc.
  • Hands-on experience managing large numbers of diverse systems with configuration management or software delivery platforms (such as Puppet, Chef, Ansible, and Spinnaker).
  • Experience with deploying, supporting and monitoring new and existing services, platforms, and application stacks.
  • Familiarity with microservices architecture and container orchestration with Kubernetes.
  • Familiarity with relational & non-relational databases (such as Cassandra, Postgres, & RocksDB).
#J-18808-Ljbffr

  • London, Greater London, United Kingdom SNAPLOGIC Full time

    Site Reliability Engineer JobWe are seeking a highly skilled Site Reliability Engineer to join our Infrastructure Engineering and Operations Team at SNAPLOGIC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based systems, as well as developing and implementing strategies to improve their...


  • London, United Kingdom Understanding Recruitment Full time

    **Site Reliability Engineer** We are seeking a Fully Remote Site Reliability Engineer to join a growing team making the internet safer by using AI and Machine Learning to detect dangerous online content. With founders from Oxbridge who have worked with the likes of Meta and even Stephen Hawking, the team are using tech for good to ensure that online...


  • London, United Kingdom Proactive Appointments Full time

    **Site Reliability Engineer** Inside IR35 - Hybrid working Our client, a global banking organisation have an exciting opportunity for a Site Reliability Engineer to join on an initial 6 month contract. You will be responsible for managing, maintaining, enhancing and strategic development of all Platform and Databaserelated technologies in EMEA as well as...


  • London, United Kingdom Lorien Full time

    Site Reliability Engineer Location: London (hybrid remote working) **Salary**: Up to £100,000 + Very Generous Benefits Package One of the fastest growing software development organisation requires a Site Reliability Engineer to help be the glue between the companies Dev, QA and Product teams - enabling the smooth Continuous Build and Integration of new...


  • London, United Kingdom CV-Library Full time

    Site Reliability Engineer (SRE)Location: London - Onsite roleType: Permanent roleSalary: £TBDI am working exclusively with a financial services organisation. We are looking for a Site Reliability Engineer (SRE) with a focus on Java, playing a critical role in ensuring the reliability, availability, and performance of applications or systems that are built...


  • London, United Kingdom Prism Digital Full time

    **Senior Site Reliability Engineer (SRE) | GCP/AWS | Market Intelligence Leaders** We have an exciting opportunity for a Senior Site Reliability Engineer (SRE) to join a global organisation involved in the market intelligence space. Our client's AI-powered platform provides businesses with world-class and real-time consumer analytics. They are looking for...


  • London, Greater London, United Kingdom Pioneer Selection Careers Full time

    Situated in Park Royal, London, we are looking for a Site Reliability Engineer to maintain site machinery.The ideal candidate will have:Engineering QualificationExperience with electrical and mechanical systemsFault finding skillsThis is a permanent position with career opportunities and benefits including pension, healthcare, and technical training.Salary:...


  • London, United Kingdom Digital Waffle Full time

    **Site Reliability Engineer** **Remote** **Up to £80k + Excellent Benefits**Digital Waffle is working in partnership with a truly exciting and innovative electronic company. This is a fantastic opportunity for a dynamic, self motivated and proven Site Reliability Engineer. **Job Purpose**: You will be responsible for helping develop & support distributed,...


  • London, UK, United Kingdom Selby Jennings Full time

    Site Reliability Engineer - Global Quant Hedge Fund - London (Remote) Our client is a global quantitative and systematic hedge fund that leverages software engineering, data engineering, and financial engineering to drive innovation in crypto trading. They are seeking a Site Reliability Engineer (SRE) with a background in crypto trading to play a key role in...


  • London, United Kingdom Deerfoot IT Resources Ltd Full time

    **Site Reliability Engineer - Linux / Kubernetes** **£90k - £95k + bonus** **Financial Services** **Flexible working with office based in Central London** There are two positions available, one focusing on Linux and one focusing more on Kubernetes technologies. **Key Responsibilities**: - Develop software to make infrastructure services self-managing...


  • London, Greater London, United Kingdom Google Full time

    We are seeking an experienced Site Reliability Systems Engineer to join our Site Reliability Engineering team at Google. In this role, you will be responsible for designing, building, and maintaining large-scale distributed systems that support Google's product portfolio.As a Site Reliability Systems Engineer, you will work closely with cross-functional...


  • London, United Kingdom Hays Specialist Recruitment Limited Full time

    Site Reliability Engineer £100k - £135k+ 10% bonus Hybrid-2 Days in the Office Site Reliability Engineer Prestigious Financial Service Firm | £100k - £135k+ 10% bonus | London |Hybrid Flexible working - 2 Days in the Office **Your new company** Fantastic opportunity to work in a global financial service firm specializing in Investment Banking, Asset...


  • London, United Kingdom X4 Group Full time

    A leading financial data analytics company are seeking an experienced and ambitious Senior Site Reliability Engineer to join their established team on a permanent basis, taking up a senior or leading role in the design, build, and continual improvement oftheir cloud based microservices systems. The Senior Site Reliability Engineer would be joining the site...


  • London, United Kingdom Leap29 Full time

    Job Title: Site Reliability EngineerSite Reliability Engineer is required for a European leader in cloud implementation, application development and managed services. You will have the opportunity to bring your expertise, passion, and creativity to create fantastic value and success for their portfolio of the world's leading accounting clients.Contract...


  • London, United Kingdom Prism Digital Full time

    **Site Reliability Engineer | GCP OR AWS & Kubernetes | SaaS HealthTech** The local headcount currently is 35 in Ireland and 45 in the UK (remote sys admins, tech engineers, field engineers, project managers, programme managers and sales) and expanding the UK office - feels like a start-up with start-up good energy. Our client is around 50% through their...


  • London, Greater London, United Kingdom KAG Recruitment Consultancy Full time

    About the RoleWe are partnering with a leading client in Westerleigh to find an exceptional Multi-Skilled Shift Engineer. As an integral member of the Engineering Team, you will ensure the best performance and dependability of Manufacturing equipment, contributing to both safety and efficiency across all site assets.Key ResponsibilitiesPerform both planned...


  • London, Greater London, United Kingdom Dabster Full time

    Dabster is a leading company in the field of [industry], and we are looking for a talented Site Reliability Engineer Leader to join our team. In this role, you will be responsible for ensuring the reliability and performance of our systems, while also collaborating with cross-functional teams to drive business growth.The ideal candidate will have a strong...


  • London, United Kingdom Arcus Search Full time

    Senior Site Reliability Engineer (SRE) Are you interested in shaping the future of infrastructure, automation, and reliability at a Leading Fintech? We’re on the lookout for a Senior Site Reliability Engineer who thrives on tackling complex challenges, building scalable systems, and leading the charge in creating a world-class engineering ecosystem....


  • London, United Kingdom Arcus Search Full time

    Senior Site Reliability Engineer (SRE) Are you interested in shaping the future of infrastructure, automation, and reliability at a Leading Fintech? We’re on the lookout for a Senior Site Reliability Engineer who thrives on tackling complex challenges, building scalable systems, and leading the charge in creating a world-class engineering ecosystem....


  • London, United Kingdom Arcus Search Full time

    Senior Site Reliability Engineer (SRE) Are you interested in shaping the future of infrastructure, automation, and reliability at a Leading Fintech? We’re on the lookout for a Senior Site Reliability Engineer who thrives on tackling complex challenges, building scalable systems, and leading the charge in creating a world-class engineering ecosystem....