Site Reliability Engineer
3 weeks ago
London, England, United Kingdom
Software and Services
People at Apple don’t just build products; they craft the kind of experiences that have revolutionized entire industries. The diverse collection of our people and their ideas inspires innovation in everything we do. Imagine what you could do here. Join Apple, and help us leave the world better than we found it.
Apple Cloud infrastructure is BIG. The storage SRE teams of Apple Cloud are building and running the next generation distributed storage systems to support Apple’s most critical services. Operating at our scale, across multiple geographically dispersed data centers, and servicing users with vast data needs presents unique challenges. As a Storage SRE at Apple, you'll need to solve these problems using your deep understanding of storage, data analysis, programming, teamwork, and expertise in Linux system internals.
Description
We are looking for seasoned software and systems engineers to join the Object Storage SRE team at Apple. The role involves a tremendous amount of individual responsibility and influence over the direction of the platform, shaping its use by many critical Apple Cloud services for years to come. You are solution-oriented and have a passion for software delivered as a service to improve reuse, efficiency, and simplicity. Your work will affect hundreds of millions of users and be essential to the success of some of the most visible current and future Apple features.
The role involves understanding the team's priorities; taking ownership of projects or deliverables; designing solutions and building buy-in for those designs; and successfully delivering those designs to meet the project goal. It also involves giving technical feedback to colleagues to assist them in the delivery of their designs, features, and projects, as well as driving technical standards across the two-site team in collaboration with other senior members of the team. The team has an on-call rota that includes weekends, and the successful candidate should expect to handle alerts and other escalations in order to maintain a high level of availability and functionality for our provided services.
The team is divided into two shards in the UK and US and cross-timezone meetings are a core feature of how our team collaborates, reaches agreements, and executes to deliver projects. At Apple Cloud, we run a mix of open source, vendor licensed, and internally developed tools to perform functions such as system configuration management, provisioning, software development & deployment, logging, and monitoring. You'll learn these tools and have opportunities to improve them. We think critically and strive to balance the best solution with the need to get things done for each engineering challenge we face. Good ideas are heard and results are rewarded. The candidate may be expected to travel to other Apple locations from time to time e.g. the USA.
Minimum Qualifications
Key Qualifications
- Experience in building, operating, and scaling distributed storage systems in a private, public, or hybrid cloud environment.
- The ability to design, author, understand, and release code in languages like Go (preferred), Java, Python, or Rust.
- Good understanding of block, object, and file storage solutions in Linux (such as LVM, XFS, ext4, S3, Ceph, Gluster, NFS).
- Understanding of Linux internals, standard networking protocols, and distributed systems.
- Experience with provisioning, data migration, backup & recovery, at-scale testing, disaster recovery, and capacity planning.
Preferred Qualifications
Education & Experience
Bachelor's degree in Computer Science or related field, or equivalent employment.
Additional Requirements
- Acute drive to automate manual operations and to improve them through repeated iteration.
- Awareness of best practices for deploying storage systems, including the implications of physical and virtual deployment models for change management, failure domains, hardware lifecycle management, etc.
- Hands-on experience managing large numbers of diverse systems with configuration management or software delivery platforms (such as Puppet, Chef, Ansible, and Spinnaker).
- Experience with deploying, supporting and monitoring new and existing services, platforms, and application stacks.
- Familiarity with microservices architecture and container orchestration with Kubernetes.
- Familiarity with relational & non-relational databases (such as Cassandra, Postgres, & RocksDB).
-
Site Reliability Engineer
3 weeks ago
London, Greater London, United Kingdom SNAPLOGIC Full time Site Reliability Engineer Job: We are seeking a highly skilled Site Reliability Engineer to join our Infrastructure Engineering and Operations Team at SNAPLOGIC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based systems, as well as developing and implementing strategies to improve their...
-
Site Reliability Engineer
1 week ago
London, United Kingdom Understanding Recruitment Full time **Site Reliability Engineer** We are seeking a Fully Remote Site Reliability Engineer to join a growing team making the internet safer by using AI and Machine Learning to detect dangerous online content. With founders from Oxbridge who have worked with the likes of Meta and even Stephen Hawking, the team are using tech for good to ensure that online...
-
Site Reliability Engineer
7 days ago
London, United Kingdom Proactive Appointments Full time **Site Reliability Engineer** Inside IR35 - Hybrid working Our client, a global banking organisation, has an exciting opportunity for a Site Reliability Engineer to join on an initial 6 month contract. You will be responsible for the management, maintenance, enhancement and strategic development of all Platform and Database-related technologies in EMEA as well as...
-
Site Reliability Engineer
1 day ago
London, United Kingdom Lorien Full time Site Reliability Engineer Location: London (hybrid remote working) **Salary**: Up to £100,000 + Very Generous Benefits Package One of the fastest growing software development organisations requires a Site Reliability Engineer to help be the glue between the company's Dev, QA and Product teams - enabling the smooth Continuous Build and Integration of new...
-
Site Reliability Engineer
4 weeks ago
London, United Kingdom CV-Library Full time Site Reliability Engineer (SRE) Location: London - Onsite role Type: Permanent role Salary: £TBD I am working exclusively with a financial services organisation. We are looking for a Site Reliability Engineer (SRE) with a focus on Java, playing a critical role in ensuring the reliability, availability, and performance of applications or systems that are built...
-
Site Reliability Engineer
6 days ago
London, United Kingdom Prism Digital Full time **Senior Site Reliability Engineer (SRE) | GCP/AWS | Market Intelligence Leaders** We have an exciting opportunity for a Senior Site Reliability Engineer (SRE) to join a global organisation involved in the market intelligence space. Our client's AI-powered platform provides businesses with world-class and real-time consumer analytics. They are looking for...
-
Site Reliability Engineer
2 weeks ago
London, Greater London, United Kingdom Pioneer Selection Careers Full time Situated in Park Royal, London, we are looking for a Site Reliability Engineer to maintain site machinery. The ideal candidate will have: Engineering Qualification; Experience with electrical and mechanical systems; Fault finding skills. This is a permanent position with career opportunities and benefits including pension, healthcare, and technical training. Salary:...
-
Site Reliability Engineer
3 days ago
London, United Kingdom Digital Waffle Full time **Site Reliability Engineer** **Remote** **Up to £80k + Excellent Benefits** Digital Waffle is working in partnership with a truly exciting and innovative electronic company. This is a fantastic opportunity for a dynamic, self-motivated and proven Site Reliability Engineer. **Job Purpose**: You will be responsible for helping develop & support distributed,...
-
Site Reliability Engineer
1 week ago
London, UK, United Kingdom Selby Jennings Full time Site Reliability Engineer - Global Quant Hedge Fund - London (Remote) Our client is a global quantitative and systematic hedge fund that leverages software engineering, data engineering, and financial engineering to drive innovation in crypto trading. They are seeking a Site Reliability Engineer (SRE) with a background in crypto trading to play a key role in...
-
Site Reliability Engineer
5 days ago
London, United Kingdom Deerfoot IT Resources Ltd Full time **Site Reliability Engineer - Linux / Kubernetes** **£90k - £95k + bonus** **Financial Services** **Flexible working with office based in Central London** There are two positions available, one focusing on Linux and one focusing more on Kubernetes technologies. **Key Responsibilities**: - Develop software to make infrastructure services self-managing...
-
Site Reliability Systems Engineer
3 weeks ago
London, Greater London, United Kingdom Google Full time We are seeking an experienced Site Reliability Systems Engineer to join our Site Reliability Engineering team at Google. In this role, you will be responsible for designing, building, and maintaining large-scale distributed systems that support Google's product portfolio. As a Site Reliability Systems Engineer, you will work closely with cross-functional...
-
Site Reliability Engineer
2 weeks ago
London, United Kingdom Hays Specialist Recruitment Limited Full time Site Reliability Engineer £100k - £135k + 10% bonus Hybrid - 2 Days in the Office Site Reliability Engineer Prestigious Financial Service Firm | £100k - £135k + 10% bonus | London | Hybrid Flexible working - 2 Days in the Office **Your new company** Fantastic opportunity to work in a global financial service firm specializing in Investment Banking, Asset...
-
Senior Site Reliability Engineer
2 weeks ago
London, United Kingdom X4 Group Full time A leading financial data analytics company is seeking an experienced and ambitious Senior Site Reliability Engineer to join their established team on a permanent basis, taking up a senior or leading role in the design, build, and continual improvement of their cloud-based microservices systems. The Senior Site Reliability Engineer would be joining the site...
-
Site Reliability Engineer.
4 weeks ago
London, United Kingdom Leap29 Full time Job Title: Site Reliability Engineer. A Site Reliability Engineer is required for a European leader in cloud implementation, application development and managed services. You will have the opportunity to bring your expertise, passion, and creativity to create fantastic value and success for their portfolio of the world's leading accounting clients. Contract...
-
Site Reliability Engineer
1 day ago
London, United Kingdom Prism Digital Full time **Site Reliability Engineer | GCP OR AWS & Kubernetes | SaaS HealthTech** The local headcount is currently 35 in Ireland and 45 in the UK (remote sys admins, tech engineers, field engineers, project managers, programme managers and sales), and the UK office is expanding - it feels like a start-up with good start-up energy. Our client is around 50% through their...
-
Site Reliability Engineer
2 days ago
London, Greater London, United Kingdom KAG Recruitment Consultancy Full time About the Role: We are partnering with a leading client in Westerleigh to find an exceptional Multi-Skilled Shift Engineer. As an integral member of the Engineering Team, you will ensure the best performance and dependability of Manufacturing equipment, contributing to both safety and efficiency across all site assets. Key Responsibilities: Perform both planned...
-
Site Reliability Engineer Leader
1 month ago
London, Greater London, United Kingdom Dabster Full time Dabster is a leading company in the field of [industry], and we are looking for a talented Site Reliability Engineer Leader to join our team. In this role, you will be responsible for ensuring the reliability and performance of our systems, while also collaborating with cross-functional teams to drive business growth. The ideal candidate will have a strong...
-
Site Reliability Engineer
1 month ago
London, United Kingdom Arcus Search Full time Senior Site Reliability Engineer (SRE) Are you interested in shaping the future of infrastructure, automation, and reliability at a Leading Fintech? We’re on the lookout for a Senior Site Reliability Engineer who thrives on tackling complex challenges, building scalable systems, and leading the charge in creating a world-class engineering ecosystem....