Reliability Engineering Expert for Cloud Infrastructure

1 week ago


London, Greater London, United Kingdom Apple Inc. Full time

We are seeking an experienced Reliability Engineer to join our team at Apple Inc. in a challenging role that combines engineering, software development, and problem-solving skills.

About the Role

This is an exciting opportunity to work on high-availability systems, scalable architecture, and monitoring tools to ensure seamless operations of our cloud infrastructure. The ideal candidate will have strong experience in DevOps practices, cloud computing environments like OpenStack, AWS, GCP or Azure, and scripting languages such as Python or Go.

Responsibilities

  1. Design, implement, and maintain large-scale production environments with high availability and scalability.
  2. Pioneer and implement telemetry systems for AIS services to monitor performance and detect issues early.
  3. Collaborate with global security teams to establish alert handling procedures, runbooks, and automate deployment processes.
  4. Participate in capacity planning and disaster recovery exercises to ensure business continuity.
  5. Work closely with partner teams across the enterprise to deliver solutions efficiently.

Requirements

  • Bachelor's Degree in Computer Science or equivalent experience.
  • 5+ years of experience in Site Reliability Engineering, DevOps, or a related field.
  • Strong programming skills: Python and/or Go.
  • Experience working with cloud compute environments like OpenStack, AWS, GCP or Azure.
  • Experience with infrastructure as code (IaC), configuration management, CI/CD, and automation.

Preferred Qualifications

  • Proficiency in implementing and coordinating telemetry using monitoring and observability tools.
  • Extensive experience administering and troubleshooting Linux systems, including standard Linux utilities.
  • Troubleshooting and debugging experience.
  • Shell scripting and system administration skills.
  • Measuring, analyzing, and optimizing performance experience.

What We Offer

We offer a competitive salary of $160,000 - $220,000 per year, depending on experience, plus benefits including health insurance, retirement plans, and paid time off. Located in Cupertino, CA.



  • London, Greater London, United Kingdom Asian Infrastructure Investment Bank Full time

    Job OverviewThe Asian Infrastructure Investment Bank seeks a highly skilled Cloud Infrastructure Management Expert to join our team.About the RoleThis is an exciting opportunity to work in a dynamic environment where you will be responsible for managing and maintaining the organization's cloud infrastructure, ensuring scalability, security, and...


  • London, Greater London, United Kingdom InfraView - Specialist Cloud & IT Infrastructure Technology Recruitment Full time

    Cloud Infrastructure Expert - Join InfraView to work on a Cloud Platform project with a leading Solution Provider.Our client is an insurance company who are currently working with a leading Solution Provider to help them with their Cloud Platform. The provider is running a multi-cloud environment – combining Google, Entra ID, and M365. This will be a...


  • London, Greater London, United Kingdom JP Engineering Full time

    In this challenging and rewarding role, you will be responsible for ensuring the smooth operation of our cloud-based infrastructure. As a Cloud Infrastructure Engineer, you will design, implement, and maintain scalable and secure cloud architectures, ensuring high availability and performance. You will work closely with our development team to identify and...


  • London, Greater London, United Kingdom Photon Full time

    Job Title: Cloud Infrastructure ExpertAbout the Role:We are seeking a skilled Cloud Infrastructure Expert with expertise in Terraform, Golang development, AWS, SDLC automation, and Kubernetes to join our team at Photon. As a Cloud Infrastructure Expert, you will play a crucial role in designing, building, and maintaining our infrastructure and automation...


  • London, Greater London, United Kingdom RVU Full time

    Cloud Native Platform ExpertWe are seeking a skilled Cloud Native Platform Expert to join our team at RVU. The ideal candidate will have extensive experience in running Kubernetes clusters in production, knowledge of Golang, Helm, and Terraform, and a strong understanding of cloud native technologies.As a Cloud Native Platform Expert, you will be responsible...


  • London, Greater London, United Kingdom InfraView - Specialist Cloud & IT Infrastructure Technology Recruitment Full time

    Job Role: Cloud Solutions ExpertWork with a leading solution provider to support a major insurance company with their cloud platform. As a key member of the team, you will be responsible for delivering high-quality cloud solutions and contributing to the development of the company's cloud strategy.Key Responsibilities:Collaborate with the client's team to...


  • London, Greater London, United Kingdom GoCardless Full time

    About the RoleWe are seeking an experienced Cloud Reliability Engineer to join our distributed team at GoCardless. As a key member of our engineering team, you will be responsible for designing and implementing scalable and reliable infrastructure solutions.With a strong interest in infrastructure management and site reliability engineering, you will...


  • London, Greater London, United Kingdom Cloud Decisions Full time

    Cloud Infrastructure SpecialistCloud Decisions is seeking a highly skilled Cloud Infrastructure Specialist to join their team. The ideal candidate will have a strong background in cloud infrastructure, with expertise in Microsoft Cloud (Azure, M365, AVD), Server Infrastructure, and Hyper-Converged Infrastructure environments.Key Responsibilities:Act as the...


  • London, Greater London, United Kingdom STAND 8 Full time

    Job SummaryWe are seeking an experienced Site Reliability Engineer to join our team at STAND 8. As a Site Reliability Engineer, you will be responsible for maintaining existing systems, working on infrastructure modernization, and supporting the streaming engineering team to ensure smooth operation of linear streaming channels.Key ResponsibilitiesMaintain...


  • London, Greater London, United Kingdom Curve Full time

    About the RoleWe are seeking a skilled Cloud and Infrastructure Specialist to join our team at Curve. As an integral part of our Platform and Engineering team, you will be responsible for designing, building, and maintaining scalable infrastructure solutions to support our growing business.In this role, you will work closely with our engineering teams to...


  • London, Greater London, United Kingdom BenevolentAI Full time

    Job Title:Cloud Infrastructure EngineerAbout the Role:We are seeking a highly skilled Cloud Infrastructure Engineer to join our team at BenevolentAI. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining our cloud infrastructure.The ideal candidate will have a strong background in cloud computing, with...


  • London, Greater London, United Kingdom OpenStack Full time

    As a key member of our Infrastructure and Platform Engineering teams, you will play a pivotal role in maintaining and expanding our private cloud infrastructure, powered by OpenStack, across a global environment. Here, you'll find yourself surrounded by leaders and teams who are not only experts in their fields but are also enthusiastic about making a...


  • London, Greater London, United Kingdom Source Technology Full time

    Senior Site Reliability EngineerA highly sought-after opportunity to spearhead a brand new team at Source Technology, a leading global financial services provider.About the RoleThis exceptional individual will lead a talented group of Full Stack Infrastructure Engineers in crafting robust infrastructure solutions that ensure reliability, scalability, and...


  • London, Greater London, United Kingdom ClearScore Full time

    The RoleClearScore is expanding its Site Reliability Engineering team to support our productivity, reliability, and efficiency. Our SRE team builds an internal developer platform that provides three nines of uptime to all critical services, supports over a thousand production releases per month, and scales intelligently in response to system load and...


  • London, Greater London, United Kingdom Preqin Full time

    Job Description:We are seeking a Cloud Infrastructure Engineer to join our Engineering team at Preqin in London. As a key member of the team, you will be responsible for designing, building, and operating our infrastructure, middleware, and CI/CD systems.In this role, you will use your site reliability expertise to design, operate, and support Preqin's...


  • London, Greater London, United Kingdom Anson McCade Full time

    Cloud Reliability Engineer PositionJob DescriptionAt Anson McCade, we're looking for a skilled Cloud Reliability Engineer to join our Cloud Infrastructure team. As a Cloud Reliability Engineer, you'll play a key role in designing, building, and optimizing our cloud infrastructure to ensure high availability, reliability, and...


  • London, Greater London, United Kingdom LA International Full time

    Job Title: Cloud Platform Engineer - Infrastructure ExpertLocation: Bracknell, UKEstimated Salary Range: £60,000 - £80,000 per annumJob Description:Company OverviewLA International is a leading provider of cloud-based solutions, and we are currently seeking an experienced Cloud Platform Engineer to join our team.Job ResponsibilitiesDesign and implement...


  • London, Greater London, United Kingdom StarRez Full time

    Join Our Team as a Cloud Reliability EngineerStarRez, Inc. is a leading provider of cloud software solutions for student housing and property management. Our platform serves 1,300 institutions across 25 countries, with over 3 million beds. We're committed to delivering exceptional customer satisfaction, with a score of 99%.About the RoleWe're seeking an...


  • London, Greater London, United Kingdom InfraView - Specialist Cloud & IT Infrastructure Technology Recruitment Full time

    Cloud Architect Job OpportunityThis role involves working with a leading Solution Provider to help an insurance company build and deliver a multi-cloud environment. The ideal candidate will have experience with Google Cloud, Azure, Entra ID, and M365. Responsibilities include collaborating with the internal team to implement the cloud solutions, migrating to...


  • London, Greater London, United Kingdom Close Brothers Group Full time

    Job Title: Senior Software Engineer - Cloud ExpertJob Description: As a Senior Software Engineer - Cloud Expert, you will be responsible for designing, developing, and maintaining scalable and secure cloud-based systems.Key Responsibilities:Design and develop cloud-based systems using AWS and Azure.Maintain and improve existing cloud...