Reliability Engineering Expert for Cloud Infrastructure
1 week ago
We are seeking an experienced Reliability Engineer to join our team at Apple Inc. in a challenging role that combines engineering, software development, and problem-solving skills.
About the Role
This is an exciting opportunity to work on high-availability systems, scalable architecture, and monitoring tools to ensure the seamless operation of our cloud infrastructure. The ideal candidate will have strong experience in DevOps practices, cloud computing environments such as OpenStack, AWS, GCP, or Azure, and scripting languages such as Python or Go.
Responsibilities
- Design, implement, and maintain large-scale production environments with high availability and scalability.
- Pioneer and implement telemetry systems for AIS services to monitor performance and detect issues early.
- Collaborate with global security teams to establish alert-handling procedures and runbooks, and to automate deployment processes.
- Participate in capacity planning and disaster recovery exercises to ensure business continuity.
- Work closely with partner teams across the enterprise to deliver solutions efficiently.
Requirements
- Bachelor's Degree in Computer Science or equivalent experience.
- 5+ years of experience in Site Reliability Engineering, DevOps, or a related field.
- Strong programming skills: Python and/or Go.
- Experience working with cloud compute environments like OpenStack, AWS, GCP or Azure.
- Experience with infrastructure as code (IaC), configuration management, CI/CD, and automation.
Preferred Qualifications
- Proficiency in implementing and coordinating telemetry using monitoring and observability tools.
- Extensive experience administering and troubleshooting Linux systems, including standard Linux utilities.
- Troubleshooting and debugging experience.
- Shell scripting and system administration skills.
- Experience measuring, analyzing, and optimizing performance.
What We Offer
We offer a competitive salary of $160,000 - $220,000 per year, depending on experience, plus benefits including health insurance, retirement plans, and paid time off. The role is located in Cupertino, CA.
-
Cloud Infrastructure Management Expert
3 weeks ago
London, Greater London, United Kingdom Asian Infrastructure Investment Bank Full time
Job Overview
The Asian Infrastructure Investment Bank seeks a highly skilled Cloud Infrastructure Management Expert to join our team.
About the Role
This is an exciting opportunity to work in a dynamic environment where you will be responsible for managing and maintaining the organization's cloud infrastructure, ensuring scalability, security, and...
-
Cloud Infrastructure Expert
3 weeks ago
London, Greater London, United Kingdom InfraView - Specialist Cloud & IT Infrastructure Technology Recruitment Full time
Cloud Infrastructure Expert - Join InfraView to work on a Cloud Platform project with a leading Solution Provider.
Our client is an insurance company who are currently working with a leading Solution Provider to help them with their Cloud Platform. The provider is running a multi-cloud environment – combining Google, Entra ID, and M365. This will be a...
-
Cloud Infrastructure Engineer
1 week ago
London, Greater London, United Kingdom JP Engineering Full time
In this challenging and rewarding role, you will be responsible for ensuring the smooth operation of our cloud-based infrastructure. As a Cloud Infrastructure Engineer, you will design, implement, and maintain scalable and secure cloud architectures, ensuring high availability and performance. You will work closely with our development team to identify and...
-
Cloud Infrastructure Expert
3 weeks ago
London, Greater London, United Kingdom Photon Full time
Job Title: Cloud Infrastructure Expert
About the Role:
We are seeking a skilled Cloud Infrastructure Expert with expertise in Terraform, Golang development, AWS, SDLC automation, and Kubernetes to join our team at Photon. As a Cloud Infrastructure Expert, you will play a crucial role in designing, building, and maintaining our infrastructure and automation...
-
Cloud Infrastructure Engineer
4 weeks ago
London, Greater London, United Kingdom RVU Full time
Cloud Native Platform Expert
We are seeking a skilled Cloud Native Platform Expert to join our team at RVU. The ideal candidate will have extensive experience in running Kubernetes clusters in production, knowledge of Golang, Helm, and Terraform, and a strong understanding of cloud native technologies.
As a Cloud Native Platform Expert, you will be responsible...
-
Cloud Solutions Expert
3 weeks ago
London, Greater London, United Kingdom InfraView - Specialist Cloud & IT Infrastructure Technology Recruitment Full time
Job Role: Cloud Solutions Expert
Work with a leading solution provider to support a major insurance company with their cloud platform. As a key member of the team, you will be responsible for delivering high-quality cloud solutions and contributing to the development of the company's cloud strategy.
Key Responsibilities:
Collaborate with the client's team to...
-
Cloud Reliability Engineer
1 week ago
London, Greater London, United Kingdom GoCardless Full time
About the Role
We are seeking an experienced Cloud Reliability Engineer to join our distributed team at GoCardless. As a key member of our engineering team, you will be responsible for designing and implementing scalable and reliable infrastructure solutions.
With a strong interest in infrastructure management and site reliability engineering, you will...
-
Cloud Infrastructure Specialist
4 weeks ago
London, Greater London, United Kingdom Cloud Decisions Full time
Cloud Infrastructure Specialist
Cloud Decisions is seeking a highly skilled Cloud Infrastructure Specialist to join their team. The ideal candidate will have a strong background in cloud infrastructure, with expertise in Microsoft Cloud (Azure, M365, AVD), Server Infrastructure, and Hyper-Converged Infrastructure environments.
Key Responsibilities:
Act as the...
-
Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom STAND 8 Full time
Job Summary
We are seeking an experienced Site Reliability Engineer to join our team at STAND 8. As a Site Reliability Engineer, you will be responsible for maintaining existing systems, working on infrastructure modernization, and supporting the streaming engineering team to ensure smooth operation of linear streaming channels.
Key Responsibilities
Maintain...
-
Infrastructure Reliability Engineer
7 days ago
London, Greater London, United Kingdom Curve Full time
About the Role
We are seeking a skilled Cloud and Infrastructure Specialist to join our team at Curve. As an integral part of our Platform and Engineering team, you will be responsible for designing, building, and maintaining scalable infrastructure solutions to support our growing business.
In this role, you will work closely with our engineering teams to...
-
Cloud Infrastructure Engineer
1 week ago
London, Greater London, United Kingdom BenevolentAI Full time
Job Title: Cloud Infrastructure Engineer
About the Role:
We are seeking a highly skilled Cloud Infrastructure Engineer to join our team at BenevolentAI. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining our cloud infrastructure.
The ideal candidate will have a strong background in cloud computing, with...
-
Cloud Infrastructure Automation Expert
1 week ago
London, Greater London, United Kingdom OpenStack Full time
As a key member of our Infrastructure and Platform Engineering teams, you will play a pivotal role in maintaining and expanding our private cloud infrastructure, powered by OpenStack, across a global environment. Here, you'll find yourself surrounded by leaders and teams who are not only experts in their fields but are also enthusiastic about making a...
-
Cloud Infrastructure Expert
3 weeks ago
London, Greater London, United Kingdom Source Technology Full time
Senior Site Reliability Engineer
A highly sought-after opportunity to spearhead a brand new team at Source Technology, a leading global financial services provider.
About the Role
This exceptional individual will lead a talented group of Full Stack Infrastructure Engineers in crafting robust infrastructure solutions that ensure reliability, scalability, and...
-
Site Reliability Engineer
3 weeks ago
London, Greater London, United Kingdom ClearScore Full time
The Role
ClearScore is expanding its Site Reliability Engineering team to support our productivity, reliability, and efficiency. Our SRE team builds an internal developer platform that provides three nines of uptime to all critical services, supports over a thousand production releases per month, and scales intelligently in response to system load and...
-
Cloud Infrastructure Engineer
1 week ago
London, Greater London, United Kingdom Preqin Full time
Job Description:
We are seeking a Cloud Infrastructure Engineer to join our Engineering team at Preqin in London. As a key member of the team, you will be responsible for designing, building, and operating our infrastructure, middleware, and CI/CD systems.
In this role, you will use your site reliability expertise to design, operate, and support Preqin's...
-
Senior Cloud Infrastructure Architect
3 weeks ago
London, Greater London, United Kingdom Anson McCade Full time
Cloud Reliability Engineer Position
Job Description
At Anson McCade, we're looking for a skilled Cloud Reliability Engineer to join our Cloud Infrastructure team. As a Cloud Reliability Engineer, you'll play a key role in designing, building, and optimizing our cloud infrastructure to ensure high availability, reliability, and...
-
Cloud Platform Engineer
1 week ago
London, Greater London, United Kingdom LA International Full time
Job Title: Cloud Platform Engineer - Infrastructure Expert
Location: Bracknell, UK
Estimated Salary Range: £60,000 - £80,000 per annum
Job Description:
Company Overview
LA International is a leading provider of cloud-based solutions, and we are currently seeking an experienced Cloud Platform Engineer to join our team.
Job Responsibilities
Design and implement...
-
Cloud Infrastructure Engineer
1 month ago
London, Greater London, United Kingdom StarRez Full time
Join Our Team as a Cloud Reliability Engineer
StarRez, Inc. is a leading provider of cloud software solutions for student housing and property management. Our platform serves 1,300 institutions across 25 countries, with over 3 million beds. We're committed to delivering exceptional customer satisfaction, with a score of 99%.
About the Role
We're seeking an...
-
Cloud Infrastructure Specialist
1 week ago
London, Greater London, United Kingdom InfraView - Specialist Cloud & IT Infrastructure Technology Recruitment Full time
Cloud Architect Job Opportunity
This role involves working with a leading Solution Provider to help an insurance company build and deliver a multi-cloud environment. The ideal candidate will have experience with Google Cloud, Azure, Entra ID, and M365. Responsibilities include collaborating with the internal team to implement the cloud solutions, migrating to...
-
Senior Software Engineer
4 weeks ago
London, Greater London, United Kingdom Close Brothers Group Full time
Job Title: Senior Software Engineer - Cloud Expert
Job Description: As a Senior Software Engineer - Cloud Expert, you will be responsible for designing, developing, and maintaining scalable and secure cloud-based systems.
Key Responsibilities:
Design and develop cloud-based systems using AWS and Azure.
Maintain and improve existing cloud...