Highly Skilled Site Reliability Engineering Leader
1 week ago
At Apple Inc., we're looking for a seasoned Site Reliability Engineering (SRE) manager to join our iCloud Services team.
About the Role
We're seeking an accomplished builder and leader of teams with a passion for SRE and a track record of delivering operational perfection at scale. As a key member of our SRE leadership team, you will shape the future of how we build and run our services on a global scale.
Responsibilities
- Lead high-performing SRE teams responsible for ensuring the reliability and performance of our on-prem and cloud-based services.
- Grow and develop engineers on your team, fostering a culture of innovation and excellence.
- Develop and implement strategies to maximize availability in staging and production environments.
- Promote observability and monitoring best practices across our systems.
- Advocate for industry-leading reliability engineering practices.
Requirements
- Proven experience leading large-scale distributed systems, including ML infrastructure and services like LLMs, Generative AI, and transformers.
- Demonstrated success as a technical leader, ideally in SRE or Production Engineering.
- Strong knowledge of core operating system principles, networking fundamentals, and systems management.
- Familiarity with SRE principles, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts.
Preferred Qualifications
- Experience in hiring and developing engineers.
- Professional experience in an engineering leadership position.
Compensation and Benefits
We offer a competitive salary of approximately $175,000 per year, plus a comprehensive benefits package, including health insurance, retirement savings plan, and paid time off. Located in London, England, United Kingdom, this role offers the opportunity to work on cutting-edge technology and collaborate with a talented team of professionals.
-
London, Greater London, United Kingdom Oxford Knight Full time
Site Reliability Engineer Opportunity
Oxford Knight is seeking a highly skilled Site Reliability Engineer to join our team and contribute to the development of innovative trading solutions. As a Site Reliability Engineer, you will play a crucial role in ensuring the smooth operation of our applications, providing early support for apps while being developed,...
-
Cloud Engineer and Site Reliability Leader
6 days ago
London, Greater London, United Kingdom Google Full time
About the Role
At Google, we're looking for a talented Cloud Engineer and Site Reliability Leader to join our team. As a key member of our SRE organization, you'll be responsible for designing, building, and operating large-scale distributed systems that meet the high standards of reliability, scalability, and performance. We're seeking someone with 8+ years...
-
Chief Reliability Engineering Leader
6 days ago
London, Greater London, United Kingdom Citigroup, Inc. Full time
Citigroup, Inc. Chief Reliability Engineering Leader
About the Job: We are seeking a highly skilled Chief Reliability Engineering Leader to join our team at Citigroup, Inc. This is a full-time position based on a competitive salary of $200,000 per year.
Job Description: The successful candidate will play a crucial role in driving operational excellence,...
-
Site Reliability Engineering Leader
1 day ago
London, Greater London, United Kingdom Rewardgateway Full time
Engineering, London
Earn a salary of £110,000 - £130,000 per year with Reward Gateway. We are seeking an experienced Site Reliability Engineer to lead our team and drive the transformation of our operational workloads to a Service Reliability Engineering (SRE) approach. The successful candidate will be responsible for establishing and managing our new SRE...
-
Site Reliability Engineering Manager
1 month ago
London, Greater London, United Kingdom Apple Inc. Full time
Unlock the Future of Cloud Services
At Apple Inc., we're not just building products - we're crafting experiences that our customers love and depend on. Our Apple Services Engineering (ASE) team is responsible for the systems that make these daily experiences possible. If you've used Apple products, you've likely interacted with us. Our iCloud Services SRE...
-
London, Greater London, United Kingdom Randstad Staffing Full time
Job Description: A highly skilled System Reliability Engineer with expertise in Java is required to join our team. This exciting role will see you play a critical part in ensuring the reliability, availability, and performance of applications or systems built using Java technologies.
Key Responsibilities: Application Performance Monitoring & Optimization: Use...
-
Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom Fourier Full time
Key Responsibilities
As a Site Reliability Engineer at Fourier, you will be responsible for designing and implementing tools to enhance the reliability and resilience of our production systems. This includes investigating failures, improving system performance, and automating manual processes.
Required Skills
- Excellent Python scripting skills
- Experience with...
-
London, Greater London, United Kingdom Techruiter Full time
We are Techruiter, a pioneering technology company specializing in cutting-edge Language Models (LLM) and Machine Learning solutions. Our team is seeking a highly skilled Site Reliability Engineer to ensure the reliability, scalability, and performance of our LLM and Machine Learning infrastructure.
About This Role
In this key position, you will play a...
-
Site Reliability Engineering Manager
1 month ago
London, Greater London, United Kingdom Apple Inc. Full time
Site Reliability Engineering Manager, Apple
At Apple, we're not just building products - we're crafting experiences our customers love and depend on. Our Apple Services Engineering (ASE) team builds and supports the systems that make many of these daily experiences possible. If you've used Apple products, you've likely interacted with us. Our iCloud Services...
-
Senior Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom Mondrian Alpha Recruitment Solutions Full time
At Mondrian Alpha Recruitment Solutions, we are seeking a highly skilled Site Reliability Engineer to join our team responsible for engineering and supporting the company's critical infrastructure platforms. This team handles the centralized development infrastructure and works alongside engineering teams across the business to ensure the optimal route of...
-
Senior Site Reliability Engineering Manager
1 month ago
London, Greater London, United Kingdom Remotestar Full time
Remotestar is seeking a Senior Site Reliability Engineering Manager to join our client's team in the UK. The client is building a B2B marketplace for diamonds, and we need someone to ensure the reliability, scalability, and performance of our infrastructure and services. The ideal candidate will have a strong track record of building and maintaining highly...
-
Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom Trade Nation Full time
Site Reliability Engineer Job Description
At Trade Nation, we're seeking a highly skilled Site Reliability Engineer to join our dynamic team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining scalable and reliable systems that ensure high availability and performance.
Key Responsibilities
Design and Implement...
-
Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom J Bandy Consulting Full time
Job Summary
J Bandy Consulting is seeking an experienced Site Reliability Engineer to join our team. The ideal candidate will have a strong background in software engineering and a passion for building scalable and reliable systems.
Key Responsibilities
- Develop and implement automation tools to improve the efficiency of our systems
- Collaborate with...
-
Site Reliability Engineering Manager
1 month ago
London, Greater London, United Kingdom College of Charleston Full time
Transformative SRE Leadership Opportunity
Are you a seasoned leader with a passion for strategy, leadership, and engineering excellence? Do you want to make a meaningful impact at a global financial institution? We're seeking a talented Site Reliability Engineering Manager to join our Operations and Technology Chief Information Office Business area.
About the...
-
Highly Skilled DevOps Engineer for Fourier
1 week ago
London, Greater London, United Kingdom Fourier Full time
At Fourier, we are looking for a highly skilled DevOps engineer to join our Site Reliability Engineering team. The successful candidate will be responsible for developing tools and solutions that enhance the reliability and resilience of our production systems. We offer a competitive salary range of $120,000 - $180,000 per annum, depending on experience, to...
-
Site Reliability Engineer
3 weeks ago
London, Greater London, United Kingdom Selby Jennings Full time
About Selby Jennings
We're a leading global financial services firm where technologists and investment professionals collaborate to drive innovation and operational excellence.
About the Role
As a Site Reliability Engineer, you'll apply your expertise in software and systems engineering to design, build, and maintain our robust infrastructure. You'll reduce...
-
Site Reliability Engineering Lead
4 days ago
London, Greater London, United Kingdom BenevolentAI Full time
About the Role: BenevolentAI is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a Senior Site Reliability Engineer, you will be responsible for designing and implementing software solutions for cloud infrastructure, improving long-term infrastructure availability and reliability, and monitoring and handling incident response of...
-
Site Reliability Engineering Expert
3 weeks ago
London, Greater London, United Kingdom GoCardless Full time
The Role
GoCardless is looking for a Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining the infrastructure and systems that support our payment and open banking products.
Key Responsibilities
- Design and implement scalable and efficient infrastructure solutions
- Develop...
-
Site Reliability Engineer
6 days ago
London, Greater London, United Kingdom Hamilton Barnes Associates Limited Full time
Job Title: Site Reliability Engineer
Hiring Company: Hamilton Barnes Associates Limited
We are seeking a highly skilled and experienced Site Reliability Engineer to join our team on a 6-month contract basis. The selected candidate will be working with one of the largest technology companies globally, ensuring seamless database environment operations and...
-
Senior Site Reliability Engineering Manager
1 month ago
London, Greater London, United Kingdom Remotestar Full time
Remotestar is seeking a Senior Site Reliability Engineering Manager to join our client's team in the UK. The client is a leading B2B marketplace for diamonds, and we're looking for a seasoned expert to lead our infrastructure and services team. The ideal candidate will have a strong track record of building and maintaining highly reliable infrastructure and...