Current jobs related to Infrastructure Site Reliability Engineer - London, Greater London - undisclosed


  • London, Greater London, United Kingdom Apple Inc. Full time

    The diverse collection of our people and their ideas inspire innovation in everything we do at Apple. We are looking for passionate and talented Site Reliability Engineers to continue our focus in providing our customers the highest quality Apple Services experience.As a Site Reliability Engineer, you'll need to solve unique challenges using data, teamwork,...


  • London, Greater London, United Kingdom SNAPLOGIC Full time

    Site Reliability Engineer JobWe are seeking a highly skilled Site Reliability Engineer to join our Infrastructure Engineering and Operations Team at SNAPLOGIC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based systems, as well as developing and implementing strategies to improve their...


  • London, Greater London, United Kingdom Google Full time

    We are seeking an experienced Site Reliability Systems Engineer to join our Site Reliability Engineering team at Google. In this role, you will be responsible for designing, building, and maintaining large-scale distributed systems that support Google's product portfolio.As a Site Reliability Systems Engineer, you will work closely with cross-functional...


  • London, Greater London, United Kingdom Futureheads Recruitment | B Corp™ Full time £110,000

    Job DescriptionAs a Senior Site Reliability Engineer at Futureheads Recruitment | B Corp, you will play a critical role in ensuring global clients enjoy seamless access to our services. You will design, build, and operate the infrastructure, middleware, and CI/CD systems that power our teams.With expertise in Amazon AWS services, containerization, and...


  • London, Greater London, United Kingdom Remotestar Full time

    Remotestar is seeking a skilled Site Reliability Engineering Director to oversee the development and implementation of high-end monitoring and automation tooling for our client's B2B marketplace in the UK. As a senior leader, you will be responsible for creating and maintaining robust monitoring and automation tooling to ensure infrastructure and service...


  • London, Greater London, United Kingdom Rewardgateway Full time

    Engineering, LondonEarn a salary of £110,000 - £130,000 per year with Reward Gateway.We are seeking an experienced Site Reliability Engineer to lead our team and drive the transformation of our operational workloads to a Service Reliability Engineering (SRE) approach. The successful candidate will be responsible for establishing and managing our new SRE...


  • London, Greater London, United Kingdom Rewardgateway Full time

    Job Title: Site Reliability Engineering LeaderWe are seeking an experienced Site Reliability Engineering Leader to join our team. The successful candidate will be responsible for establishing and managing our new SRE function, operating and modernising our existing cloud infrastructure, partnering with our DevOps team to ensure fast & supportable platform...


  • London, Greater London, United Kingdom Inara Full time

    About the RoleThis position involves maintaining and improving the reliability of Inara's services. As a Site Reliability Engineer, you will be responsible for hands-on technical work, contributing to automation, and supporting various infrastructure components.You will collaborate closely with software engineering teams to ensure seamless integration of...


  • London, Greater London, United Kingdom Apple Full time

    **Job Title:** Site Reliability Engineering ExpertAt Apple, we're looking for a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for developing and running automated services that hundreds of millions of customers use every day.**About the Role:**We are seeking a creative, versatile, and...


  • London, Greater London, United Kingdom Searchability® Full time £50,000

    About the Role:">We are seeking a highly skilled Site Reliability Engineer to join our team. The successful candidate will be responsible for maintaining and developing all aspects of our technical estate, with a focus on ensuring the stability, scalability, and security of our infrastructure.


  • London, Greater London, United Kingdom Apple Inc. Full time

    As a Site Reliability Engineering manager at Apple Inc., you will be responsible for leading SRE teams responsible for the reliability and performance of on-prem and cloud-based services.Job SummaryWe are looking for a highly experienced SRE leader to join our team. The ideal candidate will have a strong background in distributed systems, especially ML...


  • London, Greater London, United Kingdom Winton Full time

    Company Overview: Winton is a research-based investment management company with a specialist focus on statistical and mathematical inference in financial markets.">About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for ensuring the operational stability...


  • London, Greater London, United Kingdom GoCardless Full time

    Overview of the RoleWe're looking for a highly skilled Site Reliability Engineer to help us build and maintain our infrastructure, ensuring it's scalable, reliable, and secure. As part of our cloud-first engineering team, you'll be working with cutting-edge technologies like AWS and GCP, and collaborating with cross-functional teams to drive innovation and...


  • London, Greater London, United Kingdom Selby Jennings Full time

    About Selby JenningsWe're a leading global financial services firm where technologists and investment professionals collaborate to drive innovation and operational excellence.About the RoleAs a Site Reliability Engineer, you'll apply your expertise in software and systems engineering to design, build, and maintain our robust infrastructure. You'll reduce...


  • London, Greater London, United Kingdom Bumble Full time

    Bumble Inc. is an equal opportunity employer committed to diversity and inclusion. As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability, scalability, and performance of our software systems.Key Responsibilities:Proactively manage and automate infrastructure to deliver a robust foundation for the business and exceptional...


  • London, Greater London, United Kingdom Palantir Technologies Full time

    **Overview of the Position**We are looking for a Site Reliability Operations Engineer to join our team at Palantir Technologies. As a key member of our engineering team, you will be responsible for building and maintaining high-performance, scalable, and reliable services for our production infrastructure.You will have experience with monitoring systems...


  • London, Greater London, United Kingdom Selby Jennings Full time

    Unlock Career Opportunities at Selby JenningsA leading global financial services firm seeks a talented Site Reliability Engineer to join its team of technologists and investment professionals. This innovative company combines expertise in software and systems engineering to build cutting-edge systems that drive business excellence.As a key member of the...


  • London, Greater London, United Kingdom Deutsche Bank Full time

    Deutsche Bank is a leading global bank with strong European roots and a global network. We are seeking an experienced Site Reliability Engineer Lead to join our team in London. The ideal candidate will have a strong background in technology and operations, with a proven track record of delivering high-quality solutions in a fast-paced environment.The...


  • London, Greater London, United Kingdom Kroo Bank Ltd Full time

    A dynamic opportunity has arisen for a skilled Site Reliability Engineer to join our team at Kroo Bank Ltd. With a competitive salary of £95,000 - £115,000 per annum, you'll be part of a talented team driving innovation in banking technology.We're committed to creating a diverse and inclusive workplace where everyone feels valued and empowered. To achieve...


  • London, Greater London, United Kingdom Rewardgateway Full time

    Role OverviewIn the heart of London, a leading digital platform for services and payments is seeking an exceptional Site Reliability Engineering (SRE) professional to drive its transformation. Reward Gateway, part of Edenred, aims to improve employee engagement and organisational resilience.Key ResponsibilitiesEstablish and manage a new SRE function to...

Infrastructure Site Reliability Engineer

2 months ago


London, Greater London, United Kingdom undisclosed Full time

Our Team Overview

At undisclosed, we're working to scale intelligence to serve humanity by training and deploying frontier models for developers and enterprises building AI systems.

We're building world-class infrastructure that's critical to our success, focusing on stability, scalability, and observability.

Our team optimizes for a wide range of technical skillsets, including experience running production infrastructure at a large scale, working with MLEs or data scientists, and designing large, highly available distributed systems.

Key Responsibilities

  • Designing and deploying complex Linux-based distributed computing environments
  • Collaborating with data engineers to manage costs and data lifecycle optimization
  • Building internal tooling to help large numbers of data engineers

Desirable Profiles

Storage Engineer: Experience with synchronized data between different cloud providers, distributed filesystems, and built internal tooling to help data engineers manage costs.

Analytics & Observability Engineer: Experience running analysis for technical teams, designing dashboards and reports, and using systems like Grafana, Prometheus, and BigQuery.

What We Offer

An open and inclusive culture, remote-flexible work arrangement, and a range of benefits including a weekly lunch stipend, full health and dental benefits, and 6 weeks of vacation.