Site Reliability Engineer
4 weeks ago
About the Role
We are seeking a highly skilled Senior Site Reliability Engineer to join our team at the Government Digital Service. As a key member of our multidisciplinary service team, you will work closely with front-end and back-end developers, delivery and product managers, tech writers, and architects to build and maintain resilient, highly available, and secure systems that meet the needs of our users.
Key Responsibilities
- Design and implement infrastructure as code to ensure our infrastructure and deployment pipelines are reusable, repeatable, and reliable.
- Develop and maintain monitoring tools to ensure systems are appropriately monitored and instrumented to enable teams to identify and respond to operational issues quickly and effectively.
- Build and maintain CI/CD pipelines to enable our developers to get their code into production as quickly and safely as possible.
- Act as a digital ambassador, sharing experiences through public speaking and blog posts.
- Participate in our in-hours 2nd line and out-of-hours support rotas to gain empathy for users and awareness of operational concerns.
- Share knowledge of tools and practices with your wider team and peers to drive consistency and maintain our high engineering standards.
- Directly line-manage 2-4 technologists, supporting their career progression, development, and wellbeing, and providing regular coaching and performance feedback.
Requirements
- Experience with Linux operating system internals and comfort working with Linux virtual machines or containers.
- Experience of working with technologies that underpin digital services such as databases, web servers, DNS, CDNs, reverse proxies, message queues, and load balancers.
- Experience of cloud infrastructure providers such as AWS.
- Familiarity with container orchestration technologies such as Kubernetes, ECS, or Cloud Foundry; or serverless application design such as AWS Lambda.
- Understanding of SRE principles such as capacity planning, SLOs, and SLIs and how to design and support resilient, large-scale, high-performance services in a production environment.
- Ability to deploy monitoring tools to ensure systems are appropriately monitored and instrumented to enable teams to identify and respond to operational issues quickly and effectively.
- Familiarity with at least one programming language (we use TypeScript, Java, Python, Ruby, and Go).
- Proficiency using Git for version control.
- Understanding of the benefits of continuous integration and continuous deployment and experience with CI/CD tools such as Concourse, Jenkins, GitHub Actions, and CodePipeline.
- Experience of leading a technical team or project.
- Experience of line management, helping colleagues with their career development, or coaching others.
About Us
The Government Digital Service is part of the Cabinet Office, working at the very centre of government to make user-focused digital transformation happen. We build and maintain common platforms, products, and tools to use and create great public services that are accessible, inclusive, and easy to use. We also work with departments to identify patterns, share learning, and create change to make government more efficient.
We are an ambitious, fast-paced, and visionary team, with a background in software delivery and experience working in a scaled agile environment. If you are a motivated and multi-disciplined delivery team player, with a passion for delivering high-quality products and services, then this could be the place for you.
-
Site Reliability Engineering Manager
2 weeks ago
Bristol, Bristol, United Kingdom Xcede Full timeSite Reliability Engineering ManagerXcede is seeking a highly skilled Site Reliability Engineering Manager to join our team. As a Site Reliability Engineering Manager, you will be responsible for deploying and managing a suite of enterprise-wide tools used for provisioning, automation, and monitoring. You will also lead technical teams and collaborate with...
-
Site Reliability Engineering Manager
2 weeks ago
Bristol, Bristol, United Kingdom Xcede Full timeSite Reliability Engineering ManagerXcede is seeking a highly skilled Site Reliability Engineering Manager to join our team. As a Site Reliability Engineering Manager, you will be responsible for deploying and managing a suite of enterprise-wide tools used for provisioning, automation, and monitoring. You will also lead technical teams and collaborate with...
-
Site Reliability Engineering Lead
1 month ago
Bristol, Bristol, United Kingdom Xcede Full timeSite Reliability Engineering ManagerXcede is seeking a highly skilled Site Reliability Engineering Manager to join our team. As a key member of our technical leadership, you will be responsible for deploying and managing a suite of enterprise-wide tools used for provisioning, automation, and monitoring.Key Responsibilities:Platform owner and lead on...
-
Site Reliability Engineering Lead
1 month ago
Bristol, Bristol, United Kingdom Xcede Full timeSite Reliability Engineering ManagerXcede is seeking a highly skilled Site Reliability Engineering Manager to join our team. As a key member of our technical leadership, you will be responsible for deploying and managing a suite of enterprise-wide tools used for provisioning, automation, and monitoring.Key Responsibilities:Platform owner and lead on...
-
Cloud Site Reliability Engineer
5 days ago
Bristol, Bristol, United Kingdom Lloyds Banking Group Full timeAbout This OpportunityWe are seeking an experienced Cloud Site Reliability Engineer to join our Consumer Servicing and Engagement Platform team. As an application-level SRE, you will be an active and leading member of a cloud-focused team of engineers, working on one of the Group's flagship projects to run and maintain a set of products and services on the...
-
Site Reliability Engineering Expert
3 weeks ago
Bristol, Bristol, United Kingdom BT Group Full timeJob Title: Site Reliability Engineering SpecialistBT Group is seeking a highly skilled Site Reliability Engineering Specialist to join our team. As a key member of our engineering team, you will be responsible for building a highly available, robust, and reliable cloud native platform for engineering teams to quickly and seamlessly deploy their...
-
Site Reliability Engineering Expert
3 weeks ago
Bristol, Bristol, United Kingdom BT Group Full timeJob Title: Site Reliability Engineering SpecialistBT Group is seeking a highly skilled Site Reliability Engineering Specialist to join our team. As a key member of our engineering team, you will be responsible for building a highly available, robust, and reliable cloud native platform for engineering teams to quickly and seamlessly deploy their...
-
Cloud Site Reliability Engineer
3 days ago
Bristol, Bristol, United Kingdom Lloyds Banking Group Full timeAbout the RoleAt Lloyds Banking Group, we're seeking a skilled Cloud Site Reliability Engineer to join our Consumer Servicing and Engagement Platform team in Bristol. As a key member of our team, you'll be responsible for delivering against Google Cloud Platform (GCP) and Site Reliability Engineering (SRE) Public Cloud technology roadmaps.Key...
-
Electronics Engineer Intern
1 month ago
Bristol, Bristol, United Kingdom The Engineer Full timeJoin Our Team as a Navigation Sensor EngineerWe are seeking a talented and motivated individual to join our Navigation Sensors Group as a summer placement engineer. As a member of our team, you will have the opportunity to work on real-world engineering projects and technologies, applying your university learning to drive innovation and success.About the...
-
Senior Site Reliability Engineer
2 weeks ago
Bristol, Bristol, United Kingdom Government Digital Service Full timeAbout the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at the Government Digital Service. As a key member of our multidisciplinary service team, you will work closely with front-end and back-end developers, delivery and product managers, tech writers, and architects to build and maintain resilient, highly available,...
-
Reliability Engineer
2 weeks ago
Bristol, Bristol, United Kingdom Kingston Barnes Full timeJob Title: Condition Monitoring Reliability EngineerKingston Barnes is seeking a skilled Condition Monitoring Reliability Engineer to join their team. As a key member of the maintenance team, you will be responsible for ensuring the reliability and efficiency of the company's equipment.Key Responsibilities:Develop and implement Condition-Based Maintenance...
-
Dynamic Simulation Engineer
4 weeks ago
Bristol, Bristol, United Kingdom The Engineer Full timeSimulation and Modelling Engineer - Summer Placement 2025Location: Stevenage or BristolWorking Pattern: Onsite in an office environment with a team to support youClosing Date: Midnight on Monday 4th NovemberThe OpportunitySystem modelling is a vital tool for ensuring reliable and cost-effective weapon systems. We build dynamic system performance models,...
-
Dynamic Simulation Engineer
4 weeks ago
Bristol, Bristol, United Kingdom The Engineer Full timeSimulation and Modelling Engineer - Summer Placement 2025Location: Stevenage or BristolWorking Pattern: Onsite in an office environment with a team to support youClosing Date: Midnight on Monday 4th NovemberThe OpportunitySystem modelling is a vital tool for ensuring reliable and cost-effective weapon systems. We build dynamic system performance models,...
-
Product Site Reliability Engineer
3 weeks ago
Bristol, Bristol, United Kingdom Lloyds Banking Group Full timeAbout this opportunityThe Cloud Platform team is on a mission to create the next generation technical platform for Lloyds Banking Group, driving the UK's biggest financial service transformation. We're seeking a Cloud Product Service Reliability Engineer to join our Cloud SRE within the Cloud Platform.This role offers a unique chance to be part of an...
-
Product Site Reliability Engineer
3 weeks ago
Bristol, Bristol, United Kingdom Lloyds Banking Group Full timeAbout this opportunityThe Cloud Platform team is on a mission to create the next generation technical platform for Lloyds Banking Group, driving the UK's biggest financial service transformation. We're seeking a Cloud Product Service Reliability Engineer to join our Cloud SRE within the Cloud Platform.This role offers a unique chance to be part of an...
-
Reliability Engineer
3 weeks ago
Bristol, Bristol, United Kingdom GKN Aerospace Full timeAbout the RoleGKN Aerospace is a leading global supplier of systems and components to the aerospace industry. We are seeking a highly skilled Reliability Engineer to join our team at our Filton site in the UK.Key ResponsibilitiesLead the reliability management program, continually reviewing equipment data to identify opportunities to improve the technical...
-
Reliability Engineer
3 weeks ago
Bristol, Bristol, United Kingdom GKN Aerospace Full timeAbout the RoleGKN Aerospace is a leading global supplier of systems and components to the aerospace industry. We are seeking a highly skilled Reliability Engineer to join our team at our Filton site in the UK.Key ResponsibilitiesLead the reliability management program, continually reviewing equipment data to identify opportunities to improve the technical...
-
Reliability Engineer
2 weeks ago
Bristol, Bristol, United Kingdom GKN Aerospace Full timeAbout the RoleGKN Aerospace is a leading global supplier of systems and components to the aerospace industry. We are seeking a highly skilled Reliability Engineer to join our team at our Filton site in the UK.Key ResponsibilitiesLead the reliability management program, continually reviewing equipment data to identify opportunities to improve the technical...
-
Reliability Engineer
2 weeks ago
Bristol, Bristol, United Kingdom GKN Aerospace Full timeAbout the RoleGKN Aerospace is a leading global supplier of systems and components to the aerospace industry. We are seeking a highly skilled Reliability Engineer to join our team at our Filton site in the UK.Key ResponsibilitiesLead the reliability management program, continually reviewing equipment data to identify opportunities to improve the technical...
-
Reliability Engineering Specialist
2 months ago
Bristol, Bristol, United Kingdom GKN Aerospace Full timeGKN Aerospace is a global leader in the aerospace industry, dedicated to innovation and sustainable practices. We are committed to excellence and reliability, seeking a skilled Reliability Engineer to contribute to our Filton site. As a key member of our team, you will play a vital role in ensuring the highest standards of product performance and customer...