Site Reliability Engineer
3 days ago
DWP. Digital with Purpose.
We have a fantastic opportunity to join our community of experts at DWP Digital as a Senior Site Reliability Engineer, within one of our SRE teams at the heart of Digital Transformation.
We're using fresh ideas and leading-edge tech to build and maintain digital solutions that will be used by nearly every person in the UK, every day and at key moments in their lives.
DWP is the UK's largest government department. We help people into work and make payments worth over £195bn a year to support and empower millions of people.
The scale of what we do is extraordinary, and our purpose is unique. We'd love you to join us.
What skills, knowledge and experience will you need?
- Lead Criteria: Automation Expertise: Proven experience in scripting to automate processes, eliminating manual tasks, and implementing infrastructure and configuration as code.
- CI/CD Pipeline Development: Demonstrated ability to build and enhance CI/CD pipelines for efficient and reliable software delivery.
- Development: Demonstrable experience of developing cloud based and supporting cloud-based applications in AWS & Azure.
- Incident Resolution: Strong experience in resolving complex technical incidents, ensuring minimal downtime and swift recovery.
- Reliability Engineering: Expertise in reliability engineering, including capacity and performance management through effective monitoring, logging, and alerting.
- Leadership: Demonstrated ability to engage with stakeholders at all levels, providing valuable feedback and support, while leading teams effectively, mentoring junior engineers, and driving improvements in working practices.
You and your role
Your day will be all about making sure our applications and infrastructure are reliable, secure and ready for scale. You'll work closely with development teams from the design stage, helping them build systems that follow best practices and meet department standards.
You'll lead by example, mentoring other SREs, guiding teams and driving improvements. A big part of your role will be creating and maintaining detailed runbooks so incidents can be resolved quickly and efficiently. You'll also automate repetitive tasks, reduce toil and make sure monitoring is in place so issues are spotted before they become problems.
When major incidents happen, you'll take the lead in coordinating the right people and restoring services fast. You'll manage error budgets, review high-priority incidents and push a culture of engineering ownership across the organisation.
Details. Wages. Perks.
Hybrid Working:
We work a hybrid model - you'll spend some time working at home and some time collaborating face to face in a hub.
Pay:
We offer competitive pay of up to £78,517
Pension:
You'll get a brilliant civil service pension with employer contributions worth 28.97%, worth over £16,000 per year.
Holidays:
A generous leave package starting at 26 days rising to 31 days over time.
You can also take up to 3 extra days off a month on flexi-time. You'll also get all the usual public holidays.
We have a broad benefits package built around your work-life balance which includes:
- Flexible working including flexible hours and flex-friendly policies
- Time off volunteering and charitable giving
- Bring your authentic self to work with 'I Can Be Me in DWP'
- Discounts and savings on shopping, fun days out and more
- Interest-free loans to buy a bike or a season ticket, so it's even easier for you to get to work and start making a difference
- Professional development, coaching, mentoring and career progression opportunities.
Process
We know your time is valuable, so our application and selection process are just two stages:
Apply:
complete your application on Civil Service Jobs. There'll be full instructions when you click through.
Interview:
a single stage interview online.
CLICK APPLY
for more information and to start your application.
-
Site Reliability Engineer
2 weeks ago
Manchester, United Kingdom Searchability Full timeSITE RELIABILITY ENGINEER £40k salary Join a growing, technology-driven business operating at scale within the online gaming and sports sector. Opportunity to shape the SRE strategy. ABOUT THE CLIENT Our client is a fast-growing digital technology company at the forefront of delivering high-availability platforms for the sports and gaming industry. They...
-
Site Reliability Engineer
1 week ago
Manchester, England, United Kingdom Iceberg Full time £50,000 - £63,000 per yearSite Reliability Engineer (Azure)Location:Manchester or Glasgow (2 days per week on-site)Salary:£50,000 – £63,000 + discretionary bonusSponsorship:Unfortunately my client is unable to support visa sponsorships with this hireI am seeking an experienced Site Reliability Engineer to join a global financial services organisation. This is a true engineering...
-
Site Reliability Engineer
1 week ago
Manchester, United Kingdom Manchester Digital Full timeSite Reliability Engineer role at Manchester Digital You will have software engineering skills, focusing on system reliability and observability. You will monitor the health, performance and availability of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability,...
-
Site Reliability Engineer
7 days ago
Manchester, United Kingdom bet365 Group Full timeAs a Site Reliability Engineer, you will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices.Full-timeCloses 28/01/2026You will have software engineering skills, focusing on system reliability and observability. You will monitor the health, performance and...
-
Site Reliability Engineer
21 hours ago
Manchester, United Kingdom hackajob Full timeJoin to apply for the Site Reliability Engineer role at hackajob Company Description At bet365, we're one of the world's leading online gambling companies, revolutionising the industry since 2000. Founded by Denise Coates CBE, we now employ over 9,000 people and serve over 100 million customers in 27 languages. Our focus on In-Play betting has solidified our...
-
Site Reliability Engineer
2 weeks ago
Manchester, United Kingdom Anson McCade Full timeJob DescriptionAbout the RoleAre you passionate about building resilient systems and eliminating operational toil through automation? We’re looking for a Site Reliability Engineer (SRE) to join our high-impact team and help shape the future of our digital infrastructure.As an SRE, you’ll blend software engineering with systems engineering to ensure the...
-
Site Reliability Engineer
2 days ago
Manchester, United Kingdom hackajob Full timehackajob is collaborating with Bet365 to connect them with exceptional tech professionals for this role.Company DescriptionAt bet365, we're one of the world's leading online gambling companies, revolutionising the industry since 2000. Founded by Denise Coates CBE, we now employ over 9,000 people and serve over 100 million customers in 27 languages. Our focus...
-
Site Reliability Engineer
2 weeks ago
Manchester, United Kingdom Caspian One Full timeJob DescriptionWe’re building a Centralised SRE team to champion reliability engineering across global technology infrastructure. As a Senior Site Reliability Engineer, you’ll be at the forefront of this transformation engineering scalable systems, automating operations, and embedding resilience into every layer of the tech stack.This isn’t just about...
-
Site Reliability Engineer
2 days ago
Manchester, United Kingdom hackajob Full timehackajob is collaborating with Bet365 to connect them with exceptional tech professionals for this role. Company Description At bet365, we're one of the world's leading online gambling companies, revolutionising the industry since 2000. Founded by Denise Coates CBE, we now employ over 9,000 people and serve over 100 million customers in 27 languages. Our...
-
Site Reliability Engineer
4 weeks ago
Manchester, United Kingdom Caspian One Full timeWe’re building a Centralised SRE team to champion reliability engineering across global technology infrastructure. As a Senior Site Reliability Engineer, you’ll be at the forefront of this transformation engineering scalable systems, automating operations, and embedding resilience into every layer of the tech stack.This isn’t just about keeping the...