Senior Site Reliability Engineer
1 week ago
Location
Belfast, Birmingham, Cardiff, Darlington, Edinburgh, London, Salford
About The Job
Job summary
If you would like to find out more about the role, the Site Reliability Engineering team and what it's like to work at DBT, we are holding a Hiring Manager Q&A session for this role where you can virtually 'meet the team' on Friday 17th October at 12:30pm. Please click here to book your spot.
About Us
The Department for Business and Trade (DBT) has a clear mission - to grow the economy. Our role is to help businesses invest, grow and export to create jobs and opportunities right across the country. We do this in three ways.
Firstly, we help to build a strong, competitive business environment, where consumers are protected and companies rewarded for treating their employees properly.
Secondly, we open international markets and ensure resilient supply chains. This can be through Free Trade Agreements, trade facilitation and multilateral agreements.
Finally, we work in partnership with businesses every day, providing advance, finance and deal-making support to those looking to start up, invest, export and grow.
The Digital, Data and Technology (DDaT) directorate develops and operates tools and services to support us in this mission.
About The Role
We are on a mission to build a new cutting-edge developer platform in AWS and support DBT services running on the platform.
Can we rely on you to make us more reliable? We need Site Reliability Engineers (SREs) to make sure our internet services work as users expect.
Job Description
As a Senior Site Reliability Engineer you will work to give development teams the tools for their job, including application performance monitoring, exception, log and metrics aggregation, dashboards, and declarative CI/CD (continuous integration/continuous delivery) pipelines.
You'll evangelise product teams about service-level indicators, objectives, and error budgets, and negotiate them. You'll help build and scale our global product platform and participate in an on-call rota for which you will receive an additional allowance.
Specific projects the team are working on include rolling out an observability tool to enhance system monitoring and incident response and streamlining deployment processes to reduce downtime and speed up feature delivery.
Out of the 4 positions available, one of these posts will have line management responsibilities but we expect all of our Senior Site Reliability Engineers to coach and mentor junior colleagues across DDaT.
You Will Be Using
- Amazon Web Services
- Azure
- AWS CodePipelines and AWS CodeBuild
- Terraform & AWS Copilot (CloudFormation)
- Docker, Elastic Container Service (ECS) and Elastic Container Registry (ECR)
- ElasticSearch/OpenSearch
- Python and Django framework
- PostgreSQL as a service (Amazon RDS)
- Sentry
- Redis/Elasticache
Person specification
It Is Essential That You Have
- Cloud experience with either Amazon Web Services, Azure or Google Cloud.
- Ability to build code-defined, reliable, and well tested infrastructure on top of cloud computing systems (e.g., Terraform, CloudFormation, Pulumi).
- Experience and fluency in one or more programming languages, writing clean and effective code.
- Experience in designing, analysing, and troubleshooting distributed systems.
- Knowledge of Linux/Unix fundamentals and TCP/IP networking.
- Ability to see user impact in the infrastructure changes.
- Excellent communication skills when dealing with both technical and non-technical stakeholders
It Is Desirable That You Have
- Experience in defining and measuring Service Level Objectives through observability.
- Experience in prototyping through reuse of existing Open-Source components.
-
Senior Site Reliability Engineer
2 days ago
Manchester, United Kingdom Anaplan Full timeOverviewAnaplan Manchester, England, United KingdomJoin to apply for the Senior Site Reliability Engineer role at AnaplanAt Anaplan, we are a team of innovators focused on optimizing business decision-making through our leading AI-infused scenario planning and analysis platform so our customers can outpace their competition and the market. Our Winning...
-
Site Reliability Engineer
2 weeks ago
Manchester, United Kingdom Manchester Digital Full timeSite Reliability Engineer role at Manchester Digital You will have software engineering skills, focusing on system reliability and observability. You will monitor the health, performance and availability of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability,...
-
Senior Site Reliability Engineer
2 days ago
Manchester, United Kingdom Canonical Full timeOverviewJoin to apply for the Senior Site Reliability Engineer role at Canonical.Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation and IoT. The...
-
Site Reliability Engineer
3 weeks ago
Manchester, United Kingdom Caspian One Full timeJob DescriptionWe’re building a Centralised SRE team to champion reliability engineering across global technology infrastructure. As a Senior Site Reliability Engineer, you’ll be at the forefront of this transformation engineering scalable systems, automating operations, and embedding resilience into every layer of the tech stack.This isn’t just about...
-
Site Reliability Engineer
4 weeks ago
manchester, United Kingdom Caspian One Full timeWe’re building a Centralised SRE team to champion reliability engineering across global technology infrastructure. As a Senior Site Reliability Engineer, you’ll be at the forefront of this transformation engineering scalable systems, automating operations, and embedding resilience into every layer of the tech stack. This isn’t just about keeping the...
-
Site Reliability Engineer
6 days ago
Manchester, United Kingdom hackajob Full timeJoin to apply for the Site Reliability Engineer role at hackajob Company Description At bet365, we're one of the world's leading online gambling companies, revolutionising the industry since 2000. Founded by Denise Coates CBE, we now employ over 9,000 people and serve over 100 million customers in 27 languages. Our focus on In-Play betting has solidified our...
-
Site Reliability Engineer
3 weeks ago
Manchester, United Kingdom Searchability Full timeSITE RELIABILITY ENGINEER £40k salary Join a growing, technology-driven business operating at scale within the online gaming and sports sector. Opportunity to shape the SRE strategy. ABOUT THE CLIENT Our client is a fast-growing digital technology company at the forefront of delivering high-availability platforms for the sports and gaming industry. They...
-
Site Reliability Engineer
2 weeks ago
Manchester, United Kingdom bet365 Group Full timeAs a Site Reliability Engineer, you will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices.Full-timeCloses 28/01/2026You will have software engineering skills, focusing on system reliability and observability. You will monitor the health, performance and...
-
Site Reliability Engineer
3 weeks ago
Bolton, Greater Manchester, United Kingdom Caspian One Full timeWe're building a Centralised SRE team to champion reliability engineering across global technology infrastructure. As a Senior Site Reliability Engineer, you'll be at the forefront of this transformation engineering scalable systems, automating operations, and embedding resilience into every layer of the tech stack. This isn't just about keeping the lights...
-
Site Reliability Engineer
2 days ago
Manchester, United Kingdom Sectigo Full timeSectigo Manchester, England, United KingdomSite Reliability EngineerSectigo Manchester, England, United KingdomGet AI-powered advice on this job and more exclusive features.Job DescriptionWe are looking for a Site Reliability Engineer to join our growing global team at Sectigo.Job DescriptionWe are looking for a Site Reliability Engineer to join our growing...