Current jobs related to Site Reliability Engineer SRE Terraform Azure - West London - Client Server


  • London, Greater London, United Kingdom Onyx-Conseil Full time

    Senior SRE / Site Reliability Engineer (Terraform Azure). At Onyx-Conseil, we're seeking a highly skilled Senior SRE / Site Reliability Engineer to join our team. As a key member of our infrastructure team, you'll be responsible for ensuring the reliability, performance, and availability of our core platforms. Key Responsibilities: Design, build, and monitor...


  • London, Greater London, United Kingdom Client Server Full time

    Job Title: Senior SRE Terraform Azure. About the Role: We are seeking a highly skilled Senior SRE / Site Reliability Engineer to join our team at Client Server. As a Senior SRE / Site Reliability Engineer, you will be responsible for ensuring the reliability, performance, and availability of our core platforms. You will design, build, and monitor systems to...


  • London, Greater London, United Kingdom Client Server Full time

    Job Title: Senior SRE Terraform Azure. About the Role: We are seeking a highly skilled Senior SRE / Site Reliability Engineer to join our technology investment company. The ideal candidate will have experience managing public cloud infrastructure, particularly Azure, and a strong understanding of Terraform. Key Responsibilities: Design, build, and monitor...


  • London, Greater London, United Kingdom Client Server Full time

    Senior SRE / Site Reliability Engineer (Terraform Azure). At Client Server, we're seeking a highly skilled Senior SRE / Site Reliability Engineer to join our team. As a key member of our Platform Team, you'll be responsible for ensuring the reliability, performance, and availability of our core platforms. Key Responsibilities: Design, build, and monitor systems...


  • London, United Kingdom Experian Full time

    Job Description: We're looking for an accomplished and motivated Site Reliability Engineer (SRE) to join our Experian Data Quality team in London, on a hybrid working pattern. Reporting to the QA Director, you will ensure the reliability, performance, and scalability of our market-leading suite of data management products, with an initial focus on...


  • London, Greater London, United Kingdom Client Server Ltd. Full time

    Job Description: As a Senior SRE / Site Reliability Engineer at Client Server Ltd., you will be responsible for ensuring the reliability, performance, and availability of the company's core platforms. You will design, build, and monitor systems to maximize uptime and efficiency for the best possible user experience, collaborating with software engineering...


  • London, Greater London, United Kingdom Apollo Solutions Full time

    Site Reliability Engineering Manager. At Apollo Solutions, we are seeking a highly skilled Site Reliability Engineering Manager to lead our team in ensuring services are operational while supporting program timelines and business outcomes. Responsibilities: Lead the L1/L2 team to improve the cycle time and efficiency of incident & service...


  • London, Greater London, United Kingdom Experian Full time

    About the Role: We're seeking a skilled Site Reliability Engineer to join our Experian Data Quality team in London, working on a hybrid schedule. As a key member of our QA team, you'll ensure the reliability, performance, and scalability of our market-leading data management products, focusing on observability to support incident resolution and drive ongoing...


  • London, Greater London, United Kingdom ESL FACEIT Group Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at ESL FACEIT Group. As a key member of our infrastructure team, you will be responsible for designing, analyzing, and troubleshooting large-scale distributed systems. As a Site Reliability Engineer, you will work closely with our software engineering teams to deploy and...


  • London, Greater London, United Kingdom Remotestar Full time

    Remotestar is seeking a Senior Site Reliability Engineering Manager to join our client's team in the UK. The client is a leading B2B marketplace for diamonds, and we're looking for a seasoned expert to lead our infrastructure and services team. The ideal candidate will have a strong track record of building and maintaining highly reliable infrastructure and...


  • London, United Kingdom Insight Global Full time

    Insight Global are looking for a Site Reliability Engineer (SRE) for one of their largest broadcasting and media clients. This SRE will be working to maintain and improve an existing system focused on linear channel delivery whilst working on a new system to modernize this process. This individual will be responsible for supporting streaming engineers and...


  • DevOps Engineer/SRE

    2 months ago


    London, United Kingdom CV-Library Full time

    DevOps Engineer/Site Reliability Engineer (SRE) (SC cleared, inside IR35) I am looking for an SC cleared (must be active) DevOps Engineer/SRE for a leading consultancy client. You will be part of the Enterprise technology team delivering best practices alongside the engineering and architecture teams for a key government department. Must Have...


  • London Area, United Kingdom Insight Global Full time

    Insight Global are looking for a Site Reliability Engineer (SRE) for one of their largest broadcasting and media clients. This SRE will be working to maintain and improve an existing system focused on linear channel delivery whilst working on a new system to modernize this process. This individual will be responsible for supporting streaming engineers and...

Site Reliability Engineer SRE Terraform Azure

2 months ago


West London, United Kingdom Client Server Full time

Job Summary

We are seeking a highly skilled Site Reliability Engineer / SRE to join our team at Client Server. As a key member of our technology investment company, you will play a critical role in ensuring the reliability, performance, and availability of our core platforms.

About the Role

As a Site Reliability Engineer / SRE, you will be responsible for designing, building, and monitoring systems to maximize uptime and efficiency for the best possible user experience. You will provide expertise to software engineering teams to ensure that applications are built with reliability in mind and collaborate with the Platform Team to ensure the necessary infrastructure is scalable.

Key Responsibilities

  • Design and implement scalable and reliable infrastructure solutions using Terraform and Azure.
  • Develop and maintain monitoring and alerting systems to ensure proactive identification and resolution of potential outages and performance issues.
  • Collaborate with software engineering teams to ensure that applications are built with reliability in mind.
  • Work with the Platform Team to ensure that infrastructure is scalable and meets the needs of the business.
  • Identify and resolve potential outages and performance issues before they become a problem.

Requirements

  • Experience of managing a public cloud (Azure preferred).
  • Strong Terraform skills and experience.
  • Good knowledge of Datadog (or other monitoring tools, e.g. Prometheus, Grafana).
  • Experience with Azure DevOps, Octopus, and other CI/CD tools.
  • Scripting experience with at least one of the following: PowerShell, Python, C#.
  • Good knowledge of Linux and Windows operating systems and networking technologies.
  • Good knowledge of Ansible, Kubernetes, and Docker.
  • Excellent communication and stakeholder management skills.

What We Offer

  • Competitive salary of up to £110k.
  • 28 days holiday (plus Bank Holidays).
  • £1k learning and development budget.
  • Private health care, travel insurance, mental health sessions, and a wellbeing allowance.
  • Annual anniversary awards (including a 4-week paid sabbatical in year 4).
  • Perks such as cycle to work scheme, travel loan, office dogs, and free breakfast.