Site Reliability Engineer

3 weeks ago


London, Greater London, United Kingdom Node4 Full time

About Node4

Founded in 2004, Node4 has evolved into a diverse and vibrant technology company with a workforce of over 1200 passionate individuals. Our people are the driving force behind our success, and we pride ourselves on providing exceptional service as standard.

Our Culture

We value innovation, trust, and passion, and we believe that our employees are the key to our continued growth and success. Whether you're just starting out in your career or looking to progress as an industry professional, Node4 offers a welcoming and evolving environment that will help you develop your skills and career.

The Role

We're seeking a skilled Site Reliability Engineer to join our team, responsible for managing, supporting, and optimising our platforms to ensure security, scalability, performance, and reliability across the stack. As a Site Reliability Engineer, you will be expected to demonstrate excellent skills in DevOps, containerisation, infrastructure-as-code automation, cloud and virtualisation, and monitoring and reporting.

Your Skills

  • DevOps - CI/CD Pipelines (Azure Devops / Github Actions)
  • Containerisation (Kubernetes, Docker, Helm, Rancher)
  • Infrastructure-as-code automation (Terraform, Ansible, Bash, Python)
  • Cloud and virtualisation (VMWare, Azure, AWS)
  • Monitoring and reporting (Grafana, Prometheus, Grafana Loki, Logstash/ Fluent Bit)
  • Good Linux administration skills (Ubuntu / RHEL)
  • Excellent communication skills (verbal, written and face-to-face)
  • Effective time management
  • Excellent writing skills
  • Exceptional attention to detail, with a positive solution-driven mindset
  • Enjoy working in a fast-paced and energetic environment
  • Able to present, prepare and deliver ideas and projects
  • Organisation, numeracy and a structured approach to work
  • A team player who can work with diverse areas within Node4
  • Interest in container-focused technologies

What We Offer

  • Hybrid Working
  • Private Medical Insurance or Company-Paid Health Cash Plan
  • Employee Assistance Program
  • 25 days holidays plus your birthday off
  • Option to purchase additional holiday (up to 5 days)
  • Company Pension Scheme
  • Life Assurance x 4
  • A diverse workforce
  • Employee investment with Node4 Training Academy
  • Family savings and shopping discounts through the Node4 benefits portal
  • Discounted Gym Membership
  • Modern facilities with open and welcoming breakout areas
  • Company Social events
  • Never-ending supply of hot and cold drinks, biscuits, sweets, and fruit


  • London, Greater London, United Kingdom Fourier Full time

    Key ResponsibilitiesAs a Site Reliability Engineer at Fourier, you will be responsible for designing and implementing tools to enhance the reliability and resilience of our production systems. This includes investigating failures, improving system performance, and automating manual processes.Required SkillsExcellent Python scripting skillsExperience with...


  • London, Greater London, United Kingdom J Bandy Consulting Full time

    Job SummaryJ Bandy Consulting is seeking an experienced Site Reliability Engineer to join our team. The ideal candidate will have a strong background in software engineering and a passion for building scalable and reliable systems.Key ResponsibilitiesDevelop and implement automation tools to improve the efficiency of our systemsCollaborate with...


  • London, Greater London, United Kingdom Selby Jennings Full time

    About Selby JenningsWe're a leading global financial services firm where technologists and investment professionals collaborate to drive innovation and operational excellence.About the RoleAs a Site Reliability Engineer, you'll apply your expertise in software and systems engineering to design, build, and maintain our robust infrastructure. You'll reduce...


  • London, Greater London, United Kingdom GoCardless Full time

    The RoleGoCardless is looking for a Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining the infrastructure and systems that support our payment and open banking products.Key ResponsibilitiesDesign and implement scalable and efficient infrastructure solutionsDevelop...


  • London, Greater London, United Kingdom Preqin Full time

    About the Role:Preqin is seeking an experienced Site Reliability Engineer to join our team in London. As a Site Reliability Engineer, you will work across Preqin's full suite of services, supporting our clients around the world.You will be responsible for designing, building, and operating our infrastructure, middleware, and CI/CD systems to ensure our teams...


  • London, Greater London, United Kingdom Highfield Professional Solutions Ltd Full time

    Highfield Professional Solutions Ltd is seeking a Site Reliability Engineer to join our team in Central London. The successful candidate will be responsible for managing and maintaining critical engineering systems within our Data Centre, ensuring that they operate efficiently and effectively. This role offers a competitive salary of up to 48,000 per year,...


  • London, Greater London, United Kingdom Kinetech Full time

    At Kinetech, we're seeking a talented Site Reliability Engineer to join our team. This role is responsible for ensuring the smooth operation of our software systems, with a focus on scalability, reliability, and performance.Key Responsibilities:Design and implement CI/CD pipelines to automate code integration, testing, and deployment.Automate repetitive...


  • London, Greater London, United Kingdom Rewardgateway Full time

    Engineering, LondonEarn a salary of £110,000 - £130,000 per year with Reward Gateway.We are seeking an experienced Site Reliability Engineer to lead our team and drive the transformation of our operational workloads to a Service Reliability Engineering (SRE) approach. The successful candidate will be responsible for establishing and managing our new SRE...


  • London, Greater London, United Kingdom Mondrian Alpha Recruitment Solutions Full time

    At Mondrian Alpha Recruitment Solutions, we are seeking a highly skilled Site Reliability Engineer to join our team responsible for engineering and supporting the company's critical infrastructure platforms.This team handles the centralized development infrastructure and works alongside engineering teams across the business to ensure the optimal route of...


  • London, Greater London, United Kingdom Fourier Full time

    Key ResponsibilitiesWe are seeking a skilled Site Reliability Engineering Specialist to join our team at Fourier. As a key member of our Site Reliability Engineering team, you will be responsible for developing tools for surveillance and enhancement of our production systems.You will work closely with our team to increase system resilience, investigate...


  • London, Greater London, United Kingdom Lorien Full time

    Key Responsibilities:Collaborate with the existing team to deliver a brand-new project.Work on a hybrid model with 1 day a week on-site in London.Develop and maintain reliable and efficient systems.Utilize experience with Java, Python, Splunk, ServiceNow, and MongoDB.Contribute to incident management and application monitoring.Ensure seamless interaction...


  • London, Greater London, United Kingdom loveholidays Full time

    About usWe are a dynamic and rapidly growing online travel agency that places technology at the heart of our success. With millions of people trusting us for their dream holidays, our focus is on delivering exceptional customer experiences through cutting-edge technology.We operate at scale, handling 100+ services and 8k requests per second while maintaining...


  • London, Greater London, United Kingdom J Bandy Consulting Full time

    Job OverviewJ Bandy Consulting seeks a skilled Site Reliability Engineer to ensure the reliability, scalability, and performance of our systems. The successful candidate will be responsible for developing the SRE culture, applying automation, and monitoring application performance.Key ResponsibilitiesDrive the evolution of the DevOps/GitOps toolchain to...


  • London, Greater London, United Kingdom Trade Nation Full time

    Site Reliability Engineer Job DescriptionAt Trade Nation, we're seeking a highly skilled Site Reliability Engineer to join our dynamic team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining scalable and reliable systems that ensure high availability and performance.Key ResponsibilitiesDesign and Implement...


  • London, Greater London, United Kingdom BenevolentAI Full time

    About the Role:BenevolentAI is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a Senior Site Reliability Engineer, you will be responsible for designing and implementing software solutions for cloud infrastructure, improving long-term infrastructure availability and reliability, and monitoring and handling incident response of...


  • London, Greater London, United Kingdom Apple Inc. Full time

    Unlock the Future of Cloud ServicesAt Apple Inc., we're not just building products - we're crafting experiences that our customers love and depend on. Our Apple Services Engineering (ASE) team is responsible for the systems that make these daily experiences possible. If you've used Apple products, you've likely interacted with us. Our iCloud Services SRE...


  • London, Greater London, United Kingdom Remotestar Full time

    Remotestar is seeking a Senior Site Reliability Engineering Manager to join our client's team in the UK. The client is building a B2B marketplace for diamonds, and we need someone to ensure the reliability, scalability, and performance of our infrastructure and services.The ideal candidate will have a strong track record of building and maintaining highly...


  • London, Greater London, United Kingdom Cisco Full time

    Job OverviewThe Cisco Site Reliability Engineering team is responsible for providing tools, services, and infrastructure to monitor and observe the ThousandEyes platform. As a Senior Site Reliability Engineer, you will own our logging pipeline and monitoring stack while working with developers to continuously improve our view of the platform.Key...


  • London, Greater London, United Kingdom ESL FACEIT Group Full time

    At ESL FACEIT Group, we're passionate about creating a culture that fosters innovation and community. As a Site Reliability Engineer, you'll play a crucial role in maintaining and improving our monitoring and observability tools, working closely with cross-functional teams to design, maintain, and operate systems at scale.Key ResponsibilitiesMaintain and...


  • London, Greater London, United Kingdom STAND 8 Full time

    Job SummaryWe are seeking an experienced Site Reliability Engineer to join our team at STAND 8. As a Site Reliability Engineer, you will be responsible for maintaining existing systems, working on infrastructure modernization, and supporting the streaming engineering team to ensure smooth operation of linear streaming channels.Key ResponsibilitiesMaintain...