Site Reliability Engineer

6 days ago


City Of London, United Kingdom Natobotics Full time

OverviewJoin to apply for the Site Reliability Engineer role at Natobotics.Location: London. Work Mode: Hybrid. Contract Role.Experience Level: 15+ Years.A Site Reliability Engineer is responsible for transforming the SDLC environment with engineering-focused role that emphasizes system reliability, automation, and performance in a non-production setting.ResponsibilitiesAutomate environment lifecycle: Develop Infrastructure as Code (IaC) to automate provisioning, teardown, and configuration of test environments, integrating them with the CI/CD pipeline.Establish service level objectives (SLOs): Define and measure SLIs for test environments, such as availability and provisioning time.Monitor environment health and performance: Use observability tools like Prometheus and Grafana to track the health of test environments, identify bottlenecks, and resolve issues proactively, not reactively.Manage incident response: Lead the incident management process for test environment issues, conducting blameless post-mortems to understand the root causes and implement lasting fixes.Minimize toil: Automate manual, repetitive tasks associated with test environments to free up engineering time for more strategic work.Strategic and cultural responsibilitiesDrive continuous improvement: Analyze environment performance data, incident reports, and post-mortems to identify opportunities for continuous improvement and innovation.Balance reliability and speed: Use an "error budget" for test environments. If environments are highly reliable, teams can use the budget for quicker feature development. If reliability is low, the focus shifts to improving stability.Instil a reliability culture: Promote a blameless culture around test environment incidents, encouraging shared ownership and collaboration between development, QA, and SRE teams.Capacity planning: Anticipate the future resource needs of test environments by analysing usage patterns and project forecasts. Ensure the infrastructure can scale to meet demand.Advance test data management: Work with Test Data Managers to ensure that test data is not only readily available but also consistent, compliant, and automatically provisioned with the environments.Technical SkillsExpertise in tooling: Proficiency with monitoring and logging tools (e.g., Prometheus, Splunk, Grafana), CI/CD platforms (e.g., Jenkins, GitLab CI), and configuration management tools (e.g., Ansible, Terraform).Cloud infrastructure knowledge: Deep understanding of cloud platforms like AWS, including experience with containerization technologies (Docker, Kubernetes) and serverless computing.Scripting and programming: Strong scripting skills in languages such as Python or Bash to automate environment management tasks.Systems and networking knowledge: Solid understanding of Linux systems, networking concepts, and database management.Soft SkillsLeadership and influence: The ability to champion SRE practices and influence technical and business stakeholders across different teams.Problem-solving: Strong analytical and debugging skills for investigating and resolving complex environment issues under pressure.Communication: Excellent communication and collaboration skills to bridge the gap between development, QA, and operations teams.Adaptability: A proactive and adaptable mindset to keep pace with evolving technology and development methodologies.Employment and LocationSeniority level: Mid-Senior levelEmployment type: ContractLocation: London, England, United KingdomNote: Referrals increase your chances of interviewing at Natobotics by 2x. #J-18808-Ljbffr



  • City Of London, United Kingdom N Consulting Limited Full time

    LocationLondon, United Kingdom# Site Reliability Engineer at N Consulting LtdLocationLondon, United KingdomSalary£70000 - £75000 /yearJob TypeContractDate PostedSeptember 22nd, 2025Apply NowRole : Site Reliability Engineer (SRE)Location : LondonWork Mode : HybridContract RoleJob Description:A Site Reliability Engineer is responsible for transforming the...


  • City of London, United Kingdom Amelco Limited Full time

    Role: Site Reliability EngineerType: Full-time permanent roleLocation: Hybrid/ Shoreditch, London 3 days per weekAbout UsAmelco Ltd are a leading gaming and gambling solution software provider with a strong presence in the USA, UK, and Europe. Through partnerships with global gaming companies, we build cutting-edge technical platforms across sportsbooks,...


  • City Of London, United Kingdom Different Technologies Pty Ltd. Full time

    OverviewWe are hiring for a next generation telecoms software company who are seeking a Network Autonomy Engineer to join their expanding team.Primary Function of the PositionReporting to the Site Reliability Engineer Team Lead, the Site Reliability Engineer will be responsible for ensuring the reliability, scalability and performance of our...


  • City Of London, United Kingdom Charles Simon Associates Ltd Full time

    Overview Site Reliability Engineer (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) Permanent Remote Location: Remote (occasional travel to Nottinghamshire HQ)Salary: Up to £95,000 per annum + benefitsStart Date: ASAP Charles Simon Associates are working with a global organisation who are looking to recruit a...


  • City Of London, United Kingdom Mistral AI Full time

    About Mistral At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high‑performance, optimized, open‑source and cutting‑edge models, products and solutions. Our comprehensive AI platform is...


  • City of London, Greater London, United Kingdom Amelco Limited Full time

    Role: Site Reliability Engineer Type: Full-time permanent role Location: Hybrid/ Shoreditch, London 3 days per week About Us Amelco Ltd are a leading gaming and gambling solution software provider with a strong presence in the USA, UK, and Europe. Through partnerships with global gaming companies, we build cutting-edge technical platforms across sportsbooks,...


  • City of London, United Kingdom Barclays Bank Plc Full time

    Join us as a Site Reliability Engineer where you'll spearhead the evolution of our digital landscape, driving innovation and excellence. As a Microsoft SQL Database Site Reliability Engineer (SRE) at Barclays, you will assume a key technical role. You will assist in shaping the direction of our database administration, ensuring our technological approaches...


  • City Of London, United Kingdom Blackfluo.ai Full time

    About the job Site Reliability Engineer (SRE)Job DescriptionLocation: Full remote, EU timezone (CET +/- 2 hours)Start Date: As soon as possibleLanguages: English requiredWe are looking for a skilled Site Reliability Engineer (SRE) with deep expertise in AWS to help us scale and secure our infrastructure. As an SRE, you will be instrumental in ensuring the...


  • City Of London, England, United Kingdom Lorien Full time £65,000 - £130,000 per year

    Site Reliability Engineer - Live SC ClearanceLocation: City of London - Onsite 2/3 days a weekDuration: 6 monthsDaily Rate: £650 per day Inside of IR35We're looking for a Senior Site Reliability Engineer with deep expertise in Azure cloud migration and a strong DevOps background to join our team. This is a hands-on technical role reporting to the Cloud...


  • City Of London, United Kingdom GSR Full time

    Global Site Reliability Engineer Location: London About Us Founded in 2013, GSR is a leading market maker and programmatic trading firm in the fast‑evolving world of cryptocurrency trading. With over 200 employees across seven countries, we provide billions of dollars in liquidity daily to cryptocurrency protocols and exchanges. We build long‑term...