Platform Reliability Manager

2 days ago


London, Greater London, United Kingdom Devopshunt Full time
Job Overview

We are looking for a Senior Site Reliability Engineer to join our team in London, England. As a key member of our SRE team, you will be responsible for ensuring the availability, performance, security, and reliability of our platform and core services.

About the Role

You will design, implement, and maintain monitoring solutions, metric-driven alerting, logging, and tracing. You will also troubleshoot complex environments, establish and measure SLIs and SLOs with engineering teams, and continuously improve relationships and ways of working with other engineering teams.

This role involves hands-on work with technical projects, taking direction from team principals. You will also participate in periodic 24x7 paid on-call duties.

  • Main Responsibilities:
  • Design, implement, and maintain monitoring solutions, metric-driven alerting, logging, and tracing
  • Troubleshoot complex environments
  • Establish and measure SLIs and SLOs with engineering teams
  • Participate in periodic 24x7 paid on-call duties
Salary and Benefits

The salary for this role is estimated to be around £90,000 per annum. Additionally, you will receive a range of benefits, including health insurance, pension scheme, and flexible working hours.



  • London, Greater London, United Kingdom Harrington Starr Full time £120,000

    Key ResponsibilitiesWe are seeking a Platform Reliability Engineer to join our team at Harrington Starr. The successful candidate will have experience in designing, deploying, and maintaining cloud-native infrastructure for regulated financial services.In this role, you will lead CI/CD implementations, ensuring seamless deployments with robust test...


  • London, Greater London, United Kingdom Cutover Full time

    About CutoverCutover is a pioneering company that has developed the world's first enterprise-wide work orchestration and observability platform. This innovative technology enables seamless collaboration between humans and machines.We're looking for a skilled Cloud Platform Reliability Engineer to join our team. As a key member of our engineering team, you...


  • London, Greater London, United Kingdom Wipro Full time

    Job Title: Platform and Service Reliability ExpertCompany Overview: Wipro Limited is a global technology leader that provides innovative solutions to its clients' complex digital transformation needs. We strive to create a diverse and inclusive workplace culture.Salary: $125,000 per year.Job Description:We are seeking an experienced Platform and Service...


  • London, Greater London, United Kingdom Quantcast Full time

    About the Role:We are looking for a talented Software Engineer to join our Platform Reliability team at Quantcast. As a member of this team, you will play a key role in ensuring the health and maintainability of our systems. The successful candidate will have a strong background in software engineering, with experience in designing and building large-scale...


  • London, Greater London, United Kingdom Stacklok, Inc. Full time

    OverviewWe are seeking a skilled Cloud Security Platform Reliability Engineer to join our team at Stacklok, Inc. in London. As a key member of our product engineering team, you will be responsible for advancing the reliability and operational efficiency of our cloud-based security platform.About the RoleThis is a hybrid role that requires on-site work at our...


  • London, Greater London, United Kingdom Tbwa ChiatDay Inc Full time

    About StacklokStacklok is an innovative software supply chain security startup that empowers developers to make safer open source dependency choices.We're seeking a Senior Site Reliability Engineer (SRE) to support Trusty, our package intelligence service. This role focuses on driving essential initiatives in automation, system monitoring, configuration...


  • London, Greater London, United Kingdom iManage Full time

    We are seeking a skilled Senior Reliability Engineer to join our team and help build something from the ground up with our new cloud platform.About the RoleThis is an exciting opportunity for someone who is interested in building, learning, and delivering a platform that delights customers. As a Senior Reliability Engineer, you will be responsible for...


  • London, Greater London, United Kingdom Board Intelligence Full time

    About the RoleWe are seeking a highly skilled Technical Lead for our Platform Reliability and Scalability team at Board Intelligence. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining the scalability and reliability of our software platform.As a Technical Lead, you will work closely with our...


  • London, Greater London, United Kingdom Tide Platform Limited Full time

    About TideTide is a rapidly growing fintech company building a comprehensive finance platform for small businesses. Our mission is to empower entrepreneurs and save them time and money by providing innovative banking services, invoicing tools, and accounting solutions.With over 1 million members across the globe, we are headquartered in central London with...


  • London, Greater London, United Kingdom Node4 Full time

    About Node4Founded in 2004, Node4 has evolved into a diverse and vibrant technology company with a workforce of over 1200 passionate individuals. Our people are the driving force behind our success, and we pride ourselves on providing exceptional service as standard.Our CultureWe value innovation, trust, and passion, and we believe that our employees are the...

  • Software Engineer

    3 weeks ago


    London, Greater London, United Kingdom Quantcast Full time

    Job DescriptionWe are seeking a highly skilled Software Engineer to join our team as a Platform Reliability Specialist. As a key member of our engineering team, you will be responsible for designing, developing, and maintaining large-scale distributed systems that respond to millions of real-time requests per second efficiently.The ideal candidate will have...


  • London, Greater London, United Kingdom ENGINEERINGUK Full time

    Why Join Us?We offer a competitive salary of £45,000-£60,000 per annum, depending on experience. You will also receive a comprehensive benefits package, including pension scheme, life insurance, and annual leave.Key ResponsibilitiesManage and maintain the Salesforce platform, ensuring its security and reliability.Create and manage user accounts, import and...


  • London, Greater London, United Kingdom LoyaltyLion Full time

    About UsLoyaltyLion is a pioneering data-driven loyalty and engagement platform, trusted by thousands of e-commerce brands worldwide. Our mission is to help merchants succeed in the competitive e-commerce landscape by offering a loyalty program that increases customer engagement, retention, and spend.Our platform enables stores to generate substantial...

  • Reliability Engineer

    3 weeks ago


    London, Greater London, United Kingdom loveholidays Full time

    About the RoleWe are seeking a highly skilled Reliability Engineer to join our team at LoveHolidays. As a key member of our infrastructure team, you will be responsible for ensuring the reliability and performance of our systems, which handle millions of users and thousands of requests per second.Our runtime architecture is Service Based and hosted on cloud...


  • London, Greater London, United Kingdom Vodafone Full time

    At Vodafone, we're building a better future through innovation and technology. Our dynamic global community empowers us to achieve this by challenging conventional norms and embracing emerging trends.Job OverviewWe're seeking an experienced Technical SRE Platform Manager Lead to oversee the development, maintenance, and optimisation of our existing platform...


  • London, Greater London, United Kingdom Tyk Technologies Full time

    Job OverviewWe are seeking an experienced Site Reliability Engineer to join our team at Tyk Technologies. This is a critical role that will require you to manage, maintain, and improve our platform.You will be responsible for ensuring the stability, scalability, and performance of our platform, as well as providing top-notch support to our clients. If you...


  • London, Greater London, United Kingdom Squarepoint Capital Full time

    Squarepoint Capital is a leading investment management firm that seeks a System Reliability Specialist to maintain the stability and performance of our trading services across the cloud. As a key member of our technology operations team, you will play a critical role in ensuring the reliability of our systems and infrastructure.The ideal candidate will have...


  • London, Greater London, United Kingdom Board Intelligence Full time

    We are proud to offer an Excellent Opportunity for a Senior Monitoring Site Reliability Engineer to join our team at Board Intelligence. As a key member of our technical team, you will play a vital role in ensuring the smooth operation of our platform and core services.The successful candidate will have a strong background in SRE/DevOps or Linux System...


  • London, Greater London, United Kingdom Harnham Full time

    Job DescriptionHarnham's Chief Technology Officer will be responsible for leading our AI-driven platform, ensuring it meets high standards of quality and performance. Key responsibilities include leading rapid prototyping, MVP development, and iteration, while overseeing platform development, implementation, and scaling. This role requires technical...


  • London, Greater London, United Kingdom Industrial Light & Magic Full time

    Industrial Light & Magic is looking for a Platform Development Lead to join our team. As a key member of our Platform Team, you will be responsible for developing and maintaining our platform infrastructure, ensuring it is scalable, reliable, and performs optimally.The salary range for this role is estimated to be between $150,000 and $200,000 per year,...