Principal Site Reliability Engineer

2 months ago


London, United Kingdom Plutus Full time

BPP Education is entering a new phase of its growth and evolution, attracting thousands more students each year and expanding into new verticals and new markets globally. The BPP Product & Technology (P&T) organisation is evolving rapidly, and driving transformation of its platforms, digital products and experiences, in order to help BPP Education scale and meet the growth of the business in the coming years.

We’re looking for a talented principal software reliability engineer (SRE) to help us build best in class products and deliver amazing user experiences, to deliver scalable, secure and performant experiences that delight and engage learners during their time studying with BPP and beyond, throughout their working lives.

As the Principal Software Reliability Engineer, you will report to the Engineering Manager, bringing your technical expertise to our growing product engineering teams, leveraging modern software development practices that will deliver business value at pace. You will be accountable for designing, implementing and maintaining systems that ensure the reliability, scalability and availability of our software products. This role is key as we transform BPP Education to become more customer centred, design and data informed, to build products that meet and exceed our users’ needs across our education ecosystem.

Key Responsibilities:

  • Accountability for the execution of the technical vision and ensure it is aligned with business goals.
  • Coach & mentor SRE engineers across the business in designing and implementing systems that ensure the reliability and availability of software products.
  • Create and execute a strategy for monitoring, alerting & automation tools that improve system reliability, scalability & stability.
  • Lead incident management response in production systems.
  • Analyse system metrics and logs to identify opportunities for improvement and prevent future or recurring incidents.
  • Collaborate with your peers in architecture, product, design, data and security to identify & mitigate risks to the system reliability.
  • Contribute and evolve the internal software engineering practices and standards as the team scales.
  • Be up-to-date with industry best practices, new technologies, and emerging trends.
Essential Skills:
  • Proven experience in a similar software engineering or SRE role working in an agile environment.
  • Deep knowledge of cloud networking, security and native functionality in AWS.
  • Expertise in Infrastructure as Code (IaC) using frameworks such as Terraform.
  • Expertise in monitoring and automation tools such as New Relic, DataDog & GitHub Actions.
  • Strong background in software development, architecture or operations.
  • Proficient knowledge of modern full stack technologies such as Typescript, React, Node.js, & Next.js.
  • Expert knowledge in relational and non-relational database technologies such as RDS, Dynamo & Redis.
  • Experience coaching & mentoring a diverse group of engineers.
  • Excellent verbal and written communication skills.
Core Skills: AWS, Terraform, Datadog, Typescript, React Other Skills: Cloud Security, Github, DynamoDB, Redis Seniority: Lead #J-18808-Ljbffr

  • London, United Kingdom Plutus Full time

    BPP Education is entering a new phase of its growth and evolution, attracting thousands more students each year and expanding into new verticals and new markets globally. The BPP Product & Technology (P&T) organisation is evolving rapidly, and driving transformation of its platforms, digital products and experiences, in order to help BPP Education scale and...


  • London, United Kingdom T. Rowe Price Full time

    Principal Site Reliability Engineer (SRE) page is loaded Principal Site Reliability Engineer (SRE) Apply locations London, Warwick Court time type Full time posted on Posted 20 Days Ago job requisition id 70771 There is a place for you at T. Rowe Price to grow, contribute, learn, and make a difference. We are a premierassetmanagerfocused on delivering...


  • London, United Kingdom T. Rowe Price Full time

    Principal Site Reliability Engineer (SRE) page is loaded Principal Site Reliability Engineer (SRE) Apply locations London, Warwick Court time type Full time posted on Posted 20 Days Ago job requisition id 70771 There is a place for you at T. Rowe Price to grow, contribute, learn, and make a difference. We are a premierassetmanagerfocused on delivering...


  • London, United Kingdom T. Rowe Price Full time

    Principal Site Reliability Engineer (SRE) page is loaded Principal Site Reliability Engineer (SRE) Apply locations London, Warwick Court time type Full time posted on Posted 20 Days Ago job requisition id 70771 There is a place for you at T. Rowe Price to grow, contribute, learn, and make a difference. We are a premierassetmanagerfocused on delivering...


  • London, United Kingdom T Rowe Price Full time

    Principal Site Reliability Engineer (SRE) There is a place for you at T. Rowe Price to grow, contribute, learn, and make a difference. We are a premier asset manager focused on delivering global investment management excellence and retirement services that investors can rely on today and in the future. The work we do matters. We invite you to explore the...


  • London, United Kingdom T Rowe Price Full time

    Principal Site Reliability Engineer (SRE) There is a place for you at T. Rowe Price to grow, contribute, learn, and make a difference. We are a premier asset manager focused on delivering global investment management excellence and retirement services that investors can rely on today and in the future. The work we do matters. We invite you to explore the...


  • London, United Kingdom T Rowe Price Full time

    Principal Site Reliability Engineer (SRE) There is a place for you at T. Rowe Price to grow, contribute, learn, and make a difference. We are a premier asset manager focused on delivering global investment management excellence and retirement services that investors can rely on today and in the future. The work we do matters. We invite you to explore the...


  • London, United Kingdom Prism Digital Full time

    **Site Reliability Engineer | GCP OR AWS & Kubernetes | SaaS HealthTech** The local headcount currently is 35 in Ireland and 45 in the UK (remote sys admins, tech engineers, field engineers, project managers, programme managers and sales) and expanding the UK office - feels like a start-up with start-up good energy. Our client is around 50% through their...


  • London, United Kingdom Apple Inc. Full time

    Site Reliability Engineering (SRE) Manager, iCloud People at Apple don’t just build products — they craft experiences our customers love and depend on. Apple Services Engineering (ASE) builds and supports the systems that make many of these daily experiences possible. If you’ve used Apple products, you’ve likely interacted with us. iCloud Services...


  • London, United Kingdom Apple Inc. Full time

    Site Reliability Engineering (SRE) Manager, iCloud People at Apple don’t just build products — they craft experiences our customers love and depend on. Apple Services Engineering (ASE) builds and supports the systems that make many of these daily experiences possible. If you’ve used Apple products, you’ve likely interacted with us. iCloud Services...


  • London, United Kingdom Nominet Full time

    Press Tab to Move to Skip to Content Link Engineering Manager - Site Reliability Engineering Location: London / Hybrid, GB Engineering Manager – Site Reliability Engineering Contract Type: Permanent Location: Hybrid (minimum 20% on-site in our London Shoreditch office) We’re proud to be an Equal Opportunity and Affirmative Action...


  • London, United Kingdom Nominet Full time

    Press Tab to Move to Skip to Content Link Engineering Manager - Site Reliability Engineering Location: London / Hybrid, GB Engineering Manager – Site Reliability Engineering Contract Type: Permanent Location: Hybrid (minimum 20% on-site in our London Shoreditch office) We’re proud to be an Equal Opportunity and Affirmative Action...


  • London, United Kingdom TEKsystems Full time

    Site Reliability Engineer / SRE Description: My global client is looking for a Site Reliability Engineer / SRE to join their growing team who must have strong experience working within the financial services industry on large complex projects. To be successful in this Site Reliability / SRE project you will need expert experience within: AWS ...


  • London, United Kingdom Lorien Full time

    Site Reliability Engineer Location: London (hybrid remote working) **Salary**: Up to £100,000 + Very Generous Benefits Package One of the fastest growing software development organisation requires a Site Reliability Engineer to help be the glue between the companies Dev, QA and Product teams - enabling the smooth Continuous Build and Integration of new...


  • London, United Kingdom Lorien Full time

    Site Reliability Engineer Location: London (hybrid remote working) **Salary**: Up to £100,000 + Very Generous Benefits Package One of the fastest growing ecommerce organisation requires a Site Reliability Engineer to help be the glue between the companies Dev, QA and Product teams - enabling the smooth Continuous Build and Integration of new instances of...


  • London, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and currently...


  • London, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users. The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and...


  • London, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users. The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and...


  • London, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer Check all associated application documentation thoroughly before clicking on the apply button at the bottom of this description.I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability,...


  • London, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer Check all associated application documentation thoroughly before clicking on the apply button at the bottom of this description.I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability,...