Site Reliability Engineer, Lead

3 weeks ago


Manchester, United Kingdom TekStream Solutions Full time

Our client is a remote-first company with team members across the globe Offering a SaaS-based Learning Management System powering the world's leading education programs. Our client helps large brands and fast-moving companies increase revenue, improve customer retention, and decrease support costs through external education. The platform includes all the tools an organization needs to create, manage, track, and improve highly personalized learning experiences for customers, partners, and employees.


Successful Candidate:

  • SaaS experience
  • Experienced and able to thrive in a small-medium high-growth environment
  • Invested in upskilling, learning new tech
  • Deeply curious, creative, and innovative
  • Flexible in working hours/ability to collaborate in different time zones


The Lead Site Reliability Engineer has a pivotal role at the forefront of our engineering operations, responsible for guiding the Platform Team toward achieving exceptional standards of reliability, performance, and stability across all our applications. The successful candidate will possess deep expertise in these core areas and will be instrumental in defining and implementing industry-leading practices. As a key leader, this role will not only shape the strategic direction of our platform operations but also establish the benchmarks and processes by which our engineering excellence is measured.


Responsibilities

  • Lead the SRE Team, setting clear goals and priorities in line with business objectives. In collaboration with the department Director develop and execute strategies that enhance technological capabilities across the company
  • Ensure all platforms and systems operate smoothly and remain highly available, scalable, and fault-tolerant. Implement best practices for continuous monitoring, preventive maintenance, and rapid response.
  • Continuously assess system performance, identify bottlenecks, and make data-driven recommendations for infrastructure enhancements.
  • Ensure that developers have access to the best tools and platforms to facilitate efficient coding practices and understand the performance of applications..
  • Educate the rest of engineering about best practices for writing performant code and troubleshoot problematic areas
  • Develop and refine incident management protocols. Lead efforts to troubleshoot and resolve high-impact issues, minimizing downtime and preventing future occurrences.
  • Work closely with other engineering teams and departments to understand their needs and ensure platform initiatives support overall company goals.
  • Monitor virtual infrastructure and be part of a 24x7 on-call rotation to respond to alerts


Requirements

  • 8+ years of experience as a software engineer
  • 5+ years of experience working with Ruby on Rails
  • Proven experience leading SR teams
  • 3+ years of experience working in infrastructure and operations
  • Expertise with SQL databases such as PostgreSQL
  • Experience with Cloud computing Amazon Web Services and/or Google Cloud
  • Ability to dig into unfamiliar code bases
  • Ability to document solutions and train operational teams on supportability
  • A sense of comfort working in a team-oriented and collaborative environment
  • Can communicate clearly and seek help and support proactively
  • Takes ownership of tasks and leads them to completion


Desired

  • Experience in developing solutions using server automation tools such as Ansible.
  • Experience writing and maintaining CI/CD pipelines and services.


Education

  • Bachelor’s degree in Computer Science or related technical field


  • Manchester, United Kingdom TekStream Solutions Full time

    Our client is a remote-first company with team members across the globe! Offering a SaaS-based Learning Management System powering the world's leading education programs. Our client helps large brands and fast-moving companies increase revenue, improve customer retention, and decrease support costs through external education. Deeply curious, creative, and...


  • Manchester, United Kingdom TekStream Solutions Full time

    Our client is a remote-first company with team members across the globe! Offering a SaaS-based Learning Management System powering the world's leading education programs. Our client helps large brands and fast-moving companies increase revenue, improve customer retention, and decrease support costs through external education. Deeply curious, creative, and...


  • Manchester, United Kingdom TekStream Solutions Full time

    Welcome to Our Client Our client is a remote-first company with team members across the globe! Offering a SaaS-based Learning Management System powering the world's leading education programs. Our client helps large brands and fast-moving companies increase revenue, improve customer retention, and decrease support costs through external education. The...


  • Manchester, United Kingdom TekStream Solutions Full time

    Welcome to Our Client Our client is a remote-first company with team members across the globe! Offering a SaaS-based Learning Management System powering the world's leading education programs. Our client helps large brands and fast-moving companies increase revenue, improve customer retention, and decrease support costs through external education. The...


  • Manchester, United Kingdom Contechs Consulting Full time

    Site Reliability Engineer (IT Infrastructure)10-month initial contractOnsite (Manchester)£33ph (Inside IR35)About the companyI am currently recruiting on behalf of a Luxury Automotive OEM, based in Manchester, seeking Site Reliability Engineers to join their teamJob DescriptionAs Site Reliability Engineer, your main responsibilities are:Software design and...


  • Manchester, United Kingdom Contechs Consulting Full time

    Site Reliability Engineer (IT Infrastructure)10-month initial contractOnsite (Manchester)£33ph (Inside IR35)About the companyI am currently recruiting on behalf of a Luxury Automotive OEM, based in Manchester, seeking Site Reliability Engineers to join their teamJob DescriptionAs Site Reliability Engineer, your main responsibilities are:Software design and...


  • Manchester, United Kingdom Contechs Full time

    Site Reliability Engineer (IT Infrastructure) 10-month initial contract Onsite (Manchester) £33ph (Inside IR35) About the company I am currently recruiting on behalf of a Luxury Automotive OEM, based in Manchester, seeking Site Reliability Engineers to join their team Job Description As Site Reliability Engineer, your main...


  • Manchester, United Kingdom Emotiv Technical Recruitment Full time

    Division: Automotive Electrical Engineering Location – Manchester Inside IR35 Position Description: The Role is for a SITE RELIABILITY ENGINEER - this is specific working in an AWS environment in Manchester on the SDV programme. A Senior Site Reliability Engineer with a passion for quality, and proven experience of cloud infrastructure, software...


  • Manchester, United Kingdom Emotiv Technical Recruitment Full time

    Division: Automotive Electrical EngineeringLocation – ManchesterInside IR35Position Description:The Role is for a SITE RELIABILITY ENGINEER - this is specific working in an AWS environment in Manchester on the SDV programme. A Senior Site Reliability Engineer with a passion for quality, and proven experience of cloud infrastructure, software engineering...


  • Manchester, United Kingdom Vantage Consulting Full time

    Junior Site Reliability Engineer/DevOps EngineerAltrincham, Manchester (Hybrid x2 days on site)Read on to fully understand what this job requires in terms of skills and experience If you are a good match, make an application.An International digital transformation consultancy based in Manchester are looking for Junior SREs and DevOps Engineers to join the...


  • Manchester, United Kingdom Vantage Consulting Full time

    Junior Site Reliability Engineer/DevOps Engineer Altrincham, Manchester (Hybrid x2 days on site) An International digital transformation consultancy based in Manchester are looking for Junior SREs and DevOps Engineers to join the team. Here is what you need --2 years+ experience with DevOps and SRE -Hands on experience with AWS -Programming knowledge with...


  • Manchester, United Kingdom Talented Recruitment Group Full time

    Are you passionate about crafting robust, fault-tolerant systems that power unforgettable travel experiences? Do you thrive in an environment where innovation and collaboration are valued? If so, we have an incredible opportunity for you! About the company: We are working with a leading global travel company dedicated to providing exceptional experiences...


  • Manchester, United Kingdom Cameron Connect Ltd Full time

    Join Our Clients Dynamic Mortgages Team at the Heart of Technological Innovation!Are you ready to apply Make sure you understand all the responsibilities and tasks associated with this role before proceeding.Are you an experienced Java or C# engineer with a passion for building and maintaining reliable, high-performing systems? Do you thrive in roles where...


  • Manchester, United Kingdom Cameron Connect Ltd Full time

    Join Our Clients Dynamic Mortgages Team at the Heart of Technological Innovation!Are you ready to apply Make sure you understand all the responsibilities and tasks associated with this role before proceeding.Are you an experienced Java or C# engineer with a passion for building and maintaining reliable, high-performing systems? Do you thrive in roles where...


  • Manchester, United Kingdom Cameron Connect Ltd Full time

    Join Our Clients Dynamic Mortgages Team at the Heart of Technological Innovation!Are you an experienced Java or C# engineer with a passion for building and maintaining reliable, high-performing systems? Do you thrive in roles where you can make a significant impact on the availability, performance, and efficiency of critical services? If you've previously...


  • manchester, United Kingdom Cameron Connect Ltd Full time

    Join Our Clients Dynamic Mortgages Team at the Heart of Technological Innovation!Are you an experienced Java or C# engineer with a passion for building and maintaining reliable, high-performing systems? Do you thrive in roles where you can make a significant impact on the availability, performance, and efficiency of critical services? If you've previously...


  • Manchester, United Kingdom Cameron Connect Ltd Full time

    Join Our Clients Dynamic Mortgages Team at the Heart of Technological Innovation!Are you an experienced Java or C# engineer with a passion for building and maintaining reliable, high-performing systems? Do you thrive in roles where you can make a significant impact on the availability, performance, and efficiency of critical services? If you've previously...


  • Manchester, United Kingdom Cameron Connect Ltd Full time

    Join Our Clients Dynamic Mortgages Team at the Heart of Technological Innovation!Are you an experienced Java or C# engineer with a passion for building and maintaining reliable, high-performing systems? Do you thrive in roles where you can make a significant impact on the availability, performance, and efficiency of critical services? If you've...


  • Manchester,, Greater Manchester, United Kingdom Cameron Connect Ltd Full time

    Join Our Clients Dynamic Mortgages Team at the Heart of Technological Innovation!Are you an experienced Java or C# engineer with a passion for building and maintaining reliable, high-performing systems? Do you thrive in roles where you can make a significant impact on the availability, performance, and efficiency of critical services? If you've...


  • Manchester, United Kingdom Cameron Connect Ltd Full time

    Job Description Join Our Clients Dynamic Mortgages Team at the Heart of Technological Innovation! Are you an experienced Java or C# engineer with a passion for building and maintaining reliable, high-performing systems? Do you thrive in roles where you can make a significant impact on the availability, performance, and efficiency of critical services? If...