Site Reliability Engineering Lead

3 weeks ago


London, Greater London, United Kingdom Harrington Starr Full time
Lead Site Reliability Engineer - Remote First

We are seeking a seasoned Site Reliability Engineering (SRE) Lead with strong leadership capabilities to elevate the non-functional and operational aspects of our platform, including availability, performance, efficiency, monitoring, and incident response.

Key Responsibilities:
  • Lead and Mentor: Provide hands-on guidance to your team, driving improvements in cloud operations and application analysis, ensuring our platform is always operational and capable.
  • Incident Management: Take the lead on incident management, orchestrating rapid responses to identify, mitigate, and minimize risks effectively.
  • Post-Mortem Analysis: Facilitate blameless post-mortems, implement actionable alerts, and drive the automation of incident management processes to improve overall efficiency.
  • Monitoring & Alerting: Develop and maintain monitoring systems and alerts for both pre-production and production environments, while working closely with the platform and application support teams to enhance platform reliability.
  • Automation: Identify repetitive, non-value-add tasks and lead efforts to automate these processes through coding and scripting, freeing up your team to focus on more strategic work.
Requirements:
  • Strong knowledge of incident management frameworks and a track record of evolving them to improve organizational response and recovery.
  • Experience in cloud operations, application analysis, and platform reliability.
  • Excellent leadership and mentoring skills, with the ability to drive improvements and growth within the team.


  • London, Greater London, United Kingdom McGregor Boyall Full time

    Lead Site Reliability EngineerWe are seeking a highly skilled Lead Site Reliability Engineer to join our team at McGregor Boyall. As a key member of our engineering team, you will be responsible for ensuring the reliability and scalability of our cloud-based systems.Key Responsibilities:Design and implement scalable and reliable cloud infrastructure using...


  • London, Greater London, United Kingdom McGregor Boyall Full time

    Lead Site Reliability EngineerWe are seeking a highly skilled Lead Site Reliability Engineer to join our team at McGregor Boyall. As a key member of our engineering team, you will be responsible for ensuring the reliability and scalability of our cloud-based systems.Key Responsibilities:Design and implement scalable and reliable cloud infrastructure using...


  • London, Greater London, United Kingdom Alevio Consulting Full time

    Lead Site Reliability EngineerAlevio Consulting is seeking a highly skilled Lead Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing, building, and maintaining high-performance, scalable, and reliable services for our in-house operations.In this role, you will oversee the maintenance...


  • London, Greater London, United Kingdom Alevio Consulting Full time

    Lead Site Reliability EngineerAlevio Consulting is seeking a highly skilled Lead Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing, building, and maintaining high-performance, scalable, and reliable services for our in-house operations.In this role, you will oversee the maintenance...


  • London, Greater London, United Kingdom McGregor Boyall Full time

    Lead Site Reliability EngineerMcGregor Boyall is seeking a highly skilled Lead Site Reliability Engineer to join our team in London. As a key member of our engineering team, you will be responsible for ensuring the reliability and scalability of our cloud-based systems.Key Responsibilities:Design and implement scalable and reliable cloud infrastructure using...


  • London, Greater London, United Kingdom Alevio Consulting Full time £750

    Lead Site Reliability EngineerWe are seeking a highly skilled Lead Site Reliability Engineer to join our team at Alevio Consulting. As a key member of our engineering team, you will be responsible for designing, building, and maintaining high-performance, scalable, and reliable services for our clients.About the Role:Lead and manage the development and...


  • London, Greater London, United Kingdom Alevio Consulting Full time £750

    Lead Site Reliability EngineerWe are seeking a highly skilled Lead Site Reliability Engineer to join our team at Alevio Consulting. As a key member of our engineering team, you will be responsible for designing, building, and maintaining high-performance, scalable, and reliable services for our clients.About the Role:Lead and manage the development and...


  • London, Greater London, United Kingdom JPMorganChase Full time

    About the RoleWe're seeking a highly skilled Site Reliability Engineer Lead to join our Accelerators Engineering team at JPMorgan Chase. As a key member of our team, you will play a critical role in ensuring the reliability and scalability of our products and services.Key ResponsibilitiesDesign and implement high-quality designs, roadmaps, and program...


  • London, Greater London, United Kingdom JPMorganChase Full time

    About the RoleWe're seeking a highly skilled Site Reliability Engineer Lead to join our Accelerators Engineering team at JPMorgan Chase. As a key member of our team, you will play a critical role in ensuring the reliability and scalability of our products and services.Key ResponsibilitiesDesign and implement high-quality designs, roadmaps, and program...


  • London, Greater London, United Kingdom McGregor Boyall Full time

    Lead Site Reliability EngineerWe are seeking a highly skilled Lead Site Reliability Engineer to join our team at McGregor Boyall. As a key member of our engineering team, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design and implement scalable and highly available cloud...


  • London, Greater London, United Kingdom McGregor Boyall Full time

    Lead Site Reliability EngineerWe are seeking a highly skilled Lead Site Reliability Engineer to join our team at McGregor Boyall. As a key member of our engineering team, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design and implement scalable and highly available cloud...


  • London, Greater London, United Kingdom Alevio Consulting Full time £750

    Lead Site Reliability EngineerWe are seeking a highly skilled Lead Site Reliability Engineer to join our team at Alevio Consulting. As a key member of our engineering team, you will be responsible for designing, building, and maintaining high-performance, scalable, and reliable services for our clients.About the Role:Lead and manage the development and...


  • London, Greater London, United Kingdom Alevio Consulting Full time £750

    Lead Site Reliability EngineerWe are seeking a highly skilled Lead Site Reliability Engineer to join our team at Alevio Consulting. As a key member of our engineering team, you will be responsible for designing, building, and maintaining high-performance, scalable, and reliable services for our clients.About the Role:Lead and manage the development and...


  • London, Greater London, United Kingdom McGregor Boyall Full time

    Lead Site Reliability EngineerWe are seeking a highly skilled Lead Site Reliability Engineer to join our team at McGregor Boyall. As a key member of our engineering team, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure.Key Responsibilities:Design and implement scalable and reliable cloud infrastructure...


  • London, Greater London, United Kingdom McGregor Boyall Full time

    Lead Site Reliability EngineerWe are seeking a highly skilled Lead Site Reliability Engineer to join our team at McGregor Boyall. As a key member of our engineering team, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure.Key Responsibilities:Design and implement scalable and reliable cloud infrastructure...


  • London, Greater London, United Kingdom Mondrian Alpha Full time

    Mondrian Alpha: Senior SRE with Buy-Side ExperienceWe are seeking a senior Site Reliability Engineer with buy-side/HFT experience to join our HFT team at Mondrian Alpha, the largest and most successful hedge fund in the world.This individual will work alongside talented engineers in a rapidly growing department, receiving market-leading compensations and...


  • London, Greater London, United Kingdom Mondrian Alpha Full time

    Mondrian Alpha: Senior SRE with Buy-Side ExperienceWe are seeking a senior Site Reliability Engineer with buy-side/HFT experience to join our HFT team at Mondrian Alpha, the largest and most successful hedge fund in the world.This individual will work alongside talented engineers in a rapidly growing department, receiving market-leading compensations and...


  • London, Greater London, United Kingdom Mondrian Alpha Full time

    Mondrian Alpha: A Leader in Hedge Fund ManagementOur client, the largest and most successful hedge fund in the world, managing assets worth over $50 billion USD, is seeking a senior Site Reliability Engineer (SRE) with buy-side/HFT experience to join their High-Frequency Trading (HFT) team.Key Responsibilities:Improve the uptime, capacity, and performance of...


  • London, Greater London, United Kingdom Mondrian Alpha Full time

    Mondrian Alpha: A Leader in Hedge Fund ManagementOur client, the largest and most successful hedge fund in the world, managing assets worth over $50 billion USD, is seeking a senior Site Reliability Engineer (SRE) with buy-side/HFT experience to join their High-Frequency Trading (HFT) team.Key Responsibilities:Improve the uptime, capacity, and performance of...


  • London, Greater London, United Kingdom Google Full time

    Job DescriptionAt Google, we're looking for a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, and performance of our services.Key ResponsibilitiesLead a team of Software/Systems Engineers on projects for users and be directly responsible for...