Lead Site Reliability Engineer

3 weeks ago


London, Greater London, United Kingdom GoCardless Full time

About GoCardless:

At GoCardless, we are committed to revolutionizing the payment landscape by leveraging bank payments as the most efficient means for both sending and receiving funds. We also recognize the significant role of bank account data in enabling faster and more informed decision-making. Our mission is to streamline the utilization of bank payments and data for businesses across the globe. With over 85,000 organizations relying on our services for domestic and international transactions, we process more than $30 billion across 30 countries. As a leading fintech firm based in London, with additional offices in Riga, Paris, and Melbourne, we are pioneers in direct bank payment solutions.

We are on the lookout for a seasoned Lead Site Reliability Engineer to join our dynamic team. In this pivotal role, you will be instrumental in ensuring the scalability, dependability, and performance of our payment technology platform. As a vital contributor to the global direct bank payment solutions industry, we are dedicated to providing seamless experiences for our clients. We seek an individual who is passionate about building and maintaining resilient infrastructure, enhancing system reliability, and driving platform initiatives.

Key Responsibilities:

  • Steer strategic platform initiatives aimed at enhancing scalability, reliability, and performance on GCP
  • Design, implement, and refine the Observability stack to ensure high availability and performance
  • Collaborate with cross-functional teams to improve the availability of critical system components
  • Develop and execute strategies to boost system uptime and reliability
  • Propose enhancements to platform infrastructure and operational processes
  • Lead infrastructure optimizations and architectural advancements
  • Create CI/CD pipelines for effective deployment and release management
  • Engage in on-call rotations and address production issues as they arise

Essential Qualifications:

  • 5+ years of experience in Site Reliability Engineering or Platform Engineering roles
  • Proficient in building platforms utilizing Kubernetes
  • Extensive experience with GCP (or AWS/Azure) and distributed systems
  • Experience in designing large-scale/multi-region architectures
  • Knowledgeable in creating secure systems
  • Strong analytical and problem-solving abilities
  • Excellent communication skills
  • Capable of managing complex solutions within distributed systems

If you are excited about this opportunity, we encourage you to consider applying even if you do not meet every requirement.

Benefits Include:

  • Wellbeing: Access to dedicated support and medical coverage
  • Work Away Scheme: Flexible working options outside your country of residence
  • Adaptive Working: Flexible working hours tailored to your lifestyle
  • Equity: All employees receive equity
  • Parental Leave: Customized to meet individual needs
  • Time Off: Generous holiday allowance and wellness days


  • London, Greater London, United Kingdom Legal & General Full time

    About the RoleWe are seeking a seasoned Site Reliability Engineer to join our team at Legal & General. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our systems, working closely with development, architecture, and service management teams.Key ResponsibilitiesSystem Reliability and Scalability:...


  • London, Greater London, United Kingdom RemoteStar Full time

    Remote Senior Site Reliability Engineer LeadRemoteStar is seeking a highly skilled Remote Senior Site Reliability Engineer Lead to join our client's team in the UK. This is a fully remote work opportunity.The client is a leading B2B diamond and gemstones marketplace, connecting jewellery retailers to gemstone suppliers.Job SummaryAs the SRE Lead, you will...


  • London, Greater London, United Kingdom Legal & General Full time

    About the RoleWe are seeking a seasoned Site Reliability Engineer to join our team at Legal & General. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our systems, working closely with development, architecture, and service management teams.Key ResponsibilitiesSystem Reliability and Scalability:...


  • London, Greater London, United Kingdom loveholidays Full time

    Company OverviewAt loveholidays, we are a dynamic online travel agency dedicated to utilizing innovative technology to enhance our services. Our goal is to facilitate unforgettable travel experiences for countless individuals each year.Position SummaryWe are in search of a skilled Site Reliability Engineer to become a vital member of our Platform...


  • London, Greater London, United Kingdom Opus Recruitment Solutions Full time

    Site Reliability Engineer | Remote | Competitive SalaryCloud Computing | DevOps | Google Cloud Platform | Amazon Web Services | Kubernetes | Infrastructure | SRE | ELK StackWe are collaborating with a dynamic online retail company seeking to enhance their technical team by adding a Site Reliability Engineer. This role focuses on managing the reliability and...


  • London, Greater London, United Kingdom Department for Work and Pensions Full time

    Position OverviewAre you adept at managing stakeholder relationships effectively?Do you enjoy diagnosing issues and creating automated solutions to prevent recurrence?If this resonates with you, we invite you to explore this opportunity.In the role of Senior Site Reliability Engineer, you will champion the implementation of SRE best practices throughout our...


  • London, Greater London, United Kingdom Apple Inc. Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a key member of our team, you will play a critical role in supporting and scaling cloud services for thousands of development and operations engineers.Key ResponsibilitiesCloud Service Maintenance: Automate deployment and orchestration of...


  • London, Greater London, United Kingdom Mondrian Alpha Full time

    About Mondrian AlphaMondrian Alpha is a renowned hedge fund with a global presence, seeking a seasoned Site Reliability Engineer to join their London team.Job SummaryWe are looking for a highly skilled Site Reliability Engineer to play a pivotal role in maintaining the technology infrastructure that drives our operations, directly contributing to our...


  • London, Greater London, United Kingdom Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our services.Key ResponsibilitiesDesign, implement, and maintain large-scale distributed systems and...


  • London, Greater London, United Kingdom Experian Full time

    Job Opportunity for a Skilled Site Reliability EngineerWe are seeking a highly skilled and driven Site Reliability Engineer to join our dedicated team at Experian Data Quality in London, with a flexible working arrangement.As a key member reporting to the QA Director, you will be responsible for ensuring the dependability, efficiency, and scalability of our...


  • London, Greater London, United Kingdom WeAreTechWomen Full time

    About the RoleWe are seeking a skilled Site Reliability Engineering Specialist to join our team at WeAreTechWomen. As a Site Reliability Engineer, you will be responsible for ensuring the resilience and reliability of our firm's critical platform services.Key ResponsibilitiesCollaborate with our businesses to build and run resilient and reliable production...


  • London, Greater London, United Kingdom WeAreTechWomen Full time

    About the RoleWe are seeking a skilled Site Reliability Engineering Specialist to join our team at WeAreTechWomen. As a Site Reliability Engineer, you will play a critical role in ensuring the resilience and reliability of our firm's most critical platform services.Key ResponsibilitiesCollaborate with our businesses to build and run resilient and reliable...


  • London, Greater London, United Kingdom Google Full time

    About the RoleAs a Site Reliability Engineering Manager at Google, you will be responsible for leading a team of Software/Systems Engineers on projects that impact users globally. Your primary focus will be on ensuring the uptime and availability of key services, while also building automation to prevent problem recurrence.You will be directly responsible...


  • London, Greater London, United Kingdom Harrington Starr Full time

    Job OverviewLead Site Reliability Engineer - Remote OpportunityInnovative Start-up EnvironmentCompensation: £95,000 - £105,000 base salaryPosition SummaryWe are excited to invite applications for the role of Lead Site Reliability Engineer as our client, a dynamic start-up, is poised for significant growth and the launch of essential services. This position...


  • London, Greater London, United Kingdom Sterlings Full time

    Job Opportunity at SterlingsKubernetes Site Reliability Engineer - Investment BankingSterlings, a leading global investment bank, is seeking a highly skilled Kubernetes Site Reliability Engineer to join our Site Reliability Engineering team.Key Responsibilities:Design and implement scalable and highly available infrastructure services using...


  • London, Greater London, United Kingdom Robert Walters Full time

    Job DescriptionSENIOR SITE RELIABILITY ENGINEERSalary: £100,000 + 5% bonusLocation: London, hybrid working with 2 days per week in the officeWe are thrilled to present a remarkable opportunity for a Senior Site Reliability Engineer to join our team at Robert Walters as a Workforce Consultant. As an Employed Workforce Consultant, you will enjoy the benefits...


  • London, Greater London, United Kingdom Trust In SODA Full time

    Job OverviewPosition: Site Reliability Engineering ManagerIndustry: InsurTechLocation: RemoteSalary: £75,000 - £85,000Benefits: Bonus, Equity Options, Comprehensive Health Coverage, Learning & Development Fund, 25 Days Annual Leave, Flexibility for International WorkAre you eager to join a fast-growing InsurTech firm that is transforming the Premium...


  • London, Greater London, United Kingdom WeAreTechWomen Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Cloud Infrastructure team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and resilience of our firm's critical platform services.Key ResponsibilitiesCollaborate with cross-functional teams to design, implement, and operate scalable and...


  • London, Greater London, United Kingdom WeAreTechWomen Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Cloud Infrastructure team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and resilience of our firm's critical platform services.Key ResponsibilitiesCollaborate with cross-functional teams to design, implement, and operate scalable and...


  • London, Greater London, United Kingdom MRJ Recruitment Full time

    Senior Reliability Engineer (SRE) RoleOur leading retail sector client is seeking a skilled SRE to collaborate with their team and enhance deployment practices to minimize downtime, expedite troubleshooting, and facilitate smooth reversals.Work alongside a diverse and supportive team to contribute to groundbreaking projects and enjoy a collaborative...