Current jobs related to Principal Site Reliability Engineer - London - GoCardless


  • London, Greater London, United Kingdom Insight Global Full time

    Site Reliability Engineer OpportunityInsight Global is seeking a skilled Site Reliability Engineer to join their team in West London. As a key member of the streaming engineering group, you will be responsible for ensuring the smooth operation of linear streaming channels.The ideal candidate will have previous experience in Site Reliability Engineering, with...


  • London, Greater London, United Kingdom Insight Global Full time

    Site Reliability Engineer OpportunityInsight Global is seeking a skilled Site Reliability Engineer to join their team in West London. As a key member of the streaming engineering group, you will be responsible for ensuring the smooth operation of linear streaming channels.The ideal candidate will have previous experience in Site Reliability Engineering, with...


  • London, Greater London, United Kingdom LinuxRecruit Full time

    Unlock Your Potential as a Site Reliability EngineerWe are seeking a seasoned Site Reliability Engineer to join our team and contribute to the development and maintenance of our cutting-edge platform.As a Site Reliability Engineer, you will be responsible for designing, developing, and maintaining systems and applications using Golang. You will work closely...


  • London, Greater London, United Kingdom LinuxRecruit Full time

    Unlock Your Potential as a Site Reliability EngineerWe are seeking a seasoned Site Reliability Engineer to join our team and contribute to the development and maintenance of our cutting-edge platform.As a Site Reliability Engineer, you will be responsible for designing, developing, and maintaining systems and applications using Golang. You will work closely...


  • London, Greater London, United Kingdom Lorien Full time

    Site Reliability Engineer / DevOps EngineerWe are seeking a skilled Site Reliability Engineer / DevOps Engineer to join our team at Lorien, a leading consultancy. The ideal candidate will have experience with Salesforce automation and a strong background in SRE / Site Reliability Engineering.Key Responsibilities:Design and implement scalable and efficient...


  • London, Greater London, United Kingdom Lorien Full time

    Site Reliability Engineer / DevOps EngineerWe are seeking a skilled Site Reliability Engineer / DevOps Engineer to join our team at Lorien, a leading consultancy. The ideal candidate will have experience with Salesforce automation and a strong background in SRE / Site Reliability Engineering.Key Responsibilities:Design and implement scalable and efficient...


  • London, Greater London, United Kingdom Lorien Full time

    Site Reliability Engineer / DevOps EngineerWe are seeking a skilled Site Reliability Engineer / DevOps Engineer to join our team at Lorien, a leading consultancy. The ideal candidate will have experience with Salesforce automation and a strong background in SRE / Site Reliability Engineering.Key Responsibilities:Design and implement scalable and efficient...


  • London, United Kingdom Switch Tech Talent Full time

    Role: Site Reliability Engineer Location: London/Hybrid (3 days a week in office) Salary: £75,000 Key Skills: AWS, IaC, Docker, Scripting As a Site Reliability Engineer you will be at the forefront of maintaining robust, scalable, and secure cloud solutions that power this cutting-edge e-commerce platform. Your expertise will ensure seamless,...


  • London, Greater London, United Kingdom Fourier Full time

    Key ResponsibilitiesWe are seeking a highly skilled Site Reliability Engineer to join our team at Fourier. As a member of our Site Reliability Engineering team, you will be responsible for developing tools to enhance and monitor our production systems.Required SkillsExcellent Python scripting skillsExperience with Version Control best practicesGood knowledge...


  • London, Greater London, United Kingdom Fourier Full time

    Key ResponsibilitiesWe are seeking a highly skilled Site Reliability Engineer to join our team at Fourier. As a member of our Site Reliability Engineering team, you will be responsible for developing tools to enhance and monitor our production systems.Required SkillsExcellent Python scripting skillsExperience with Version Control best practicesGood knowledge...


  • London, Greater London, United Kingdom Fourier Full time

    Key ResponsibilitiesWe are seeking a highly skilled Site Reliability Engineer to join our team at Fourier. As a member of our Site Reliability Engineering team, you will be responsible for developing tools to enhance and monitor our production systems.Required SkillsExcellent Python scripting skillsExperience with Version Control best practicesGood knowledge...


  • London, Greater London, United Kingdom Fourier Full time

    Key ResponsibilitiesWe are seeking a highly skilled Site Reliability Engineer to join our team at Fourier. As a member of our Site Reliability Engineering team, you will be responsible for developing tools to enhance and monitor our production systems.Required SkillsExcellent Python scripting skillsExperience with Version Control best practicesGood knowledge...


  • London, Greater London, United Kingdom iO Associates - UKEU Full time £500

    Job Opportunity: Site Reliability EngineeriO Associates - UK/EU is seeking a skilled Site Reliability Engineer to join our team for a short-term project within the Law Enforcement sector.Key Responsibilities:Monitor system performance and security to ensure optimal functionality.Collaborate with our team to identify and resolve technical issues.Project...


  • London, Greater London, United Kingdom iO Associates - UKEU Full time £500

    Job Opportunity: Site Reliability EngineeriO Associates - UK/EU is seeking a skilled Site Reliability Engineer to join our team for a short-term project within the Law Enforcement sector.Key Responsibilities:Monitor system performance and security to ensure optimal functionality.Collaborate with our team to identify and resolve technical issues.Project...


  • London, Greater London, United Kingdom iO Associates Full time

    Job Opportunity: Site Reliability EngineeriO Associates is seeking a skilled Site Reliability Engineer to join our team for a short-term project within the Law Enforcement sector.Key Responsibilities:Monitor system performance and security to ensure optimal functionality.Collaborate with our team to identify and resolve technical issues.This role offers a...


  • London, Greater London, United Kingdom iO Associates Full time

    Job Opportunity: Site Reliability EngineeriO Associates is seeking a skilled Site Reliability Engineer to join our team for a short-term project within the Law Enforcement sector.Key Responsibilities:Monitor system performance and security to ensure optimal functionality.Collaborate with our team to identify and resolve technical issues.This role offers a...


  • London, Greater London, United Kingdom LinuxRecruit Full time

    Unlock Your Potential as a Site Reliability EngineerAre you a seasoned Site Reliability Engineer looking to take your skills to the next level? Do you thrive in fast-paced environments and have a keen eye for detail? We're seeking a talented individual to join our team and contribute to the development and maintenance of our cutting-edge platform.As a Site...


  • London, Greater London, United Kingdom Fourier Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Fourier. As a key member of our Site Reliability Engineering team, you will be responsible for developing tools and processes to enhance the reliability and resilience of our production systems.Key ResponsibilitiesDevelop and maintain automation scripts using Python...


  • London, Greater London, United Kingdom Fourier Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Fourier. As a key member of our Site Reliability Engineering team, you will be responsible for developing tools and processes to enhance the reliability and resilience of our production systems.Key ResponsibilitiesDevelop and maintain automation scripts using Python...


  • London, Greater London, United Kingdom iO Associates - UKEU Full time £500

    Job Title: Site Reliability EngineeriO Associates - UK/EU is seeking a skilled Site Reliability Engineer with Active NPPV3 Clearance to join our team for a short-term project within the Law Enforcement sector.Key Responsibilities:Monitor system performance and security to ensure optimal functionality.Collaborate with our team to identify and resolve...

Principal Site Reliability Engineer

5 months ago


London, United Kingdom GoCardless Full time

About us

At GoCardless we believe bank payments are the best way to pay and get paid. We also believe that bank account data is a powerful tool to make better, faster decisions. We’re making it easy to use both- for businesses everywhere.

GoCardless is used for domestic and international payments by 85,000+ organisations and counting, processing more than $30 billion across 30 countries. We’re an award-winning London based fintech, with additional offices in Riga, Paris and Melbourne.

GoCardless is seeking a seasoned Principal Site Reliability Engineer to join our team and take a leading role in ensuring the scalability, reliability, and performance of our payment technology platform. As a global leader in direct bank payment solutions, we are committed to delivering a seamless experience for our customers. We are looking for an individual who is passionate about building and maintaining robust infrastructure, enhancing system reliability, and driving platform initiatives. 

You'll also advise engineers on the SRE team and beyond, nurturing their growth and collaborating closely with other developers throughout the end-to-end development cycle across technical design, implementation, review, and release.

What You'll Do:

Lead and drive strategic platform initiatives, focusing on the scalability, reliability, and performance of our technology platform on GCP. This includes a vision of where our Infrastructure needs to be to deliver business objectives and driving a roadmap to meet the vision. Design, implement and refine our Observability stack to ensure high availability and performance, focusing on SLA and availability metrics. Collaborate with engineering and operations teams to identify and enhance the availability measures of critical components and systems. Design and implement strategies, tooling, and processes to improve system uptime and reliability, leveraging our platform on GCP and GKE. Recommend improvements to platform infrastructure and processes, enhancing efficiency and reliability. Lead infrastructure optimisations and architectural improvements in collaboration with cross-functional teams, addressing complex challenges and ensuring scalability. Design, develop and maintain CI/CD pipelines for seamless deployment and release management, ensuring fast resolution of issues impacting SLOs and preventing incidents. Participate in SRE on-call rotation, triaging production issues, and defining appropriate remediation.

What Makes You a Match:

5+ years of experience in SRE / Platform Engineering roles, supporting, scaling, and ensuring the reliability of large-scale end-to-end infrastructures with a proven track record of strategic infrastructure work. Strong expertise and track record of building platforms on top of Kubernetes.  Extensive experience with GCP (or AWS/Azure) and developing distributed systems using cloud services.  Experience with designing large-scale/multi-region architectures. Experience in designing secure systems. Working closely with the Security team to address security posture ( mitigation, compliance etc., ) Exceptional problem-solving skills and a passion for developing robust, scalable, and secure solutions. Excellent communication skills to effectively collaborate with cross-functional teams and interface with customers. Ability to root cause sources of instability in high-traffic, distributed systems and manage complex solutions pragmatically.

We don’t expect you to meet every requirement. If you’re excited by this role, we encourage you to apply.

(some of) The good stuff

Wellbeing - stay healthy with dedicated support and medical cover Work away scheme - you can apply to work  away from your country of residence for up to 90 days in any 12 month period Adaptive Working - allows you to work flexibly, around your lifestyle Equity -all permanently employed GeeCees receive equity so we can share in the success we achieve together Parental leave -to suit everyone embarking on life's great adventure Time off - generous holiday allowance, + 3 annual volunteer days, + 4 annual business-wide wellness days (‘GC Fridays’)