Data & Reporting SRE

1 week ago


London, Greater London, United Kingdom | ZILO | Full time | £60,000 - £120,000 per year

About:

Step forward into the future of technology with ZILO.

We're here to redefine what's possible in technology. While we're trusted by the global Transfer Agency sector, our technology is truly flexible and designed to transform any business at scale. We've created a unified platform that adapts to diverse needs, offering the scalability and reliability legacy systems simply can't match.

At ZILO, our DNA is built on Character, Creativity, and Craftsmanship. We face every challenge with integrity, explore new ideas with a curious mind, and set a high standard in every detail.

We are a team of dedicated professionals where everyone, regardless of their role, drives our progress and creates real impact. If you're ready to shape the future, let's talk.

Requirements

We are seeking an experienced Site Reliability Engineer (SRE) with deep subject-matter expertise in data processing and reporting. In this role, you will own the reliability, performance, and operational excellence of our real-time and batch data pipelines built on AWS, Apache Flink, Kafka, and Python. You'll act as the first line of defense for data-related incidents, rapidly diagnose root causes, and implement resilient solutions that keep critical reporting systems up and running.

Incident Management & Triage

  • Serve as on-call escalation for data pipeline incidents, including real-time stream failures and batch job errors.
  • Rapidly analyze logs, metrics, and trace data to pinpoint failure points across AWS, Flink, Kafka, and Python layers.
  • Lead post-incident reviews: identify root causes, document findings, and drive corrective actions to closure.

Reliability & Monitoring

  • Design, implement, and maintain robust observability for data pipelines: dashboards, alerts, and distributed tracing.
  • Define SLOs/SLIs for data freshness, throughput, and error rates; continuously monitor and optimize.
  • Automate capacity planning, scaling policies, and disaster-recovery drills for stream and batch environments.

Architecture & Automation

  • Collaborate with data engineering and product teams to architect scalable, fault-tolerant pipelines using AWS services (e.g., Step Functions, EMR, Lambda, Redshift) integrated with Apache Flink and Kafka.
  • Troubleshoot and maintain Python-based applications.
  • Harden CI/CD for data jobs: implement automated testing of data schemas, versioned Flink jobs, and migration scripts.

Performance Optimization

  • Profile and tune streaming jobs: optimize checkpoint intervals, state backends, and parallelism settings in Flink.
  • Analyze Kafka cluster health: tune broker configurations, partition strategies, and retention policies to meet SLAs.
  • Leverage Python profiling and vectorized libraries to streamline batch analytics and report generation.

Collaboration & Knowledge Sharing

  • Act as the subject-matter expert (SME) for the data and reporting stack: mentor peers and lead brown-bag sessions on best practices.
  • Contribute to runbooks, design docs, and on-call playbooks detailing common failure modes and recovery steps.
  • Work cross-functionally with DevOps, Security, and Product teams to align reliability goals and incident response workflows.

Benefits

  • Enhanced leave: 38 days, inclusive of 8 UK public holidays
  • Private Health Care including family cover  
  • Life Assurance – 5x salary  
  • Flexible working: work from home and/or in our London office
  • Employee Assistance Program  
  • Company Pension (Salary Sacrifice options available)
  • Access to training and development  
  • Buy and Sell holiday scheme 
  • The opportunity for "work from anywhere/global mobility"

