Reliability Engineer

3 weeks ago


London, Greater London, United Kingdom Butterworths Limited Company Full time

About the Role

You will be a key member of our team responsible for ensuring the reliability and performance of our systems.

As a Senior Site Reliability Engineer at Butterworths Limited Company, you will play a critical role in designing, implementing, and maintaining monitoring tools and processes to ensure continuous tracking of system performance, availability, and security.

Responsibilities

  • Design and implement monitoring tools and processes to ensure continuous tracking of system performance, availability, and security.
  • Proactively identify potential issues through trend analysis and monitoring data, and take corrective actions before they impact customers.
  • Oversee the monitoring and proactive management of product performance, availability, and reliability.
  • Oversee the smooth operation and availability of live systems, ensuring minimal downtime and prompt resolution of incidents.
  • Lead the incident management process, including identification, troubleshooting, resolution, and post-incident analysis.
  • Collaborate with product, development, infrastructure, quality engineering, and customer success teams to ensure seamless deployment and support of new features and updates.

Requirements

  • Demonstrate experience in software development, with a strong background in supporting and maintaining live products.
  • Demonstrate experience in site reliability engineering, live production support, or a related role.
  • Demonstrate experience managing and supporting live systems in a production environment.
  • Show experience working with multiple cloud platforms (e.g., AWS, Azure).
  • Demonstrate experience working with monitoring tools (e.g., Datadog, Splunk).
  • Demonstrate scripting skills (e.g., Python, JS) and familiarity with automation tools.
  • Show understanding and experience with incident management, monitoring tools, IT service management frameworks, and automation processes.

Work in a way that works for you

We promote a healthy work/life balance across the organisation and offer an appealing working environment for our people. With numerous wellbeing initiatives, shared parental leave, study assistance, and sabbaticals, we will help you meet your immediate responsibilities and your long-term goals.

  • Flexible working hours: adjust the times you work during the day so you can fit everything in and work when you are most productive.

Working for you

We know that your wellbeing and happiness are key to a long and successful career. These are some of the benefits we are delighted to offer:

  • Generous holiday allowance with the option to buy additional days.
  • Health screening, eye care vouchers, and private medical benefits.
  • Wellbeing programmes.
  • Life assurance.
  • Access to a competitive contributory pension scheme.
  • Save As You Earn share option scheme.
  • Travel season ticket loan.
  • Electric vehicle scheme.
  • Optional dental insurance.
  • Maternity, paternity, and shared parental leave.
  • Employee Assistance Programme.
  • Access to emergency care for both the elderly and children.
  • RECARES days, giving you time to support the charities and causes that matter to you.
  • Access to employee resource groups with dedicated time to volunteer.
  • Access to extensive learning and development resources.
  • Access to an employee discount scheme via Perks at Work.


  • London, Greater London, United Kingdom AVT Reliability Ltd Full time

    About AVT Reliability Ltd: We are a leading company in the field of asset integrity and reliability. Our team is passionate about delivering high-quality services to our clients. Job Summary: This is an exciting opportunity for a talented engineering graduate to join our Asset Integrity Division as a specialist. You will be responsible for supporting a diverse...


  • London, Greater London, United Kingdom loveholidays Full time

    About the Role: We are seeking a highly skilled Reliability Engineer to join our team at LoveHolidays. As a key member of our infrastructure team, you will be responsible for ensuring the reliability and performance of our systems, which handle millions of users and thousands of requests per second. Our runtime architecture is Service Based and hosted on cloud...


  • London, Greater London, United Kingdom Victrex Full time

    Senior Reliability Engineer Role. About the Job: We are seeking an experienced Senior Reliability Engineer to lead our asset management strategy and drive improvements in plant performance across all UK plants. Job Summary: The successful candidate will be responsible for developing and implementing systems and procedures that enhance safety, asset availability,...


  • London, Greater London, United Kingdom Florida Crystals ASR Group Full time

    Job Overview: As a Maintenance Engineer at Tate & Lyle Sugars, you will be responsible for maintaining the efficiency and reliability of our plant and equipment. Responsibilities: Perform routine maintenance tasks to prevent equipment failure and downtime. Conduct root cause analysis to identify and resolve equipment issues. Develop and...


  • London, Greater London, United Kingdom Viasat Full time

    Job Title: Digital Reliability Engineer. Job Summary: We are seeking a Digital Reliability Engineer to join our platform team at Viasat. The successful candidate will be responsible for ensuring the reliability and resilience of our cloud-based systems. Lead the design and implementation of cloud-based solutions to enhance platform reliability and...


  • London, Greater London, United Kingdom 83zero Full time

    Job Description: We are seeking a skilled Cloud Reliability Engineer to join our team at 83zero, a global leader in digital services. As a Cloud Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and efficiency of our clients' platforms. Your Responsibilities: Ensure the reliability, scalability, and efficiency of clients'...


  • London, Greater London, United Kingdom Cutover Full time

    Cutover is a pioneering enterprise that has developed the world's first work orchestration and observability platform, enabling seamless collaboration between humans and machines. We're looking for a skilled Reliability Engineer to join our team and ensure the robustness and performance of our Cutover Enterprise platform. The platform features a ReactJS...


  • London, Greater London, United Kingdom Fourier Full time

    Key Responsibilities: As a Site Reliability Engineer at Fourier, you will be responsible for designing and implementing tools to enhance the reliability and resilience of our production systems. This includes investigating failures, improving system performance, and automating manual processes. Required Skills: Excellent Python scripting skills. Experience with...


  • London, Greater London, United Kingdom GoCardless Full time

    About the Role: We are seeking an experienced Cloud Reliability Engineer to join our distributed team at GoCardless. As a key member of our engineering team, you will be responsible for designing and implementing scalable and reliable infrastructure solutions. With a strong interest in infrastructure management and site reliability engineering, you will...


  • London, Greater London, United Kingdom Selby Jennings Full time

    About Selby Jennings: We're a leading global financial services firm where technologists and investment professionals collaborate to drive innovation and operational excellence. About the Role: As a Site Reliability Engineer, you'll apply your expertise in software and systems engineering to design, build, and maintain our robust infrastructure. You'll reduce...


  • London, Greater London, United Kingdom Amazon UK Services Ltd. Full time

    Job Title: Reliability Engineer Specialist. We are seeking a highly skilled Reliability Engineer Specialist to join our team at Amazon UK Services Ltd. in the dynamic field of reliability engineering and spare parts management. In this role, you will be responsible for developing and executing a comprehensive strategy for spare parts inventory management,...


  • London, Greater London, United Kingdom WorksHub Full time

    Work Details. Job Type: Full-time. Estimated Salary: $120,000 - $180,000 per year. A reliable engineer is expected to maintain and improve the reliability of our Cutover platform, leveraging expertise in cloud infrastructure and automation tools. If you're passionate about crafting scalable and efficient systems, this is the perfect opportunity for you.


  • London, Greater London, United Kingdom Apple Inc. Full time

    Reliability Engineer - Apple Inc. At Apple, we're passionate about creating innovative products and services that make a difference in people's lives. We're seeking a highly skilled Reliability Engineer to join our team and contribute to the development of our cutting-edge technology. Key Responsibilities: Design and implement reliable systems and processes to...


  • London, Greater London, United Kingdom Palantir Technologies Full time

    About Palantir Technologies: We're a world-changing company that builds leading software for data-driven decisions and operations. Our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more. As a Reliability Engineering Specialist, you'll embed with our engineering and business teams to...


  • London, Greater London, United Kingdom BenevolentAI Full time

    BenevolentAI. Estimated Salary: £110,000 - £140,000 per annum. Company Overview: BenevolentAI is a leading artificial intelligence company that uses machine learning to accelerate scientific discovery. We are seeking a highly skilled Senior Site Reliability Engineer to join our team and help us maintain the reliability and scalability of our cloud...


  • London, Greater London, United Kingdom loveholidays Full time

    We are a rapidly growing online travel agency with technology at the heart of our success. In 2022, we sent millions of people on their dream holiday. With a million visitors a day, our 100+ services handle 8k requests per second, while maintaining p95 search latency of 150ms. You will contribute to building reliable, performant, auto-scalable, and highly...


  • London, Greater London, United Kingdom J Bandy Consulting Full time

    Job Summary: J Bandy Consulting is seeking an experienced Site Reliability Engineer to join our team. The ideal candidate will have a strong background in software engineering and a passion for building scalable and reliable systems. Key Responsibilities: Develop and implement automation tools to improve the efficiency of our systems. Collaborate with...


  • London, Greater London, United Kingdom Amazon TA Full time

    About the Role: Our Reliability Maintenance Engineering team is responsible for ensuring the optimal performance of our equipment. As an RME Technician, you will play a critical role in maintaining and repairing our machinery, ensuring minimal downtime and maximum efficiency. Key Responsibilities: Perform proactive and preventative maintenance tasks on a wide...

  • Reliability Engineer

    4 weeks ago


    London, Greater London, United Kingdom Technip Energies Full time

    About Technip Energies: We are a global engineering and technology company accelerating the energy transition. Our Genesis business unit provides impartial consulting services to clients in traditional hydrocarbon and energy industries. This role will undertake reliability engineering activities under the direction of a Lead RAM Engineer for conceptual...


  • London, Greater London, United Kingdom Wayve Full time

    Our team at Wayve is committed to creating a diverse, fair, and respectful culture that welcomes everyone. As a Site Reliability Engineer, you'll play a crucial role in establishing Operational Excellence and best practices for our AI-driven autonomous vehicles. We're seeking someone with over 8 years of experience in Site Reliability Engineering or a...