Cloud Operations Site Reliability Engineer

2 weeks ago


London, Greater London, United Kingdom Loftware Full time

A career at Loftware is more than just a job – it's an opportunity to help shape the supply chain of the future.

Loftware is expanding its worldwide 24x7 Cloud Operations Team and we are looking for a technically motivated English speaking Cloud Operations Site Reliability Engineer with a strong cloud-based Linux and Windows knowledge.

The Cloud Operations Site Reliability Engineer will be hands-on and involved with building, maintaining, and troubleshooting customer environments for mission-critical application use across the range of cloud platforms used by Loftware, including AWS and Azure.

The Cloud Operations Site Reliability Engineer will work with the rest of the Cloud Operations team and alongside QA and Development to continually improve automated infrastructure and application deployment, to build and maintain reliable cloud infrastructure and services and to manage the highly available and scalable solutions that Loftware customers rely on.

This is an excellent opportunity to be part of a team helping to evolve our solutions for different cloud platforms as well as expand your skills in the cloud.

Help continue to improve monitoring systems in AWS, Azure, and our other cloud environments to track the health and performance of cloud-based applications and infrastructure.

Implement security best practices and compliance standards for our AWS, Azure, and other cloud environments. Continuously assess and mitigate security risks and vulnerabilities. Create, maintain, and execute disaster recovery plans and backup strategies to ensure data and service continuity.

Collaborate with software engineers to improve the reliability and resilience of applications through code and architecture changes and help identify performance bottlenecks to optimize applications and infrastructure.

Help define and configure cloud-based networking to customer devices and data systems that are sat outside of our cloud environments (VPN, direct connect, transit gateways)

Respond to and resolve incidents quickly to minimize service disruptions and conduct post-incident analysis to identify the root causes and prevent similar issues in the future.


Cloud Platform :
AWS and/or Azure

OS :
Linux and/or Windows

Database :
PostgreSQL Microsoft SQL Server
Python, Java, Bash, .NET/C#, Powershell

Cloud networking concepts :

Our team is made up of the most talented, curious, and inspiring people in their fields, each bringing something unique to the table.

We offer comprehensive training to all employees and place an emphasis on employee development.

  • London, Greater London, United Kingdom Bayside Solutions Full time £91,400 - £108,000

    Site Reliability Engineer Contract Location: London, England - Hybrid Role We seek a Site Reliability Engineer to join our team and play a crucial role in ensuring our applications and services' reliability, availability, and performance. This role requires a strong background in application support, monitoring, and cloud technologies, focusing on AWS,...


  • London, Greater London, United Kingdom MMC Corporate Full time

    Mercer IT Systems Engineering is seeking candidates for an experienced, Site Reliability Engineering Manager for AWS Cloud, based in our London office: We have ambitious and exciting plans to expand further into AWS,Here, you will have the opportunity to share your depth of technical AWS expertise with our great global SRE Cloud Engineering team plus wider...


  • London, Greater London, United Kingdom MMC Corporate Full time

    Mercer IT Systems Engineering is seeking candidates for an experienced, Site Reliability Engineering Manager for AWS Cloud, based in our London office: We have ambitious and exciting plans to expand further into AWS,Here, you will have the opportunity to share your depth of technical AWS expertise with our great global SRE Cloud Engineering team plus wider...


  • London, Greater London, United Kingdom Qurated Network Full time

    Job Description Site Engineering Manager | Cross-Border Payment Fintech We are working with the leading cross-border payments provider that went through an IPO last year and is now completing an extensive digital transformation. They are looking for a Site Reliability Engineer to join their greenfield team. You will get the opportunity to work in their...


  • London, Greater London, United Kingdom MetroBank Full time

    Cloud Site Reliability Engineer (SRE) Team IT, IT & Change Location Holborn Office County Central London Ref # 21449 Closing Date 21-Jun-2024 We have been awarded the "Most Loved Workplace" At Metro Bank, people come first - our culture is all about bringing the best out in our colleagues, and making sure everyone feels valued, respected, seen and included....


  • London, Greater London, United Kingdom ByteHire Full time

    Reference: BH-298cJob Role: Senior Site Reliability EngineerJob Type: ContractIR35: Inside IR35Day Rate: £600/DayContract Duration: 6 monthsWorking Hours: 5 days per weekRemote Working: 4 days remote working. 1 day on-site in LondonLocation: Hybrid Remote/London (UK only)Role Overview:We're looking for a Senior Site Reliability Engineer with deep Google...


  • London, Greater London, United Kingdom Palantir Technologies Full time

    Palantir builds the world's leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more. We're looking for Site Reliability Engineers who can help us build, operate, and...


  • London, Greater London, United Kingdom Prism Digital Full time

    Site Reliability Engineer | GCP OR AWS & Kubernetes | SaaS HealthTechThe local headcount currently is 35 in Ireland and 45 in the UK (remote sys admins, tech engineers, field engineers, project managers, programme managers and sales) and expanding the UK office - feels like a start-up with start-up good energy.Our client is around 50% through their GCP...


  • London, Greater London, United Kingdom Xcede Full time

    Site Reliability Engineering Manager is required by a global financial technology organisation. In this newly created role, the Site Reliability Engineering Manager will be responsible for deploying and managing a suite of enterprise-wide tools used for provisioning, automation, and monitoring as well as technical team leadership. Site Reliability...


  • London, Greater London, United Kingdom Kaluza Full time

    Location: Bristol, London, Edinburgh, (Including Hybrid) Kaluza wants to power a world where net-zero is within everyone's reach by building a platform that will accelerate a sustainable, affordable and resilient energy transition. Since launching in 2019, Kaluza's technology has empowered some of the biggest energy suppliers to better serve millions of...


  • London, Greater London, United Kingdom Cameron Connect Ltd Full time

    Join Our Clients Dynamic Mortgages Team at the Heart of Technological Innovation Are you an experienced Java or C# engineer with a passion for building and maintaining reliable, high-performing systems? Do you thrive in roles where you can make a significant impact on the availability, performance, and efficiency of critical services? These opportunities...


  • London, Greater London, United Kingdom LoadSpring Solutions Full time

    About the Site Reliability Engineer position: Join our team at LoadSpring and be part of our exciting journey into predictive transformation, fueled by our innovative LoadSpring Cloud Platform and data capabilities from LoadSpring INSIGHTS. We provide a secure hosting platform for critical project applications in industries with complex projects and high...


  • London, Greater London, United Kingdom Infogain Full time

    Job Title: Site Reliability Engineer Location: London We are seeking a Site Reliability Engineer to join our team. Minimum 5 years of experience as Developer/SysAdmin/DevOps engineer Experience with several open-source tools (Ansible, Jenkins ,Git, etc) Strong expertise in Jenkins Experience with Programming languages such as Java, C# and Scripting...


  • London, Greater London, United Kingdom Tec Partners Full time

    Job Title: Site Reliability Engineer (Software Dev Background) Type: Permanent Location: Fully remote Salary: 55-65K Our client are growing their team and are looking for a Site Reliability Engineer - (ideally from a software development / software engineering background)to contribute to the development and maintenance of our cloud infrastructure, help...


  • London, Greater London, United Kingdom Blockchain 121 Full time

    Site Reliability Engineer - Fully Remote - 6 Figure USD Salary + EquityWe are looking for an experienced Site Reliability Engineer to join a team of experts on a permanent full time basis. This role requires a mix of blockchain knowledge and experience in maintaining the reliability, scalability, and performance of complex systems.This is a unique chance to...


  • London, Greater London, United Kingdom Metro Bank Full time

    What you will do: • Drive system reliability and automation within non-production and production environments and resolve complex issues that L1 support are not able to fix and can triage the Service end to end • Eliminating repetitive manual processes using automation and use best practices, such as automation, to run and support stable, secure,...


  • London, Greater London, United Kingdom Speechmatics Limited Full time

    Speechmaticsare seeking a Site Reliability Engineer (SRE) whose focuswill be improving the reliability of our products, systems and infrastructure. You will tackle technical challenges related to distributed systems, networking, low latency and machine learning applications, GPU inference and training infrastructure, on premises datacentreand cloud...


  • London, Greater London, United Kingdom H&R Talent Full time

    A leading financial services company located in Central London is seeking a Site Reliability Engineer to join their growing Infrastructure team on a permanent basis with Hybrid working. Expand and fortify the IT architecture for optimal availability. Implement continuous integration and deployment practices for seamless development workflows. Engage in...


  • London, Greater London, United Kingdom Xcede Full time

    Site Reliability Engineering Manager is required by a global financial technology organisation. In this newly created role, the Site Reliability Engineering Manager will be responsible for deploying and managing a suite of enterprise-wide tools used for provisioning, automation, and monitoring as well as technical team leadership. Site Reliability...


  • London, Greater London, United Kingdom Sartre Group Full time

    A global high-frequency trading firm is looking for an experienced Site Reliability Engineer to join their Systems Infrastructure team, specializing in Linux, Python, and AWS.In this position, you will be instrumental in delivering scalable, secure, and reliable solutions for their AWS Linux digital assets trading platform.Develop and implement technical...