Cloud Operations Site Reliability Engineer

3 weeks ago


London, United Kingdom Loftware Full time

A career at Loftware is more than just a job – it’s an opportunity to help shape the supply chain of the future.


About the role:

Loftware is expanding its worldwide 24x7 Cloud Operations Team and we are looking for a technically motivated English speaking Cloud Operations Site Reliability Engineer with a strong cloud-based Linux and Windows knowledge. The Cloud Operations Site Reliability Engineer will be hands-on and involved with building, maintaining, and troubleshooting customer environments for mission-critical application use across the range of cloud platforms used by Loftware, including AWS and Azure. The Cloud Operations Site Reliability Engineer is someone that is a team player with the desire and passion for modern technology and keen to take on large-scale responsibility for the cloud environment.


The Cloud Operations Site Reliability Engineer will work with the rest of the Cloud Operations team and alongside QA and Development to continually improve automated infrastructure and application deployment, to build and maintain reliable cloud infrastructure and services and to manage the highly available and scalable solutions that Loftware customers rely on.


This is an excellent opportunity to be part of a team helping to evolve our solutions for different cloud platforms as well as expand your skills in the cloud.


Key Roles & Responsibilities:

  • Help continue to improve monitoring systems in AWS, Azure, and our other cloud environments to track the health and performance of cloud-based applications and infrastructure. Develop cloud-based alerts to proactively identify and address issues before they impact users.
  • Develop and maintain automation tools to streamline operational tasks with Terraform and Ansible
  • Implement security best practices and compliance standards for our AWS, Azure, and other cloud environments. Continuously assess and mitigate security risks and vulnerabilities. Create, maintain, and execute disaster recovery plans and backup strategies to ensure data and service continuity.
  • Collaborate with software engineers to improve the reliability and resilience of applications through code and architecture changes and help identify performance bottlenecks to optimize applications and infrastructure.
  • Help define and configure cloud-based networking to customer devices and data systems that are sat outside of our cloud environments (VPN, direct connect, transit gateways)
  • Respond to and resolve incidents quickly to minimize service disruptions and conduct post-incident analysis to identify the root causes and prevent similar issues in the future.
  • Participate in an on-call rotation to address critical incidents outside of regular business hours to provide on-call support.


Required Qualifications:

  • Cloud Platform: AWS and/or Azure
  • OS: Linux and/or Windows


Preferred Experience:

  • Database: PostgreSQL Microsoft SQL Server
  • Scripting: Python, Java, Bash, .NET/C#, Powershell
  • IAC and Automation: Terraform, Terragrunt,Ansible, Rundeck, Jenkins
  • Cloud networking concepts: VPN, direct connect, transit gateways
  • Container Technologies: Docker, Kubernetes
  • Cloud-native technologies: RDS, Microservices, Serverless computing


Why join us?

Working for the undisputed global leader in a business-critical industry offers unparalleled possibilities.

  • Our team is made up of the most talented, curious, and inspiring people in their fields, each bringing something unique to the table.
  • We use the power of the global team.
  • We set you up for success. We offer comprehensive training to all employees and place an emphasis on employee development.


  • London, United Kingdom Loftware Full time

    A career at Loftware is more than just a job –it’s an opportunity to help shape the supply chain of the future.About the role:nLoftware is expanding its worldwide 24x7 Cloud Operations Team and we are looking for a technically motivated English speaking Cloud Operations Site Reliability Engineer with a strong cloud-based Linux and Windows knowledge. The...


  • London, United Kingdom Marsh McLennan Companies Full time

    Description: Mercer IT Systems Engineering is seeking candidates for an experienced, Site Reliability Engineering Manager for AWS Cloud , based in our London office:   We have ambitious and exciting plans to expand further into AWS, Here, you will have the opportunity to share your depth of technical AWS expertise with our great global SRE Cloud...


  • London, United Kingdom McGregor Boyall Full time

    **GCP, Azure, AWS, CI/CD, Python, Groovy, Bash, SDLC, IaC, Security, Networking, SRE** **Google Cloud Site Reliability Engineer** **£50,000 - £65,000 + benefits** **Hybrid (1 day a week at nearest hub: London, Bristol, Manchester, Birmingham, Leeds, Edinburgh)** **The company** A leading financial institute **The responsibilities**: - Responsible...


  • London, Greater London, United Kingdom MMC Corporate Full time

    Mercer IT Systems Engineering is seeking candidates for an experienced, Site Reliability Engineering Manager for AWS Cloud, based in our London office: We have ambitious and exciting plans to expand further into AWS,Here, you will have the opportunity to share your depth of technical AWS expertise with our great global SRE Cloud Engineering team plus wider...


  • London, United Kingdom N Consulting Ltd Full time

    Job title: Site Reliability EngineerWork Mode: 3 days office MandatoryLocation: 5 Broadgate, London EC2M 2QS, United KingdomContract Duration: 12 monthsWe’re looking for a Site Reliability Engineer to:· determine the reliability of our digital products, technology services, and the infrastructure that underpins them· minimize the risk and impact of...


  • London, United Kingdom MMC Corporate Full time

    Mercer IT Systems Engineering is seeking candidates for an experienced, Site Reliability Engineering Manager for AWS Cloud, based in our London office: We have ambitious and exciting plans to expand further into AWS,Here, you will have the opportunity to share your depth of technical AWS expertise with our great global SRE Cloud Engineering team plus...


  • London, United Kingdom Qurated Network Full time

    Job Description Site Engineering Manager | Cross-Border Payment Fintech We are working with the leading cross-border payments provider that went through an IPO last year and is now completing an extensive digital transformation. They are looking for a Site Reliability Engineer to join their greenfield team. You will get the opportunity to work in...


  • London, United Kingdom Qurated Network Full time

    Site Engineering Manager | Cross-Border Payment FintechMake your application after reading the following skill and qualification requirements for this position.We are working with the leading cross-border payments provider that went through an IPO last year and is now completing an extensive digital transformation. They are looking for a Site Reliability...


  • London, United Kingdom Qurated Network Full time

    Job Description Site Engineering Manager | Cross-Border Payment Fintech We are working with the leading cross-border payments provider that went through an IPO last year and is now completing an extensive digital transformation. They are looking for a Site Reliability Engineer to join their greenfield team. You will get the opportunity to work in their...


  • London, United Kingdom Qurated Network Full time

    Job Description Site Engineering Manager | Cross-Border Payment Fintech We are working with the leading cross-border payments provider that went through an IPO last year and is now completing an extensive digital transformation. They are looking for a Site Reliability Engineer to join their greenfield team. You will get the opportunity to work in their...


  • London, United Kingdom Qurated Network Full time

    Site Engineering Manager | Cross-Border Payment FintechMake your application after reading the following skill and qualification requirements for this position.We are working with the leading cross-border payments provider that went through an IPO last year and is now completing an extensive digital transformation. They are looking for a Site Reliability...


  • London, United Kingdom Qurated Network Full time

    Site Engineering Manager | Cross-Border Payment Fintech We are working with the leading cross-border payments provider that went through an IPO last year and is now completing an extensive digital transformation. They are looking for a Site Reliability Engineer to join their greenfield team. You will get the opportunity to work in their small, fast-paced...


  • London, United Kingdom Qurated Network Full time

    Site Engineering Manager | Cross-Border Payment Fintech We are working with the leading cross-border payments provider that went through an IPO last year and is now completing an extensive digital transformation. They are looking for a Site Reliability Engineer to join their greenfield team. You will get the opportunity to work in their small, fast-paced...


  • London, United Kingdom Prism Digital Full time

    **Site Reliability Engineer | GCP OR AWS & Kubernetes | SaaS HealthTech** The local headcount currently is 35 in Ireland and 45 in the UK (remote sys admins, tech engineers, field engineers, project managers, programme managers and sales) and expanding the UK office - feels like a start-up with start-up good energy. Our client is around 50% through their...


  • london, United Kingdom Qurated Network Full time

    Site Engineering Manager | Cross-Border Payment FintechWe are working with the leading cross-border payments provider that went through an IPO last year and is now completing an extensive digital transformation. They are looking for a Site Reliability Engineer to join their greenfield team. You will get the opportunity to work in their small, fast-paced...


  • London, United Kingdom Qurated Network Full time

    Site Engineering Manager | Cross-Border Payment FintechWe are working with the leading cross-border payments provider that went through an IPO last year and is now completing an extensive digital transformation. They are looking for a Site Reliability Engineer to join their greenfield team. You will get the opportunity to work in their small, fast-paced...


  • London, United Kingdom Qurated Network Full time

    Job Description Site Engineering Manager | Cross-Border Payment Fintech We are working with the leading cross-border payments provider that went through an IPO last year and is now completing an extensive digital transformation. They are looking for a Site Reliability Engineer to join their greenfield team. You will get the opportunity to work in their...


  • London, United Kingdom Qurated Network Full time

    Site Engineering Manager | Cross-Border Payment FintechWe are working with the leading cross-border payments provider that went through an IPO last year and is now completing an extensive digital transformation. They are looking for a Site Reliability Engineer to join their greenfield team. You will get the opportunity to work in their small, fast-paced...


  • london, United Kingdom ByteHire Full time

    Reference: BH-298cJob Role: Senior Site Reliability EngineerJob Type: ContractIR35: Inside IR35Day Rate: £600/DayContract Duration: 6 monthsWorking Hours: 5 days per weekRemote Working: 4 days remote working. 1 day on-site in LondonLocation: Hybrid Remote/London (UK only)Role Overview:We’re looking for a Senior Site Reliability Engineer with deep Google...


  • London, United Kingdom ByteHire Full time

    Reference: BH-298cJob Role: Senior Site Reliability EngineerJob Type: ContractIR35: Inside IR35Day Rate: £600/DayContract Duration: 6 monthsWorking Hours: 5 days per weekRemote Working: 4 days remote working. 1 day on-site in LondonLocation: Hybrid Remote/London (UK only)Role Overview:We’re looking for a Senior Site Reliability Engineer with deep Google...