Cloud Operations Site Reliability Engineer

3 weeks ago


England, United Kingdom Loftware Full time

A career at Loftware is more than just a job – it’s an opportunity to help shape the supply chain of the future.


About the role:

Loftware is expanding its worldwide 24x7 Cloud Operations Team and we are looking for a technically motivated English speaking Cloud Operations Site Reliability Engineer with a strong cloud-based Linux and Windows knowledge. The Cloud Operations Site Reliability Engineer will be hands-on and involved with building, maintaining, and troubleshooting customer environments for mission-critical application use across the range of cloud platforms used by Loftware, including AWS and Azure. The Cloud Operations Site Reliability Engineer is someone that is a team player with the desire and passion for modern technology and keen to take on large-scale responsibility for the cloud environment.


The Cloud Operations Site Reliability Engineer will work with the rest of the Cloud Operations team and alongside QA and Development to continually improve automated infrastructure and application deployment, to build and maintain reliable cloud infrastructure and services and to manage the highly available and scalable solutions that Loftware customers rely on.


This is an excellent opportunity to be part of a team helping to evolve our solutions for different cloud platforms as well as expand your skills in the cloud.


Key Roles & Responsibilities:

  • Help continue to improve monitoring systems in AWS, Azure, and our other cloud environments to track the health and performance of cloud-based applications and infrastructure. Develop cloud-based alerts to proactively identify and address issues before they impact users.
  • Develop and maintain automation tools to streamline operational tasks with Terraform and Ansible
  • Implement security best practices and compliance standards for our AWS, Azure, and other cloud environments. Continuously assess and mitigate security risks and vulnerabilities. Create, maintain, and execute disaster recovery plans and backup strategies to ensure data and service continuity.
  • Collaborate with software engineers to improve the reliability and resilience of applications through code and architecture changes and help identify performance bottlenecks to optimize applications and infrastructure.
  • Help define and configure cloud-based networking to customer devices and data systems that are sat outside of our cloud environments (VPN, direct connect, transit gateways)
  • Respond to and resolve incidents quickly to minimize service disruptions and conduct post-incident analysis to identify the root causes and prevent similar issues in the future.
  • Participate in an on-call rotation to address critical incidents outside of regular business hours to provide on-call support.


Required Qualifications:

  • Cloud Platform: AWS and/or Azure
  • OS: Linux and/or Windows


Preferred Experience:

  • Database: PostgreSQL Microsoft SQL Server
  • Scripting: Python, Java, Bash, .NET/C#, Powershell
  • IAC and Automation: Terraform, Terragrunt,Ansible, Rundeck, Jenkins
  • Cloud networking concepts: VPN, direct connect, transit gateways
  • Container Technologies: Docker, Kubernetes
  • Cloud-native technologies: RDS, Microservices, Serverless computing


Why join us?

Working for the undisputed global leader in a business-critical industry offers unparalleled possibilities.

  • Our team is made up of the most talented, curious, and inspiring people in their fields, each bringing something unique to the table.
  • We use the power of the global team.
  • We set you up for success. We offer comprehensive training to all employees and place an emphasis on employee development.


  • England, United Kingdom Loftware Full time

    A career at Loftware is more than just a job – it’s an opportunity to help shape the supply chain of the future. Loftware is expanding its worldwide 24x7 Cloud Operations Team and we are looking for a technically motivated English speaking Cloud Operations Site Reliability Engineer with a strong cloud-based Linux and Windows knowledge. The Cloud...


  • England, United Kingdom Loftware Full time

    A career at Loftware is more than just a job – it’s an opportunity to help shape the supply chain of the future. Loftware is expanding its worldwide 24x7 Cloud Operations Team and we are looking for a technically motivated English speaking Cloud Operations Site Reliability Engineer with a strong cloud-based Linux and Windows knowledge. The Cloud...


  • england, United Kingdom Loftware Full time

    A career at Loftware is more than just a job – it’s an opportunity to help shape the supply chain of the future.About the role:Loftware is expanding its worldwide 24x7 Cloud Operations Team and we are looking for a technically motivated English speaking Cloud Operations Site Reliability Engineer with a strong cloud-based Linux and Windows knowledge. The...


  • England, United Kingdom Loftware Full time

    A career at Loftware is more than just a job – it’s an opportunity to help shape the supply chain of the future.Read all the information about this opportunity carefully, then use the application button below to send your CV and application.About the role:Loftware is expanding its worldwide 24x7 Cloud Operations Team and we are looking for a technically...


  • England, United Kingdom Loftware Full time

    A career at Loftware is more than just a job – it’s an opportunity to help shape the supply chain of the future. About the role: Loftware is expanding its worldwide 24x7 Cloud Operations Team and we are looking for a technically motivated English speaking Cloud Operations Site Reliability Engineer with a strong cloud-based Linux and Windows...


  • England, United Kingdom Loftware Full time

    A career at Loftware is more than just a job – it’s an opportunity to help shape the supply chain of the future. About the role: Loftware is expanding its worldwide 24x7 Cloud Operations Team and we are looking for a technically motivated English speaking Cloud Operations Site Reliability Engineer with a strong cloud-based Linux and Windows...


  • England, United Kingdom Loftware Full time

    A career at Loftware is more than just a job – it’s an opportunity to help shape the supply chain of the future. About the role: Loftware is expanding its worldwide 24x7 Cloud Operations Team and we are looking for a technically motivated English speaking Cloud Operations Site Reliability Engineer with a strong cloud-based Linux and Windows knowledge....


  • England, United Kingdom WaferWire Cloud Technologies Full time

    We are seeking a highly motivated and experienced Site Reliability Engineer to join our growing team. You will be responsible for ensuring the reliability, performance, and scalability of our production systems. You will play a critical role in ensuring our systems are designed and operated with resiliency and high availability in mind. Project Duration: ...


  • England, United Kingdom NP Group Full time

    Site Reliability Engineer Reference - 17574 Start Date: ASAP Location: London My client is one of the leading absolute return/hedge fund managers, overseeing assets on behalf of institutional investors from around the world, including pension funds, endowments, insurance companies, government agencies, private banks, and fund of funds. Essential...


  • England, United Kingdom ManpowerGroup Full time

    Job Title: Site Reliability Engineer Location: Hybrid with onsite requirements in London as and when required Contract Length: Six Months Role Summary Our client has chosen to do something incredible. They are totally transforming their business and building our future on smoke-free products that are a better choice than continued smoking. Ultimately...


  • england, United Kingdom ManpowerGroup Full time

    Job Title: Site Reliability EngineerLocation: Hybrid with onsite requirements in London as and when requiredContract Length: Six MonthsRole SummaryOur client has chosen to do something incredible. They are totally transforming their business and building our future on smoke-free products that are a better choice than continued smoking. Ultimately we want to...


  • England, United Kingdom ManpowerGroup Full time

    Job Title: Site Reliability EngineerLocation: Hybrid with onsite requirements in London as and when requiredContract Length: Six MonthsRole SummaryOur client has chosen to do something incredible. They are totally transforming their business and building our future on smoke-free products that are a better choice than continued smoking. Ultimately we want to...


  • England, United Kingdom ManpowerGroup Full time

    Job Title: Site Reliability Engineer Location: Hybrid with onsite requirements in London as and when required Contract Length: Six Months Role Summary Our client has chosen to do something incredible. They are totally transforming their business and building our future on smoke-free products that are a better choice than continued smoking. Ultimately we...


  • England, United Kingdom ManpowerGroup Full time

    Job Title: Site Reliability EngineerLocation: Hybrid with onsite requirements in London as and when requiredContract Length: Six MonthsCandidates should take the time to read all the elements of this job advert carefully Please make your application promptly.Role SummaryOur client has chosen to do something incredible. They are totally transforming their...


  • England, United Kingdom ManpowerGroup Full time

    Job Title: Site Reliability Engineer Location: Hybrid with onsite requirements in London as and when required Contract Length: Six Months Role Summary Our client has chosen to do something incredible. They are totally transforming their business and building our future on smoke-free products that are a better choice than continued smoking. Ultimately we...


  • England, United Kingdom ManpowerGroup Full time

    Job Title: Site Reliability Engineer Location: Hybrid with onsite requirements in London as and when required Contract Length: Six Months Role Summary Our client has chosen to do something incredible. They are totally transforming their business and building our future on smoke-free products that are a better choice than continued smoking. Ultimately we want...


  • England, United Kingdom ManpowerGroup Full time

    Job Title: Site Reliability EngineerLocation: Hybrid with onsite requirements in London as and when requiredContract Length: Six MonthsRole SummaryOur client has chosen to do something incredible. They are totally transforming their business and building our future on smoke-free products that are a better choice than continued smoking. Ultimately we want to...


  • England, United Kingdom GCS Ltd Full time

    Security Site Reliability Engineer Location: West Sussex - Hybrid, 2 days on-site Length of contract: 12 Months Inside IR35 - (Apply online only)pd This is an opportunity for a Security Site Reliability Engineer to join a newly formed team where you will support the organisation in maintaining the reliability of their applications. You will influence...


  • England, United Kingdom ManpowerGroup Full time

    Location: Hybrid with onsite requirements in London as and when required Contract Length: Six Months They are totally transforming their business and building our future on smoke-free products that are a better choice than continued smoking. Our team combines software and systems engineering with system administration practices to develop creative...


  • England, United Kingdom ManpowerGroup Full time

    Location: Hybrid with onsite requirements in London as and when required Contract Length: Six Months They are totally transforming their business and building our future on smoke-free products that are a better choice than continued smoking. Our team combines software and systems engineering with system administration practices to develop creative...