Site Reliability Engineer

3 weeks ago


Cambridge England, United Kingdom SoCode Recruitment Full time
As a Site Reliability Engineer, you will help us achieve our goals by continuously improving our SaaS offering’s features and robustness. You will participate in designing, developing, deploying, monitoring, supporting, documenting, and troubleshooting our SaaS solution.

This is an exciting opportunity to collaborate closely with the Cloud Operations team, the wider organization, and external vendors and customers.
This is a hybrid role based in our Cambridge or London office, so you will ideally be comfortable coming into the office once or twice a week. If you’re interested in the role but require more flexibility, please speak to us.
Key Responsibilities:
Deploying, maintaining, monitoring, and upgrading production deployments of our SaaS solutions
Building software and systems to manage platform infrastructure and applications
Continually evaluating and improving our technology and processes to increase quality, decrease costs, and improve time-to-market
Periodically testing the service with predictable and unpredictable failures
Providing 2nd-line operational support for our SaaS customers
Gathering data and generating reports on the service performance
Developing and documenting internal processes
Working with engineering/data science to drive and develop new capabilities
Providing out-of-hours support for critical service issues as part of our on-call engineer rota
Preferred Skills/Experience:
While not all are essential, ideally you will have experience with the following:
Administering cloud infrastructure or developing cloud applications (preferably in AWS)
Configuration management, including Infrastructure as Code
Linux, shell-scripting, and command-line tools
Programming in one or more high-level programming languages (e.g. Python)
Networking (e.g. DNS, routing, firewalls)
Source-control management (e.g. Git)
Continuous Integration / Continuous Deployment (CI/CD)
Monitoring, metrics, and alerting
Containerization (e.g. Docker)
Administering, developing applications for, or deploying applications to Kubernetes
Using or developing applications with service mesh (e.g. Istio)
Object-oriented programming and design
Operating production-grade services
Providing technical support
Building serverless or cloud-native applications
Writing technical documentation
Developing processes and procedures
Securing applications, services, and data (e.g. authentication, authorization, encryption, and TLS)
Experience with any of the following: Terraform, SaltStack, MongoDB, Elasticsearch, Kafka, Prometheus, Grafana, HashiCorp Vault
We are looking for candidates who are passionate about technology, keen to continuously learn, and excited to contribute to a dynamic team environment. If you have the required skills and are looking for a challenging and rewarding role, we encourage you to apply.

  • Cambridge, Cambridgeshire, United Kingdom Adecco Full time

    Site Reliability Engineer - 3rd Level, Support, Degree, Cloud, Python, AWS, £competitive + benefits, hybrid / Cambridge. My client is one of the most innovative software houses in the UK. With a bunch of dazzling awards behind them, they continue to take artificial intelligence to a whole new level. We have an impressive opening for a Site Reliability...


  • London, England, United Kingdom Qurated Network Full time

    Job Description: Site Engineering Manager | Cross-Border Payment Fintech. We are working with the leading cross-border payments provider that went through an IPO last year and is now completing an extensive digital transformation. They are looking for a Site Reliability...


  • Cambridge, Cambridgeshire, United Kingdom L&G Recruitment Full time

    Join Our Team as a Site Reliability Engineer (SRE)! SRE Engineer Responsibilities: - **Alerting and Monitoring Tools:** The SRE Engineer needs to be familiar with tools such as Splunk, LogDNA, Grafana, and AWS CloudWatch. - **CI/CD Tools:** Should have experience with CI/CD tools like TeamCity, Jenkins, IBM Tool Chain, etc. - **APM and Observability Tools:**...


  • England, United Kingdom ManpowerGroup Full time

    Job Title: Site Reliability Engineer. Location: Hybrid with onsite requirements in London as and when required. Contract Length: Six Months. Role Summary: Our client has chosen to do something incredible. They are totally transforming their business and building our future on smoke-free products that are a better choice than continued smoking. Ultimately...


  • England, United Kingdom Loftware Full time

    A career at Loftware is more than just a job – it’s an opportunity to help shape the supply chain of the future. Loftware is expanding its worldwide 24x7 Cloud Operations Team and we are looking for a technically motivated, English-speaking Cloud Operations Site Reliability Engineer with strong cloud-based Linux and Windows knowledge. The Cloud...


  • England, United Kingdom GCS Ltd Full time

    Security Site Reliability Engineer. Location: West Sussex - Hybrid, 2 days on-site. Length of contract: 12 Months. Inside IR35 - (Apply online only) pd. This is an opportunity for a Security Site Reliability Engineer to join a newly formed team where you will support the organisation in maintaining the reliability of their applications. You will influence...


  • South West England, United Kingdom Twinstream Limited Full time

    SITE RELIABILITY ENGINEER / BRISTOL / UP TO £85K & GREAT BENEFITS. Are you an experienced Site Reliability Engineer looking for an exciting new challenge? If so, we have the perfect opportunity for you. Excellent pay and extensive benefits package. In 2019, our founders were working as engineers solving complex cross domain problems in defence and security...

