Site Reliability Engineer

2 weeks ago


London, United Kingdom Neo4j Inc Full time

The Role:

The Site Reliability Engineering team’s mission is to improve the overall reliability of Neo4j’s DBaaS product: Neo4j Aura. Our product operates at scale and spans all three major cloud providers, with hundreds of Kubernetes clusters running in production.

Until recently, the SRE function at Neo4j Aura achieved this by filling the shoes of a more traditional Ops team. We are in the process of transforming the team and need your help to implement what we believe to be a more authentic form of SRE, by:

  • Educating software engineers and product managers on SRE principles such as SLIs and SLOs
  • Reducing the barrier to effective Ops for the engineering department by building abstractions and automating away toily tasks
  • Applying software engineering to solve operational problems - we believe in writing operators rather than bash scripts
  • Encouraging engineering teams to take ownership of running their code in production

We are looking for people with some of the following skills:

  • Applying SRE practices in the wild: defining SLIs for key software, reducing toil through automation, monitoring applications for success
  • The ability to debug large and complex cloud-based systems
  • Extensive experience in monitoring systems and their performance
  • Experience deploying and working with observability systems such as Prometheus, Grafana, Datadog, and Google Logging (Stackdriver)
  • Extensive experience with deploying and managing applications running on Kubernetes (experience with administering Kubernetes clusters is a plus)
  • Knowledge of Go, Kustomize, and Terraform (some knowledge of Python is also a plus)
  • Production experience with proxy software (e.g., Envoy, NGINX, HAProxy) and networking in general
  • Experience with building CI/CD pipelines - we use GitHub Actions and TeamCity
  • Familiarity working with a variety of Cloud Native projects
  • Experience being on call is a plus
