Senior Site Reliability Engineer
4 weeks ago
Are you interested in making a difference? To work for a tech-for-good company whose reason for being is to help all boards and leadership teams to be a powerful driver of performance and a force for good? Board Intelligence is on a mission to bring kindness and success together and to drive companies to think about what matters. We work with over 30,000 Chairs, CEOs, and board members to embed the discipline of focus into their organisations, and we’re helping a new board every day to focus on what matters. We are in it for the long term, come join us on this journey.
As a Senior Site Reliability Engineer (SRE), you'll be joining a team whose mission is to ensure the availability, performance, security and reliability of our platform and core services, ensuring that they meet the needs of our internal and external users. You will take the lead on projects across the entire breadth of our tech stack, from planning all the way through to delivery and maintenance - you will bring others on the team with you on the journey too and not just go it alone. You will be responsible for visibility and monitoring of those systems, for building tooling and automation to reduce TOIL and for responding to incidents as part of our 24/7 SRE on-call team.
Key responsibilities of the roleWe're looking for a great Senior SRE to be a hands-on individual contributor to key technical projects and to help us build a first-class SRE function. This role will involve:
- Hands-on work with technical projects, taking direction from the team Principals
- Implement and maintain monitoring solutions / metric-driven alerting, logging and tracing
- Troubleshoot in complex environments
- Establish and measure SLIs and SLOs with engineering teams and continuously improve relationships and ways of working with other engineering teams
- Participate in periodic 24x7 paid on-call duties
- Build and manage systems, infrastructure and applications using infrastructure as code and automation (Terraform, Ansible, K8s, Helm, Go)
- Pair programming, knowledge sharing and running appropriate training sessions for the team
- Writing well-defined tickets (and supporting documentation when required) as well as keeping them up-to-date
- Strong communication skills with the ability and openness to work across a range of varied stakeholders and confidence to check and challenge when required.
- Cares about evolving SRE best practices (through a security lens) and is driven to find the right ways of working with the team
- Is self-driven and constantly striving to improve everything with automation and monitoring
- Is able and willing to travel to our physical datacenters in the U.K should the need arise
- Demonstrates and promotes positive attitudes and behaviours: collaboration, learning, sharing, respect and kindness
We prefer to work with the best talent regardless of whether you are familiar with all of the tools that we use. We don’t need you to be familiar with everything on this list but experience in some or all of these areas will be useful and a willingness to dive in and learn the others, essential.
- A strong background in SRE/DevOps or Linux System Administration
- A strong background in system automation using configuration management systems such as Ansible, Chef or Puppet.
- A solid understanding of containerisation and container orchestration using tools such as Kubernetes
- Experience with creation of automation using APIs
- Experience of automation testing in an Agile Software environment
- Close familiarity with some or all of:
- Network management and optimisation
- Postgresql Database management and optimisation
- With common security frameworks CIS, NIST, OWASP
- Familiarity with Public Cloud Services like AWS | GCP | Azure
- Familiarity with co-located physical infrastructure (we’re currently hybrid)
- Solid understanding of Continuous Integration (CI) and Continuous Deployment (CD)
- Close familiarity with or direct experience of the trade-offs and design decisions Software Engineers need to make when developing applications that must perform and scale well in the real world
- Experience with technical writing and or reviewing technical designs
- Strong experience and understanding of Agile practices including Scrum, Kanban etc
- An understanding of one or more of the following languages: Ruby, Java, Go, Bash/Shell
- Strong experience with issue tracking software like Jira and story management lifecycle in general
Everyone says it, but in our case it’s true: Each member of our engineering team is amazing in their own right, but together they are what brings our product to life.
We’re very proud of the team we’ve built – there’s around 50 of us in Product and Tech now after growing quickly in 2023/24. We have ambitious plans to further improve our ways of engineering and to continue to enable boards to ‘see what matters’. You’ll play a big role in helping us achieve this in 2025/26 and beyond.
Tech StackOur applications are written in Ruby (with Rails) or Java. Client-side web apps are written in React, and some services in Clojure, Java and Go.
Our platform consists of:
- Multiple Kubernetes Cluster for Container orchestration
- Apache Kafka and Redis shortly Postgres for event messaging
- Postgres for data storage
- OpenStack Swift for Object storage
- Juniper & Cisco networking devices
- A number of internally written tools for managing the platform written in Go
We run our own physical infrastructure co-located in three datacentres across the UK. We also run a public cloud Production Environment on GCP for one of our products and we’re moving in the direction of more public cloud for production and pre-production environments and pipelines.
Benefits- Competitive salary & pension scheme
- Personal performance bonus
- 26 days holiday each calendar year
- Bupa health & dental cover
- Group life insurance
- EAP; AIG Smart Health and Bereavement Counselling & Probate Helpline
- Regular training & development, mini MBA series, lunch & learns
- Cycle to work scheme
- Competitive parental policies
- Gym membership discounts
- Monthly company socials
-
Senior Site Reliability Engineer
1 week ago
London, United Kingdom X4 Group Full timeA leading financial data analytics company are seeking an experienced and ambitious Senior Site Reliability Engineer to join their established team on a permanent basis, taking up a senior or leading role in the design, build, and continual improvement oftheir cloud based microservices systems. The Senior Site Reliability Engineer would be joining the site...
-
Senior Site Reliability Engineer
1 week ago
London, United Kingdom eFinancialCareers Full timeJoin us as a Senior Site Reliability Engineer - We'll look to you to establish and run a SRE function to help design, build, deliver and run highly reliable, scalable and secure software systems - This is a great opportunity to hone your existing engineering skills and advance your career in this critical role **What you'll do** As a Senior Site Reliability...
-
London, United Kingdom Method Resourcing Full time**Senior Site Reliability Engineer | Senior Devops Engineer | Senior SRE | Senior DevOps | AWS | Terraform | Python | Docker | CI/CD | Jenkins | Kubernetes | Git** **Cambridge / Remote - Permanent - 100k + 10% Bonus + Benefits** Method Resourcing have the utmost privilege of working alongside a fantastic IOT organisation on a mission to scale...
-
Site Reliability Engineer
5 days ago
London, United Kingdom Prism Digital Full time**Senior Site Reliability Engineer (SRE) | GCP/AWS | Market Intelligence Leaders** We have an exciting opportunity for a Senior Site Reliability Engineer (SRE) to join a global organisation involved in the market intelligence space. Our client's AI-powered platform provides businesses with world-class and real-time consumer analytics. They are looking for...
-
Senior Site Reliability Engineer
2 weeks ago
London, United Kingdom Arcus Search Full time €105,000Senior Site Reliability Engineer Location : London (3–4 days in-office per week) Salary : Up to £105,000 A leading organization in the financial services industry is seeking a talented Senior Site Reliability Engineer to join their established technology team in London. This is an exciting opportunity to work on cutting-edge infrastructure,...
-
Senior Site Reliability Engineer
2 weeks ago
London, United Kingdom Arcus Search Full time €105,000Senior Site Reliability Engineer Location : London (3–4 days in-office per week) Salary : Up to £105,000 A leading organization in the financial services industry is seeking a talented Senior Site Reliability Engineer to join their established technology team in London. This is an exciting opportunity to work on cutting-edge infrastructure,...
-
Senior Site Reliability Engineer
2 weeks ago
London, United Kingdom Arcus Search Full timeSenior Site Reliability EngineerLocation: London (3–4 days in-office per week)Salary: Up to £105,000A leading organization in the financial services industry is seeking a talented Senior Site Reliability Engineer to join their established technology team in London. This is an exciting opportunity to work on cutting-edge infrastructure, middleware, and...
-
Senior Site Reliability Engineer
2 weeks ago
London Area, United Kingdom Arcus Search Full timeSenior Site Reliability Engineer Location : London (3–4 days in-office per week) Salary : Up to £105,000 A leading organization in the financial services industry is seeking a talented Senior Site Reliability Engineer to join their established technology team in London. This is an exciting opportunity to work on cutting-edge infrastructure, middleware,...
-
Senior Site Reliability Engineer
2 weeks ago
London Area, United Kingdom Arcus Search Full timeSenior Site Reliability EngineerLocation: London (3–4 days in-office per week)Salary: Up to £105,000A leading organization in the financial services industry is seeking a talented Senior Site Reliability Engineer to join their established technology team in London. This is an exciting opportunity to work on cutting-edge infrastructure, middleware, and...
-
Senior Site Reliability Engineer
2 weeks ago
London Area, United Kingdom Arcus Search Full timeSenior Site Reliability EngineerLocation: London (3–4 days in-office per week)Salary: Up to £105,000A leading organization in the financial services industry is seeking a talented Senior Site Reliability Engineer to join their established technology team in London. This is an exciting opportunity to work on cutting-edge infrastructure, middleware, and...
-
Senior Site Reliability Engineer
20 hours ago
London, United Kingdom Stratospherec Limited Full time**Senior Site Reliability Engineer** **Fully Remote - £110k to £120k + Benefits** Our client, a Global Digital SaaS Software Company have a fantastic fully remote opportunity for an experienced Senior Site Reliability Engineer to join their UK Cloud Infrastructure team. The SRE team work on a fully remote basis and work in conjunctionwith their US and...
-
Site Reliability Engineer Lead
1 week ago
London, Greater London, United Kingdom Board Intelligence Limited Full timeJob Title: Site Reliability Engineer LeadBoard Intelligence Limited is seeking a highly skilled Senior Monitoring Site Reliability Engineer to join our team. The estimated salary for this position is £80,000 - £110,000 per year.We are looking for an experienced SRE with a strong background in system automation, containerisation, and security frameworks. As...
-
Senior Site Reliability Engineer
3 months ago
London, United Kingdom numi Full time €95,000Senior Site Reliability Engineer (SRE) Location: London Hybrid Salary - Up to £95k Are you ready to make a real impact in the fintech world? We’re looking for a passionate Senior Site Reliability Engineer (SRE) to join our dynamic team. At our company, we believe in empowering businesses and communities to thrive in the digital economy. You’ll have...
-
Senior Site Reliability Engineer
4 weeks ago
London, United Kingdom numi Full time €95,000Senior Site Reliability Engineer (SRE) Location: London Hybrid Salary: Up to £95k Are you ready to make a real impact in the fintech world? We’re looking for a passionate Senior Site Reliability Engineer (SRE) to join our dynamic team. At our company, we believe in empowering businesses and communities to thrive in the digital economy. You’ll have...
-
Senior Site Reliability Engineer
4 weeks ago
London, United Kingdom numi Full timeSenior Site Reliability Engineer (SRE)Location: LondonHybridSalary: Up to £95kAre you ready to make a real impact in the fintech world? We’re looking for a passionate Senior Site Reliability Engineer (SRE) to join our dynamic team. At our company, we believe in empowering businesses and communities to thrive in the digital economy. You’ll have the...
-
Senior Site Reliability Engineer
3 weeks ago
London, United Kingdom numi Full timeSenior Site Reliability Engineer (SRE) Location: London Hybrid Salary: Up to £95k Are you ready to make a real impact in the fintech world? We’re looking for a passionate Senior Site Reliability Engineer (SRE) to join our dynamic team. At our company, we believe in empowering businesses and communities to thrive in the digital economy. You’ll have...
-
Site Reliability Engineer
3 weeks ago
London, Greater London, United Kingdom SNAPLOGIC Full timeSite Reliability Engineer JobWe are seeking a highly skilled Site Reliability Engineer to join our Infrastructure Engineering and Operations Team at SNAPLOGIC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based systems, as well as developing and implementing strategies to improve their...
-
Site Reliability Engineering Manager
2 weeks ago
London, Greater London, United Kingdom numi Full timeSenior Monitoring Site Reliability Engineer Job DescriptionJob Type: Full-timeLocation: London, UKSalary: Up to £95,000 per annum.We're looking for an experienced Senior Monitoring Site Reliability Engineer to join our team at numi. As a key member of our engineering team, you'll be responsible for designing and implementing monitoring tools to ensure high...
-
Site Reliability Engineer
1 month ago
London, United Kingdom Arcus Search Full timeSenior Site Reliability Engineer (SRE) Are you interested in shaping the future of infrastructure, automation, and reliability at a Leading Fintech? We’re on the lookout for a Senior Site Reliability Engineer who thrives on tackling complex challenges, building scalable systems, and leading the charge in creating a world-class engineering ecosystem....
-
Site Reliability Engineer
1 month ago
London, United Kingdom Arcus Search Full timeSenior Site Reliability Engineer (SRE) Are you interested in shaping the future of infrastructure, automation, and reliability at a Leading Fintech? We’re on the lookout for a Senior Site Reliability Engineer who thrives on tackling complex challenges, building scalable systems, and leading the charge in creating a world-class engineering ecosystem....