Senior Site Reliability Engineer

4 weeks ago


London Area, United Kingdom Vertex Search Full time

Senior/Lead SRE opportunity, top tier finance organisation


We are seeking a Senior SRE to join our client as their first SRE and play a pivotal role in constructing a comprehensive observability platform. If successful, you will be responsible for designing, deploying, and maintaining a system that grants visibility into their IT infrastructure and operations.


Your Role:


  • Architect and implement a comprehensive observability and traceability platform.
  • Identify and address gaps in monitoring coverage, collaborating with cross-functional teams to implement solutions.
  • Proactively identify and remediate system performance issues.
  • Develop and implement strategies to enhance system reliability and scalability.
  • Partner with stakeholders to define and configure alerting mechanisms.
  • Champion automation initiatives, utilizing automation tools and frameworks for efficient code deployment and system management.
  • Documentation of system configurations and operational procedures.


Qualifications:


  • 5+ years experience as a Senior SRE
  • Demonstrated expertise in both AWS and Azure.
  • In-depth understanding of Windows Server, Linux operating systems, and Kubernetes container orchestration.
  • Strong foundation in network monitoring principles, with familiarity of NetFlow and network telemetry streaming a significant advantage.
  • 5+ years of experience working with logging, tracing, and metrics platforms (experience with Grafana, Influx, Prometheus, ELK Stack, or Loki is preferred).
  • Proven ability to interact with third-party APIs for data integration and analysis.
  • Experience with data collection and transformation systems like Open Telemetry.
  • Strong scripting and programming skills with the likes of Bash/PowerShell/Python.


What They Can Offer:


  • Excellent salary and annual bonus potential, plus private healthcare, 11% pension and family-oriented benefits.
  • Being the first SRE into the business with scope to grow the team and evangelise for the SRE mindset, with the opportunity to make a significant impact on business-critical systems.
  • Hybrid work environment, with home working setup grant.
  • A commitment to continuous learning and professional development.


If you are a passionate SRE with a talent for building robust monitoring solutions, please apply



  • London, United Kingdom eFinancialCareers Full time

    Join us as a Senior Site Reliability Engineer - We'll look to you to establish and run a SRE function to help design, build, deliver and run highly reliable, scalable and secure software systems - This is a great opportunity to hone your existing engineering skills and advance your career in this critical role **What you'll do** As a Senior Site Reliability...


  • London Area, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and currently...


  • London Area, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and currently...


  • London Area, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and currently...


  • London Area, United Kingdom Understanding Recruitment Full time

    Site Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users. The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and...


  • London, United Kingdom CIRCLE Full time

    Circle is a financial technology company at the epicenter of the emerging internet of money, where value can finally travel like other digital data — globally, nearly instantly and less expensively than legacy settlement systems. This ground-breaking new internet layer opens up previously unimaginable possibilities for payments, commerce and markets that...


  • London Area, United Kingdom Acquire Me Full time

    Site Reliability Engineer - Developer Tooling Our client is a renowned global market making firm. They're hiring for a SRE with strong full-stack SWE skills with a background working on complex high availability infrastructure. You'll join a small group of high calibre SWEs building custom tooling from the ground up through to production, with a core focus...


  • London, United Kingdom TEKsystems Full time

    Site Reliability Engineer / SRE Description: My global client is looking for a Site Reliability Engineer / SRE to join their growing team who must have strong experience working within the financial services industry on large complex projects. To be successful in this Site Reliability / SRE project you will need expert experience within: AWS ...


  • London Area, United Kingdom Salt Full time

    Site Reliability Engineer – Hybrid – London Day rate: £500 - £700 (inside IR35)Duration: 6 – 12 months Start: ASAP My new client is looking for a Site Reliability Engineer to join the team on a contract basis. You must be currently working as an SRE for a few years. This is a hybrid role so 2 days per week in the London office. Must have...


  • London Area, United Kingdom Salt Full time

    Site Reliability Engineer – Hybrid – London Day rate: £500 - £700 (inside IR35)Duration: 6 – 12 months Start: ASAP My new client is looking for a Site Reliability Engineer to join the team on a contract basis. You must be currently working as an SRE for a few years. This is a hybrid role so 2 days per week in the London office. Must have...


  • London Area, United Kingdom Salt Full time

    Site Reliability Engineer – Hybrid – London Day rate: £500 - £700 (inside IR35) Duration: 6 – 12 months Start: ASAP My new client is looking for a Site Reliability Engineer to join the team on a contract basis. You must be currently working as an SRE for a few years. This is a hybrid role so 2 days per week in the London office. Must have...


  • London Area, United Kingdom Salt Full time

    Site Reliability Engineer – Hybrid – London Day rate: £500 - £700 (inside IR35)Duration: 6 – 12 months Start: ASAP My new client is looking for a Site Reliability Engineer to join the team on a contract basis. You must be currently working as an SRE for a few years. This is a hybrid role so 2 days per week in the London office. Must have...


  • London, United Kingdom Mondrian Alpha Full time

    Our client, a leading multi strat fund is seeking a senior Site Reliability Engineer who'll bring expertise in windows storage and python development to a well established, very successful SRE / infrastructure team. We're looking for expert level python coding along with windows storage, kubernetes and experience with distributed data platforms.


  • London, United Kingdom ESL FACEIT Group Full time

    At EFG (ESL FACEIT Group) we create worlds beyond gameplay, where players and fans become a community. We pride ourselves in having a corporate social responsibility which is that “IT’S NOT GG, UNTIL IT’S GG FOR ALL”.Our passion, craft, and DNA are aligned to create and shape the world of esports, gaming tournaments, leagues, events, and holistic...


  • London Area, United Kingdom Mondrian Alpha Full time

    A world leading multi strat, systematic fund are seeking an automation heavy (python / powershell) infrastructure site reliability engineer who primarily has experience in windows environments and a specialism in storage. You'd be joining an SRE team that underpins the entirety of the funds systems meaning you'll have direct impact on the success of the...


  • London Area, United Kingdom Mondrian Alpha Full time

    A world leading multi strat, systematic fund are seeking an automation heavy (python / powershell) infrastructure site reliability engineer who primarily has experience in windows environments and a specialism in storage. You'd be joining an SRE team that underpins the entirety of the funds systems meaning you'll have direct impact on the success of the...


  • London Area, United Kingdom Mondrian Alpha Full time

    A world leading multi strat, systematic fund are seeking an automation heavy (python / powershell) infrastructure site reliability engineer who primarily has experience in windows environments and a specialism in storage. You'd be joining an SRE team that underpins the entirety of the funds systems meaning you'll have direct impact on the success of the...


  • London Area, United Kingdom Mondrian Alpha Full time

    A world leading multi strat, systematic fund are seeking an automation heavy (python / powershell) infrastructure site reliability engineer who primarily has experience in windows environments and a specialism in storage. You'd be joining an SRE team that underpins the entirety of the funds systems meaning you'll have direct impact on the success of the...


  • London, United Kingdom ESL Faceit Group Full time

    This job is brought to you by Jobs/Redefined, the UK's leading over-50s age inclusive jobs board. At EFG (ESL FACEIT Group) we create worlds beyond gameplay, where players and fans become a community. We pride ourselves in having a corporate social responsibility which is that "IT'S NOT GG, UNTIL IT'S GG FOR ALL". Our passion, craft, and DNA...


  • London Area, United Kingdom Acquire Me Full time

    Site Reliability Engineer - Developer ToolingOur client is a renowned global market making firm. They're hiring for a SRE with strong full-stack SWE skills with a background working on complex high availability infrastructure. You'll join a small group of high calibre SWEs building custom tooling from the ground up through to production, with a core focus on...