Senior SRE
3 days ago
Who we are:
At Focused, we move quickly to deliver quality software that achieves client outcomes and meets their customer's needs. We strategically partner with our clients to leverage our expertise in design and software, while our clients bring their own domain expertise. We work with a variety of clients from different industries, collaborating as we get new products to market, modernizing legacy systems, or helping teams learn the skills they need to be successful.
Our values:
- Listen first
• We are experts in product practices but life long learners in the domain of our customers. We research, collaborate, and understand. - Learn why
• We ask questions and talk to users to understand problem spaces, objectives, and goals, which allows us to deeply invest and drive towards the outcomes of our clients. - Love your craft
• We love diving into a variety of domains and solving problems. We take pride in delivering value, in communicating progress, and guiding our clients to success.
We are seeking an experienced Senior Observability Consultant with deep expertise in OpenTelemetry and strong Platform Engineering capabilities to help organizations implement, optimize, and scale their observability infrastructure. This role requires a seasoned consultant who can design comprehensive telemetry strategies, implement distributed tracing solutions, establish robust monitoring practices, and interface closely with clients on the observability journey.
Key Responsibilities:
OpenTelemetry & Observability
- Design and implement end-to-end OpenTelemetry solutions across diverse technology stacks
- Configure and deploy OpenTelemetry Collectors for efficient data collection, processing, sampling, and routing
- Establish telemetry pipelines for metrics, traces, and logs across microservices architectures
- Optimize collector configurations for performance, reliability, and cost-effectiveness
Platform Engineering & Infrastructure
- Augment existing infrastructure with with integrated observability solutions
- Implement Infrastructure as Code (IaC) solutions using Terraform, Pulumi, CloudFormation, etc.
- Architect and manage Kubernetes clusters with comprehensive monitoring and logging
- Build CI/CD pipelines with embedded observability and automated testing
Site Reliability Engineering (SRE)
- Establish and maintain Service Level Indicators (SLIs), Objectives (SLOs), and Agreements (SLAs)
- Implement error budgets, toil reduction strategies, and capacity planning
- Support incident response procedures and post-mortem processes
Cloud & DevOps Engineering
- Deploy and manage observability infrastructure across AWS, GCP, and Azure
- Establish security, compliance, and governance frameworks for telemetry data
- Experience automating Agent Evaluations in CI/CD pipelines and observability backends.
Required Qualifications:
Core Observability & OpenTelemetry
- 3-7 years of experience in observability, monitoring, and distributed systems
- Deep hands-on experience with OpenTelemetry ecosystem, including SDKs, APIs, and specifications
- Proficiency with OpenTelemetry Collector configuration, processors, exporters, and receivers
- Strong understanding of telemetry data models, semantic conventions, and instrumentation best practices
Platform Engineering & DevOps
- 5+ years of Platform Engineering or DevOps experience with focus on site reliability, observability, and incident response
- Proficiency with Infrastructure as Code tools (Terraform, Pulumi, CloudFormation, CDK)
- Strong experience with CI/CD platforms (GitHub Actions, GitLab CI, Jenkins, ArgoCD)
Cloud & Infrastructure
- Hands-on experience with major cloud providers (AWS, GCP, Azure) and their observability services
- Experience with container technologies (Docker, Podman) and container registries
- Knowledge of networking, security, load balancing, and distributed systems concepts
Site Reliability Engineering
- Experience implementing SRE practices including error budgets and toil metrics
- Proficiency in incident management, on-call procedures, and post-mortem culture
- Experience with capacity planning, performance optimization, and scalability design
Programming & Automation
- Proficiency in multiple programming languages preferred (Go, Python, Java, , Rust)
- Strong scripting and automation skills (Bash, Python, PowerShell)
- Understanding of software engineering best practices and testing methodologies
Preferred Qualifications (Exceptional Candidates)
AI & Agentic Frameworks
- Understanding of Large Language Models (LLMs) and their application in DevOps
- Knowledge of vector databases, embeddings, and retrieval-augmented generation (RAG)
- Experience with AI/ML model deployment and monitoring in production environments
Leadership & Communication
- Strong technical writing and documentation skills
- Ability to present complex technical concepts to diverse stakeholders
- A passion for knowledge sharing
Key Competencies
- Systems thinking and ability to design holistic observability solutions
- Strong analytical and troubleshooting skills for complex distributed systems
- Curiosity about emerging technologies, particularly AI applications in operations
- Adaptability to rapidly evolving cloud-native and observability technologies
- Collaborative mindset with focus on enabling developer productivity and system reliability
What Sets Exceptional Candidates Apart:
- Experience with Honeycomb
- Contributions to open-source observability or AI framework projects
- Track record of implementing platform engineering solutions that significantly improved developer experience
- Experience scaling observability infrastructure to handle high event volume
What to know before you apply:
- You will be expected to work on client sites in London or Luton for up to four days each week.
- The London base salary range for this role is £75,000 - £105,000 GBP.
-
Senior PostgreSQL SRE
1 week ago
London, Greater London, United Kingdom Barclays Full time £80,000 - £120,000 per yearJob DescriptionPurpose of the roleTo apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. AccountabilitiesAvailability, performance, and scalability of systems and services through proactive monitoring,...
-
Senior Engineering Manager, SRE
12 hours ago
London, Greater London, United Kingdom Pure Storage Full time £80,000 - £120,000 per yearWe're in an unbelievably exciting area of tech and are fundamentally reshaping the data storage industry. Here, you lead with innovative thinking, grow along with us, and join the smartest team in the industry.This type of work—work that changes the world—is what the tech industry was founded on. So, if you're ready to seize the endless opportunities and...
-
Senior SRE
1 week ago
London, Greater London, United Kingdom Focused Labs Full time £75,000 - £105,000 per yearWho we are:At Focused, we move quickly to deliver quality software that achieves client outcomes and meets their customer's needs. We strategically partner with our clients to leverage our expertise in design and software, while our clients bring their own domain expertise. We work with a variety of clients from different industries, collaborating as we get...
-
SRE DevOps Engineer
5 days ago
London, Greater London, United Kingdom Lloyds Banking Group Full timeJob Title:SRE DevOps EngineerLocation:LondonSalary: £81,999 - £91,110Hours:Full timeWorking Pattern: Hybrid, 40% (or two days) in office a week.About us…Like the modern Britain we serve, we're evolving. Investing billions in our people, data and tech to transform the way we meet the ever-changing needs of our 26 million customers. We're growing with...
-
SRE DevOps Engineer
2 days ago
London, Greater London, United Kingdom Lloyds Banking Group Full time £81,999 - £91,110End DateSunday 02 November 2025Salary Range£81,999 - £91,110We support flexible working – click here for more information on flexible working optionsFlexible Working OptionsHybrid Working, Job ShareJob Description Summary.Like the modern Britain we serve, we're evolving. Investing billions in our people, data and tech to transform the way we meet the...
-
London, Greater London, United Kingdom Jobs via eFinancialCareers Full time £60,000 - £75,000 per yearSenior DevOps Engineer (AWS Python Amazon Web Services DevOps SRE Kafka K-Streams Flink Kinesis Agile java GitHub Actions ArgoCD Site Reliability Platform Cloud Engineer Developer Kubernetes EKS Front Office Trading Finance Banking Asset Manager Investment Management) required by our trading software client in London.You MUST have the following:Very strong...
-
Senior Software Engineer/SRE
21 hours ago
London, Greater London, United Kingdom Bloomberg Full time £50,000 - £120,000 per yearLocationLondonBusiness AreaEngineering and CTORef # Description & RequirementsAre you passionate about building high-performance systems that are fast, resilient, and operate at global scale? Join Bloomberg's Application Middleware SRE team, where you'll combine software engineering and systems expertise to keep the backbone of the Bloomberg Terminal running...
-
Senior Software Engineer/SRE
5 days ago
London, Greater London, United Kingdom Jobs via eFinancialCareers Full time £60,000 - £120,000 per yearSenior Software Engineer/SRE - Managed Systems EngineeringLocationLondonBusiness AreaEngineering and CTORef # Description & RequirementsThe Bloomberg Terminal brings together real-time data on every market, breaking news, in-depth research, powerful analytics, communications tools and world-class execution capabilities - in one fully integrated solution. Key...
-
Senior Software Engineer, SRE
3 days ago
London, Greater London, United Kingdom Forter Full time £80,000 - £120,000 per yearAbout the roleWe're looking for a Senior Software Engineer with strong development skills and hands-on SRE experience to join our London-based team. You'll help shape the future of reliability and observability at Forter, ensuring we can build and run scalable, reliable and observable systems while collaborating with engineers across the globe. We maintain...
-
Senior Software Engineer, SRE
4 days ago
London, Greater London, United Kingdom Forter Full time £80,000 - £120,000 per yearAbout The RoleWe're looking for aSenior Software Engineerwith strong development skills and hands-onSREexperience to join our London-based team. You'll help shape the future of reliability and observability at Forter, ensuring we can build and run scalable, reliable and observable systems while collaborating with engineers across the globe. We maintain...