Site Reliability Engineer
1 month ago
We are seeking a highly skilled Site Reliability Engineer to join our team at ESL FACEIT Group. As a key member of our infrastructure team, you will be responsible for designing, analyzing, and troubleshooting large-scale distributed systems.
As a Site Reliability Engineer, you will work closely with our software engineering teams to deploy and operate our systems, ensuring they are reliable, scalable, and meet the needs of our users. You will also be responsible for maintaining and improving our monitoring and observability tools, as well as developing and driving adoption of SRE best practices across the company.
Key Responsibilities- Maintaining and improving the monitoring and observability tools (Grafana/Prometheus/Thanos/Jaeger);
- Working closely with your team and with other cross-functional teams to help design, maintain and operate systems at scale;
- Developing and driving adoption of SRE best practices across the company;
- Leading on incident management process and adoption;
- Using your troubleshooting skills to help identify and fix operational issues;
- Working with Cloud Native technologies such as Kubernetes, Envoy, Istio, Prometheus and Helm;
- Working with the "Hashi Stack" (terraform, packer, vault);
- Experimenting with and introducing cutting edge technologies.
- Proven experience as a Site Reliability Engineer, DevXP Engineer or Software Engineer, focusing on building and maintaining scalable infrastructures;
- Excellent working knowledge on at least one of the major cloud providers (GCP/AWS/Azure);
- You have experience with cluster management systems (Kubernetes);
- Knowledge of incident management: ability to investigate, troubleshoot, recover and prevent the recurrence of incidents that interfere with the normal delivery of IT services;
- Proficient in Go language and some level of proficiency in at least another language: Java, Python, Rust...;
- You have knowledge of GitOps practices;
- You have production scale experience with one of the following; MongoDB, Redis, MySQL;
- Experience contributing to open source technologies would be an added bonus.
-
Site Reliability Engineer
1 month ago
London, Greater London, United Kingdom LinuxRecruit Full timeAbout the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at LinuxRecruit. As a key member of our engineering team, you will play a critical role in ensuring the reliability and scalability of our systems.Key ResponsibilitiesDesign and implement monitoring and automation solutions using Golang and other contemporary...
-
Site Reliability Engineer
3 weeks ago
London, Greater London, United Kingdom Fourier Full timeKey ResponsibilitiesAs a Site Reliability Engineer at Fourier, you will be responsible for designing and implementing tools to enhance the reliability and resilience of our production systems. This includes investigating failures, improving system performance, and automating manual processes.Required SkillsExcellent Python scripting skillsExperience with...
-
Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom Fourier Full timeKey ResponsibilitiesWe are seeking a skilled Site Reliability Engineer to join our team at Fourier. As a member of our Site Reliability Engineering team, you will be responsible for developing tools for surveillance and enhancement of our production systems.Key responsibilities include increasing system resilience, investigating failure, and improving...
-
Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom Curve Full timeAbout the RoleWe are seeking a skilled Site Reliability Engineer to join our team at Curve. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our infrastructure, identifying areas for improvement, and implementing solutions to optimize our systems.Key responsibilities include:Collaborating with...
-
Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom J Bandy Consulting Full timeJob SummaryThe Site Reliability Engineer will be responsible for ensuring the reliability, scalability, and performance of our systems. This role requires a strong understanding of SRE best practices, expertise in Git and GitOps, and experience with logging and monitoring solutions.Key ResponsibilitiesDevelop and maintain the Site Reliability Engineering...
-
Site Reliability Engineer
1 month ago
London, Greater London, United Kingdom https:www.energyjobline.comsitemap Full timeTransforming Industries through InnovationAt Apple, we don't just build products - we craft experiences that revolutionize entire industries. Our diverse team of innovators inspires groundbreaking solutions in everything we do. If you're passionate about designing, engineering, and running systems and infrastructure that impact millions, we want to hear from...
-
Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom J Bandy Consulting Full timeJob SummaryThe Site Reliability Engineer will be responsible for ensuring the reliability, scalability, and performance of our systems. This role requires a strong understanding of SRE best practices, expertise in Git and GitOps, and experience with logging and monitoring solutions.Key ResponsibilitiesDevelop and maintain the Site Reliability Engineering...
-
Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom J Bandy Consulting Full time**Job Summary**J Bandy Consulting is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our systems.**Key Responsibilities**Develop and maintain a culture of reliability and scalability across the team.Apply automation and...
-
Site Reliability Engineer
1 month ago
London, Greater London, United Kingdom eMFusion Global Full timeSite Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at eMFusion Global. This is a contract role offering the flexibility to work remotely or from our office in London.Key Responsibilities:Implement, maintain, and enhance monitoring solutions using Datadog, ensuring optimal performance and real-time...
-
Senior Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom JPMorganChase Full timeAbout the RoleWe're seeking a skilled Senior Site Reliability Engineer to join our team at JPMorgan Chase. As a key member of our Accelerators Engineering team, you will play a crucial role in ensuring the reliability and scalability of our products.As a Senior Site Reliability Engineer, you will be responsible for creating high-quality designs, roadmaps,...
-
Site Reliability Engineer
1 month ago
London, Greater London, United Kingdom Experian Full timeAbout the RoleWe're seeking a skilled Site Reliability Engineer to join our Experian Data Quality team in London, working on a hybrid schedule.As a key member of our QA team, you'll ensure the reliability, performance, and scalability of our market-leading data management products, focusing on observability to support incident resolution and drive ongoing...
-
Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom Apple Full timeJob SummaryAt Apple, we're looking for talented Site Reliability Engineers to join our team. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability and scalability of our services. You'll work closely with our development teams to design, build, and operate the systems and infrastructure that power our products and...
-
Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom Curve Full timeJob DescriptionAt Curve, we're on a mission to simplify your finances and help you live inspired. We're looking for a talented Site Reliability Engineer to join our team and help us scale our platforms to meet the needs of millions of customers.The ideal candidate will have a strong background in cloud infrastructure, with experience deploying...
-
Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom Curve Full timeAbout the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Curve. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based infrastructure.Key Responsibilities:Design and implement scalable and reliable cloud infrastructure using Amazon Web Services and Google...
-
Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom STAND 8 Technology Services Full time $75 - $85About the RoleWe are seeking an experienced Site Reliability Engineer to support our systems focused on linear channel delivery and modernization efforts. The ideal candidate will be responsible for maintaining existing systems, working on infrastructure modernization, and supporting the streaming engineering team to ensure smooth operation of linear...
-
Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom Fourier Full timeKey ResponsibilitiesAs a Site Reliability Engineer at Fourier, you will be responsible for developing tools to enhance and monitor production systems, increasing system resilience, investigating failures, and improving reliability.You will also be responsible for automating manual processes and remediating incidents in real-time.RequirementsExcellent Python...
-
Site Reliability Engineer
1 month ago
London, Greater London, United Kingdom IO Associates Full timeJob OpportunityIO Associates is seeking a highly skilled Site Reliability Engineer to join their team for a short-term project within the Law Enforcement sector.Monitor system performance and security to ensure optimal functionality.Collaborate with the team to identify and resolve technical issues.This role offers a competitive daily rate of up to £500 per...
-
Site Reliability Engineer
3 weeks ago
London, Greater London, United Kingdom J Bandy Consulting Full timeJob SummaryJ Bandy Consulting is seeking an experienced Site Reliability Engineer to join our team. The ideal candidate will have a strong background in software engineering and a passion for building scalable and reliable systems.Key ResponsibilitiesDevelop and implement automation tools to improve the efficiency of our systemsCollaborate with...
-
Site Reliability Engineer
2 weeks ago
London, Greater London, United Kingdom Selby Jennings Full timeAbout Selby JenningsWe're a leading global financial services firm where technologists and investment professionals collaborate to drive innovation and operational excellence.About the RoleAs a Site Reliability Engineer, you'll apply your expertise in software and systems engineering to design, build, and maintain our robust infrastructure. You'll reduce...
-
Site Reliability Engineering Manager
1 month ago
London, Greater London, United Kingdom Apollo Solutions Full timeJob OverviewSite Reliability Engineering ManagerApollo Solutions is seeking a seasoned Site Reliability Engineering Manager to lead our team in ensuring the reliability and efficiency of our cloud-based services. As a key member of our Platform Engineering team, you will be responsible for driving the development and implementation of our cloud...