Senior Site Reliability Engineer
2 weeks ago
At EFG (ESL FACEIT Group) we create worlds beyond gameplay, where players and fans become a community. We pride ourselves in having a corporate social responsibility which is that “IT’S NOT GG, UNTIL IT’S GG FOR ALL”.
Our passion, craft, and DNA are aligned to create and shape the world of esports, gaming tournaments, leagues, events, and holistic ecosystems through our millions of players, fans, and heroes, as well as through our people, and culture.
About FACEIT
With more than 25m users playing 30m matches every month FACEIT is the leading competitive gaming platform. We provide gamers the best experience possible by making sure we are always on top of our tech – and continue to deliver industry-leading features to our already awesome platform.
The Team:
As a Senior Site Reliability Engineer at EFG, you will be designing, analyzing, and troubleshooting large-scale distributed systems. You will demonstrate a systematic problem-solving approach, and the ability to debug and optimize code and to automate routine tasks. You will ensure that EFG’s services and systems are reliable, that they have uptime appropriate to users' needs and they have a fast rate of improvement.
Apart from monitoring our systems' capacity and performance, you will also focus on optimizing existing systems, on building infrastructure and on eliminating work through automation. You will work collaboratively with the software engineering teams to deploy and operate our systems, and you will help to automate and streamline our operations and processes. Within this role, you will be given real responsibilities, and you have the opportunity to drive change and have a big impact on our products and platform.
- Maintaining and improving the monitoring and observability tools (Grafana/Prometheus/Thanos/Jaeger);
- Working closely with your team and with other cross-functional teams to help design, maintain and operate systems at scale;
- Developing and driving the adoption of SRE best practices across the company;
- Leading on incident management process and adoption;
- Using your troubleshooting skills to help identify and fix operational issues;
- Working with Cloud Native technologies such as Kubernetes, Envoy, Istio, Prometheus and Helm;
- Working with the “Hashi Stack” (terraform, packer, vault);
- Experimenting with and introducing cutting-edge technologies.
Requirements
- Proven experience as an Senior Site Reliability Engineer or Software Engineer, focusing on building and maintaining scalable infrastructures;
- Excellent working knowledge on at least one of the major cloud providers (GCP/AWS/Azure);
- You have experience with cluster management systems (Kubernetes);
- Knowledge of incident management: ability to investigate, troubleshoot, recover and prevent the recurrence of incidents that interfere with the normal delivery of IT services;
- Proficient in Go language and proficiency in at least another language: Java, Python, Rust…;
- You have knowledge of GitOps practices;
- You have production scale experience with one of the following; MongoDB, Redis, MySQL;
- Experience contributing to open source technologies would be an added bonus.
-
Senior Site Reliability Engineer
1 month ago
London, United Kingdom eFinancialCareers Full timeJoin us as a Senior Site Reliability Engineer - We'll look to you to establish and run a SRE function to help design, build, deliver and run highly reliable, scalable and secure software systems - This is a great opportunity to hone your existing engineering skills and advance your career in this critical role **What you'll do** As a Senior Site Reliability...
-
Senior Site Reliability Engineer
1 month ago
London, United Kingdom CIRCLE Full timeCircle is a financial technology company at the epicenter of the emerging internet of money, where value can finally travel like other digital data — globally, nearly instantly and less expensively than legacy settlement systems. This ground-breaking new internet layer opens up previously unimaginable possibilities for payments, commerce and markets that...
-
Site Reliability Engineer
2 weeks ago
London, United Kingdom TEKsystems Full timeSite Reliability Engineer / SRE Description: My global client is looking for a Site Reliability Engineer / SRE to join their growing team who must have strong experience working within the financial services industry on large complex projects. To be successful in this Site Reliability / SRE project you will need expert experience within: AWS ...
-
Senior Site Reliability Engineer Python
4 weeks ago
London, United Kingdom Mondrian Alpha Full timeOur client, a leading multi strat fund is seeking a senior Site Reliability Engineer who'll bring expertise in windows storage and python development to a well established, very successful SRE / infrastructure team. We're looking for expert level python coding along with windows storage, kubernetes and experience with distributed data platforms.
-
Senior Site Reliability Engineer
7 days ago
London, United Kingdom ESL Faceit Group Full timeThis job is brought to you by Jobs/Redefined, the UK's leading over-50s age inclusive jobs board. At EFG (ESL FACEIT Group) we create worlds beyond gameplay, where players and fans become a community. We pride ourselves in having a corporate social responsibility which is that "IT'S NOT GG, UNTIL IT'S GG FOR ALL". Our passion, craft, and DNA...
-
Site Reliability Engineer
2 weeks ago
London, United Kingdom Understanding Recruitment Full timeSite Reliability Engineer Check all associated application documentation thoroughly before clicking on the apply button at the bottom of this description.I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability,...
-
Site Reliability Engineer
3 weeks ago
London, United Kingdom Understanding Recruitment Full timeSite Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users. The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and...
-
Site Reliability Engineer
2 days ago
London, United Kingdom Understanding Recruitment Full timeSite Reliability Engineer Check all associated application documentation thoroughly before clicking on the apply button at the bottom of this description.I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability,...
-
Site Reliability Engineer
3 weeks ago
London, United Kingdom Understanding Recruitment Full timeSite Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and currently...
-
Site Reliability Engineer
1 day ago
London, United Kingdom Understanding Recruitment Full timeJob Description Site Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users. The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and...
-
Site Reliability Engineer
3 weeks ago
London, United Kingdom Understanding Recruitment Full timeSite Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users. The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and...
-
Site Reliability Engineer
1 week ago
London, United Kingdom Understanding Recruitment Full timeJob DescriptionSite Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance,...
-
Site Reliability Engineer
4 weeks ago
London, United Kingdom Experian Full timeJob Description Work that matters – what you’ll be doing We’re looking for a Site Reliability Engineer to join our Experian Data Quality team where you will be working on cutting edge products within our Aperture suite (Data Studio and Data Governance). This role has aspects of both reliability engineering (SRE) and test engineering (SDET)....
-
Site Reliability Engineer
4 weeks ago
London, United Kingdom N Consulting Ltd Full timeJob title: Site Reliability EngineerWork Mode: 3 days office MandatoryLocation: 5 Broadgate, London EC2M 2QS, United KingdomContract Duration: 12 monthsWe’re looking for a Site Reliability Engineer to:· determine the reliability of our digital products, technology services, and the infrastructure that underpins them· minimize the risk and impact of...
-
Site Reliability Engineer
1 month ago
London, United Kingdom McGregor Boyall Full time**Permanent role** **£70k - £120k per annum (+ package)** **SPONSORSHIP - AVAILABLE** **Location - Central London (hybrid working model)** **The Company** A Fortune 500 company based in Central London. **The Role** As a**Site Reliability Engineer**you will collaborate with product development teams. You will be instrumental providing engineering...
-
Windows Site Reliability Engineer
1 month ago
London, United Kingdom Mondrian Alpha Full timeJob Description: Our client, a leading multi strat fund is seeking a senior Site Reliability Engineer who'll bring expertise in windows storage and python development to a well established, very successful SRE / infrastructure team. We're looking for expert level python coding along with windows storage, kubernetes and experience with distributed...
-
Site Reliability Engineer
3 weeks ago
London Area, United Kingdom Understanding Recruitment Full timeSite Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and currently...
-
Site Reliability Engineer
3 weeks ago
London Area, United Kingdom Understanding Recruitment Full timeSite Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and currently...
-
Site Reliability Engineer
3 weeks ago
London Area, United Kingdom Understanding Recruitment Full timeSite Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users.The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and currently...
-
Site Reliability Engineer
3 weeks ago
London Area, United Kingdom Understanding Recruitment Full timeSite Reliability Engineer I am seeking a Site Reliability Engineer for one of the worlds fastest growing social media platforms. With over 900 Million Daily users. The SRE group come from diverse technical backgrounds, Reliability, Software Engineering and Security Engineering, and have a broad remit ensuring high availability and performance, and...