Cloud Site Reliability Engineer
7 days ago
About TikTok
TikTok is the leading destination for short-form mobile video. At TikTok, our mission is to inspire creativity and bring joy. TikTok's global headquarters are in Los Angeles and Singapore, with offices in New York, London, Dublin, Paris, Berlin, Dubai, Jakarta, Seoul, and Tokyo.
Why Join Us
Creation is the core of TikTok's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. To us, every challenge, no matter how ambiguous, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. Join us.
Responsibilities
- Build, expand, and operate Bytedance's global infrastructures, including large-scale systems in public and private clouds, data centers, and content delivery networks.
- Build tools, automation, visualizations, and monitors to facilitate the operation and optimization of the global infrastructure.
- Work in a fast-paced environment. Participate in technical operations and rotations in response to performance and reliability issues.
- Help improve the whole lifecycle of infrastructure services from inception and design throughout development to deployment, user support, and refinement.
Qualifications
Minimum Qualifications
- Master's degree (or Bachelor's degree with 3+ years of experience) in Computer Engineering, Electrical Engineering, Computer Science, or related major.
- 3+ years of experience working with Unix/Linux systems from kernel to shell and beyond, with experience working with system libraries, file systems, and client-server protocols.
- 2+ years of experience working on Public Cloud Platforms, familiar with basic components of cloud products. Experience in building solutions with AWS, Google, OCI, or other cloud services.
- 2+ years experience in one or more programming languages such as Java, C++, Go, or scripting experience in Shell and Python.
- 2+ years experience with essential system-level apps, like DNS, APT, LDAP, Nginx, CI/CD, Ansible, Packer, etc.
Preferred Qualifications
- Experience in system and data security.
- Self-driven and capable of coping with ambiguity and moving projects from concept to delivery.
- Strong analytical skills and the ability to solve real-world problems in a fast-moving environment.
- Experience in designing, analyzing, and building automation and tools for large-scale systems.
- Strong communication and collaboration skills.
- The passion for solving problems
- Patient for supporting cases
TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform
#J-18808-Ljbffr-
Site Reliability Engineer
7 days ago
London, Greater London, United Kingdom Faptic Technology Full timeFaptic Technology is a leading provider of IT consulting and managed services, specializing in Azure cloud solutions, software development, and site reliability engineering (SRE). We partner with enterprises to optimize their IT operations, ensuring scalability, reliability, and innovation in the cloud.As part of our growth in managed cloud services, we are...
-
Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom Searchability Full timeSite Reliability Engineer (Contract) – eDV ClearedCheck you match the skill requirements for this role, as well as associated experience, then apply with your CV below.Location : London (100% On-Site)Duration : 6 months initiallyRate : Up to £700pdIR35 : InsideClearance Required : BPSS & Active eDVOn - Call : Required – includes evenings, weekends, and...
-
Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom Searchability NS&D Full timeReliability Engineer (Contract) – eDV ClearedLocation: London (100% On-Site)Duration: 6 months initiallyRate: Up to £700pdIR35: InsideClearance Required: BPSS & Active eDVOn-Call: Required – includes evenings, weekends, and bank holidaysThe OpportunityI am looking for a Site Reliability Engineer with active eDV clearance to join a highly secure,...
-
Site Reliability Engineer
1 week ago
London, Greater London, United Kingdom Apple Inc. Full timeSite Reliability Engineer (SRE) - iCloudLondon, England, United KingdomSoftware and ServicesThe Apple Service Engineering - iCloud SRE team is looking for Site Reliability Engineers to build and run the services that hundreds of millions of customers use every day. This team provides systems that are foundational for many of Apple's services such as iCloud,...
-
Site Reliability Engineer
4 weeks ago
London, Greater London, United Kingdom Bmbneon Full timeJoin to apply for theSite Reliability Engineerrole atBMB Neon .Like the look of this opportunity Make sure to apply fast, as a high volume of applications is expected Scroll down to read the complete job description.The SRE Teamis responsible for managing Neon's multi-region, multi-cloud deployment in close collaboration with the broader engineering team, as...
-
Site Reliability Engineer
7 days ago
London, Greater London, United Kingdom Prism Digital Full timeJob Description Site Reliability Engineer | Clickhouse, Kafka & Terraform | Outside IR35 | RemoteI'm working with a well established UK based consultancy who are consulting into an enterprise client to help them go through a cloud data migration. They're looking for an experienced SRE who has deep expertise with Clickhouse & Kafka.Length: 12...
-
Site Reliability Engineer
7 days ago
London, Greater London, United Kingdom The Rundown AI, Inc. Full timeA World-Changing Company Palantir builds the world's leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more. The Role We're looking for Site Reliability Engineers who...
-
Site Reliability Engineer
7 days ago
London, Greater London, United Kingdom Bmbneon Full timeJoin to apply for the Site Reliability Engineer role at BMB Neon.The SRE Team is responsible for managing Neon's multi-region, multi-cloud deployment in close collaboration with the broader engineering team, as well as improving the reliability of the overall platform. All the features we want to implement can only reach our customers if the changes are...
-
Site Reliability Engineer
3 days ago
London, Greater London, United Kingdom Toggle AI Full timeMinimum QualificationsBachelor's degree in Computer Science, related field, or equivalent practical experience.4 years of experience as a software engineer.Experience programming in one or more of the following languages: C, C++, Python, Go, Perl, or Ruby.Experience in Site Reliability Engineering, System Design, and Distributed Computing.Experience in...
-
Site Reliability Engineer
7 days ago
London, Greater London, United Kingdom Luupli Full timeLuupli is a social media app that has equity, diversity, and equality at its heart. We believe that social media can be a force for good, and we are committed to creating a platform that maximizes the value that creators and businesses can gain from it, while making a positive impact on society and the planet. Luupli started internal testing since June 2024...
-
London, Greater London, United Kingdom Cloud Bridge Full timeWe are partnering with a forward-thinking organisation looking for a highly skilled Lead Cloud Engineer to manage and enhance their cloud infrastructure. The successful candidate will work closely with cloud architects and engineering teams to design, maintain, and innovate within scalable cloud environments. This is a leadership position where you'll mentor...
-
Senior Site Reliability Engineer
7 days ago
London, Greater London, United Kingdom Stacklok Full timeStacklok is an innovative software supply chain security startup founded by Kubernetes co-founder, Craig McLuckie and Sigstore founder, Luke Hinds. Our mission is to make it easier to securely develop software. With our deep expertise in open source technologies and commitment to enhancing software security, we are seeking highly skilled and motivated...
-
Site Reliability Engineer
3 days ago
London, Greater London, United Kingdom TOYOTA Connected Full timeSite Reliability Engineer (SRE)Hybrid workingWho are we?Toyota Connected Europe wants to create a better world through connected mobility for all. We are a new company created to bring big data and a customer focus into all aspects of the mobility experience so everyone's experience is more personal, convenient, fun and safe. We create and enable...
-
Principal Site Reliability Engineer
3 days ago
London, Greater London, United Kingdom NielsenIQ Full timeCompany DescriptionDiscover growth opportunities in the Consumer Durables sector with NIQ. Our comprehensive data solutions and industry-leading insights empower businesses to master market measurement, understand consumer behavior, and drive innovation.Job DescriptionKey Responsibilities:Provide senior-level leadership and technical guidance to the Site...
-
Site Reliability Engineering
3 weeks ago
London, Greater London, United Kingdom Apple Inc. Full timeSite Reliability Engineering (SRE) Manager, iCloudPeople at Apple don't just build products — they craft experiences our customers love and depend on. Apple Services Engineering (ASE) builds and supports the systems that make many of these daily experiences possible. If you've used Apple products, you've likely interacted with us. iCloud Services SRE teams...
-
Site Reliability Engineer
5 days ago
London, Greater London, United Kingdom Apple Inc. Full timeSite Reliability Engineer (SRE) - PaymentsAre you passionate, curious, and do you have a desire to learn and explore? Can you communicate ideas clearly, thoughtfully and respectfully, to a diverse audience? Do you have a good grasp of computer science fundamentals, and a sound understanding of concurrent and asynchronous processing? The people here at Apple...
-
Site Reliability Engineer
7 days ago
London, Greater London, United Kingdom Apple Inc. Full timeShazam Site Reliability Engineers are not just responsible for making sure all services and systems that Shazam relies on are operating at their highest level; they're also responsible for helping development teams embrace these principles as they develop software. Shazam SREs embed themselves with development teams and act as extensions of those teams to...
-
Site Reliability Engineer
3 weeks ago
London, Greater London, United Kingdom Prism Digital Full timeSite Reliability Engineer | Clickhouse, Kafka & Terraform | Outside IR35 | RemoteI'm working with a well-established UK-based consultancy who are consulting into an enterprise client to help them go through a cloud data migration. They're looking for an experienced SRE who has deep expertise with Clickhouse & Kafka.Length: 12 monthsRate: Competitive -...
-
Site reliability engineer
7 days ago
London, Greater London, United Kingdom Quorso UK Limited Full timeThe roleAs a Site Reliability Engineer, you will focus on improving the stability and security aspects of the technical stack of Quorso by:Owning monitoring and logging integrations, as well as alerting capabilities by improving and automating currently manual processes.Identifying and logging discovered performance and security-related issues.Working on...
-
Site Reliability Engineer
7 days ago
London, Greater London, United Kingdom JR United Kingdom Full timeSocial network you want to login/join with:Site Reliability Engineer (Observability)London- Hybrid/ 3 DaysContract Inside IR35- 6 Months initiallyWe're looking for a Site Reliability Engineer (SRE) to join our client to build and maintain observability systems and to ensure their core services remain reliable, scalable, and...