Senior Software Engineer, Site Reliability Engineering, Cloud

2 weeks ago


London, Greater London, United Kingdom Google Full time

This job is brought to you by Jobs/Redefined, the UK's leading over-50s age inclusive jobs board.

Minimum qualifications:

  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience.
  • Candidates will typically have 5 years of experience with software development in one or more programming languages.
  • Typically 5 years of experience with data structures or algorithms.
  • Typically 3 years of experience in designing, analyzing, and troubleshooting large-scale distributed systems, and 2 years of experience leading projects and providing technical leadership.

Preferred qualifications:

  • Experience working in computing, distributed systems, storage, or networking.
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
  • Ability to debug, optimize code, and to automate routine tasks.
  • Systematic problem-solving approach, coupled with effective verbal and written communication skills.

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our externally-visible systems-have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally SRE's will keep an ever-watchful eye on our systems capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you'll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running, ensuring our users have the best and fastest experience possible.

Responsibilities

  • Engage in and improve the whole lifecycle of services-from inception and design, through to deployment, operation and refinement.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
  • Practice sustainable incident response and blameless postmortems.
#J-18808-Ljbffr

  • London, Greater London, United Kingdom HOVER SENIOR LIVING COMMUNITY Full time

    Senior Site Reliability Engineer- Remote ClickHouse Published 10 Apr 2024 Share this job UK Remote Role Highlights GO SQL Data Governance Computer Science Distributed Systems SRE Site Reliability Security Operations Automation Database Tools, Libraries and Frameworks GCP ClickHouse AWS Docker Terraform Cisco Ansible Description As...


  • London, Greater London, United Kingdom Cloud Software Group Full time

    You understand software development principles and apply those to craft code that's easy to understand, modify, and test. You also understand quality, resiliency and supportability. If you're a self-motivated developer who enjoys taking ownership and making a tangible impact, we want to hear from you Our team: The StoreFront Services team, based in...


  • London, Greater London, United Kingdom Google Inc. Full time

    Senior Software Engineer, Site Reliability Engineering, Google Cloud corporate_fare Google place London, UK Apply Bachelor's degree in Computer Science, a related field, or equivalent practical experience. Candidates will typically have 5 years of experience with software development in one or more programming languages. Typically 5 years of...


  • London, Greater London, United Kingdom Loftware Full time

    A career at Loftware is more than just a job – it's an opportunity to help shape the supply chain of the future. Loftware is expanding its worldwide 24x7 Cloud Operations Team and we are looking for a technically motivated English speaking Cloud Operations Site Reliability Engineer with a strong cloud-based Linux and Windows knowledge. The Cloud...


  • London, Greater London, United Kingdom THINKalpha Full time

    Senior Site Reliability Engineer at ThinkAlpha ThinkAlpha is in search of a talented Senior Site Reliability Engineer to join our core infrastructure team. This role involves supporting our data analytics platform and transactional trading engine. Focus on scalability and reliability Help in building and shaping the infrastructure at ThinkAlpha Assist...


  • London, Greater London, United Kingdom Infogain Full time

    Job Title: Site Reliability Engineer Location: London We are seeking a Site Reliability Engineer to join our team. Minimum 5 years of experience as Developer/SysAdmin/DevOps engineer Experience with several open-source tools (Ansible, Jenkins ,Git, etc) Strong expertise in Jenkins Experience with Programming languages such as Java, C# and Scripting...


  • London, Greater London, United Kingdom ESL FACEIT Group Full time

    At EFG (ESL FACEIT Group) we create worlds beyond gameplay, where players and fans become a community. We pride ourselves in having a corporate social responsibility which is that "IT'S NOT GG, UNTIL IT'S GG FOR ALL".Our passion, craft, and DNA are aligned to create and shape the world of esports, gaming tournaments, leagues, events, and holistic ecosystems...


  • London, Greater London, United Kingdom MMC Corporate Full time

    Mercer IT Systems Engineering is seeking candidates for an experienced, Site Reliability Engineering Manager for AWS Cloud, based in our London office: We have ambitious and exciting plans to expand further into AWS,Here, you will have the opportunity to share your depth of technical AWS expertise with our great global SRE Cloud Engineering team plus wider...


  • London, Greater London, United Kingdom MMC Corporate Full time

    Mercer IT Systems Engineering is seeking candidates for an experienced, Site Reliability Engineering Manager for AWS Cloud, based in our London office: We have ambitious and exciting plans to expand further into AWS,Here, you will have the opportunity to share your depth of technical AWS expertise with our great global SRE Cloud Engineering team plus wider...


  • London, Greater London, United Kingdom Kaluza Full time

    Location: Bristol, London, Edinburgh, (Including Hybrid) Kaluza wants to power a world where net-zero is within everyone's reach by building a platform that will accelerate a sustainable, affordable and resilient energy transition. Since launching in 2019, Kaluza's technology has empowered some of the biggest energy suppliers to better serve millions of...


  • London, Greater London, United Kingdom CIRCLE Full time

    Circle is a financial technology company at the epicenter of the emerging internet of money, where value can finally travel like other digital data — globally, nearly instantly and less expensively than legacy settlement systems. This ground-breaking new internet layer opens up previously unimaginable possibilities for payments, commerce and markets that...


  • London, Greater London, United Kingdom ESL Faceit Group Full time

    This job is brought to you by Jobs/Redefined, the UK's leading over-50s age inclusive jobs board. At EFG (ESL FACEIT Group) we create worlds beyond gameplay, where players and fans become a community. We pride ourselves in having a corporate social responsibility which is that "IT'S NOT GG, UNTIL IT'S GG FOR ALL". Our passion, craft, and DNA are aligned...


  • London, Greater London, United Kingdom Society of Research Software Engineering Full time £55,920

    The IT & Technical Services department's Operations team is seeking a Senior Site Reliability Engineer to support the growing portfolio of services it provides to EMBl-EBI's service and research teams. The Operations team is responsible for maintaining and developing the Institute'sTransfer Services, the application and monitoring systems for our software...


  • London, Greater London, United Kingdom Palantir Technologies Full time

    Palantir builds the world's leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more. We're looking for Site Reliability Engineers who can help us build, operate, and...


  • London, Greater London, United Kingdom Barclay Simpson Full time £90,000

    Senior Site Reliability Engineer (SRE)| AWS | Central London | Up to £95k + 12% bonus + generous stock| Travel The world's largest travel company are looking for a Senior Site Reliability Engineer to join their team. In terms of scale, you will make a real impact in the growth and development of our clients well known application / website which operates...

  • Cloud Engineer

    2 weeks ago


    London, Greater London, United Kingdom CENTRIC SOFTWARE Full time

    This role is 100% Remote open to EU or UK-based applicants only. We are looking for a Cloud Architect to support the strategic directions for cloud infrastructure, drive operational delivery and help raise the service quality for 600+ global customers in Luxury Apparel, Outdoor Sports, Footwear, Cosmetics, Food and Beverage segments, with its industry...


  • London, Greater London, United Kingdom ByteHire Full time

    Reference: BH-298cJob Role: Senior Site Reliability EngineerJob Type: ContractIR35: Inside IR35Day Rate: £600/DayContract Duration: 6 monthsWorking Hours: 5 days per weekRemote Working: 4 days remote working. 1 day on-site in LondonLocation: Hybrid Remote/London (UK only)Role Overview:We're looking for a Senior Site Reliability Engineer with deep Google...


  • London, Greater London, United Kingdom NP Group Full time

    Site Reliability Engineer – Google CloudLondonExcellent Salary & Package including BonusKey Skills – SRE, GCP (Enterprise Deployments), HELM, Python/Golang/Java, IAC/Automation, Blockchain Technologies, Node Infrastructure, Security HardeningOverviewAn important member of a skilled engineering team creating cloud native infrastructure and development...


  • London, Greater London, United Kingdom JAM Software GmbH Full time

    Title: Senior Engineer - Client Lead Portfolios Job Type: Contract The company has been helping their clients build better financial futures for over 50 years. join our Technology and Enterprise Services team and feel like you're part of something bigger. About Technology Technology and Enterprise Services refers to the running of the Technology, Cyber,...


  • London, Greater London, United Kingdom Freshtechit Full time

    Senior Site Reliability Engineer ( Global organisation) Hybrid workingWe are working with a leading global organisation based in the heart of London. We are looking for an experienced hands-on Senior SRE to join the talented Infrastructure team – helping to maintain and develop the platform that powers all our online products.You will work closely with the...