Senior Site Reliability Engineer

1 week ago


London, Greater London, United Kingdom Cisco Full time

About the Role

The Site Reliability Engineering team at Cisco is responsible for providing the tools, services, and infrastructure to monitor and observe the ThousandEyes platform. As a Senior Site Reliability Engineer, you will work together with the team to own our logging pipeline and monitoring stack while working with developers to continuously improve our view of the platform.

Key Responsibilities

  • Design and implement visibility into our platform as we grow to multi-region scale.
  • Design, deploy, and maintain cloud native monitoring services in AWS and GCP that are elastic and resilient to failure.
  • Provide standards and best practices for instrumentation of container based services and cloud managed services.
  • Maintain our alerting pipeline so that we are notified of the right things, at the right time, in the right places.
  • Drive automation wherever possible, enabling our monitoring platforms to scale effortlessly.
  • Participate in and contribute to improve our 24x7 incident response and on-call rotation.

Requirements

  • Strong Infrastructure as Code skills, ideally with Terraform and Kubernetes.
  • Strong knowledge of modern logging tool sets, including Logstash or Fluentd.
  • Understanding of Prometheus and its ecosystem, including Alertmanager.
  • Good knowledge of Application Performance Monitoring tools and crash reporting tools, such as Sentry.
  • Good knowledge of cloud provider managed services, and how they can be leveraged in our context.
  • Ability to write high quality code in Python, Go, or equivalent languages.

About Cisco

Cisco is a global technology leader that has been shaping the future of the internet for over 30 years. We are a company that values diversity, equity, and inclusion, and we are committed to creating a workplace where everyone can thrive. We believe that everyone has something to offer, and we are looking for talented individuals who share our passion for innovation and our commitment to making a positive impact.



  • London, Greater London, United Kingdom Robert Walters Full time

    Job DescriptionSENIOR SITE RELIABILITY ENGINEERSalary: £100,000 + 5% bonusLocation: London, hybrid working with 2 days per week in the officeWe are thrilled to present a remarkable opportunity for a Senior Site Reliability Engineer to join our team at Robert Walters as a Workforce Consultant. As an Employed Workforce Consultant, you will enjoy the benefits...


  • London, Greater London, United Kingdom loveholidays Full time

    {"About us": "At loveholidays, we're a rapidly growing online travel agency that's revolutionizing the way people plan their dream holidays. With a passion for technology and a commitment to innovation, we're constantly pushing the boundaries of what's possible. Our team of experts is dedicated to delivering exceptional customer experiences, and we're...


  • London, Greater London, United Kingdom loveholidays Full time

    {"About us": "At loveholidays, we're a rapidly growing online travel agency that's revolutionizing the way people plan their dream holidays. With a passion for technology and a commitment to innovation, we're constantly pushing the boundaries of what's possible. Our team of experts is dedicated to delivering exceptional customer experiences, and we're...


  • London, Greater London, United Kingdom RemoteStar Full time

    Remote Senior Site Reliability Engineer LeadRemoteStar is seeking a highly skilled Remote Senior Site Reliability Engineer Lead to join our client's team in the UK. This is a fully remote work opportunity.The client is a leading B2B diamond and gemstones marketplace, connecting jewellery retailers to gemstone suppliers.Job SummaryAs the SRE Lead, you will...


  • London, Greater London, United Kingdom Opus Recruitment Solutions Full time

    Site Reliability Engineer | Remote | Competitive SalaryCloud Computing | DevOps | Google Cloud Platform | Amazon Web Services | Kubernetes | Infrastructure | SRE | ELK StackWe are collaborating with a dynamic online retail company seeking to enhance their technical team by adding a Site Reliability Engineer. This role focuses on managing the reliability and...


  • London, Greater London, United Kingdom Canonical Full time

    Job SummaryThis role presents an exceptional opportunity for a seasoned technologist to drive innovation and excellence in cloud infrastructure and automation at Canonical. As a Senior Site Reliability Engineer, you will be responsible for designing and implementing cutting-edge automation solutions, collaborating with cross-functional teams, and ensuring...


  • London, Greater London, United Kingdom Canonical Full time

    Job SummaryThis role presents an exceptional opportunity for a seasoned technologist to drive innovation and excellence in cloud infrastructure and automation at Canonical. As a Senior Site Reliability Engineer, you will be responsible for designing and implementing cutting-edge automation solutions, collaborating with cross-functional teams, and ensuring...


  • London, Greater London, United Kingdom J Bandy Consulting Full time

    {"h1": "Site Reliability Engineer", "p": "At J Bandy Consulting, we're seeking a skilled Site Reliability Engineer to join our team of experienced engineers. As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability and performance of our cloud-agnostic, micro-service network management platform.Your primary responsibilities...


  • London, Greater London, United Kingdom J Bandy Consulting Full time

    {"h1": "Site Reliability Engineer", "p": "At J Bandy Consulting, we're seeking a skilled Site Reliability Engineer to join our team of experienced engineers. As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability and performance of our cloud-agnostic, micro-service network management platform.Your primary responsibilities...


  • London, Greater London, United Kingdom FactSet Full time

    Site Reliability EngineerAt FactSet, we're seeking a skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you'll play a critical role in ensuring the reliability and performance of our systems.ResponsibilitiesCollaborate with cross-functional teams to design, implement, and maintain highly available and scalable...


  • London, Greater London, United Kingdom FactSet Full time

    Site Reliability EngineerAt FactSet, we're seeking a skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you'll play a critical role in ensuring the reliability and performance of our systems.ResponsibilitiesCollaborate with cross-functional teams to design, implement, and maintain highly available and scalable...


  • London, Greater London, United Kingdom Apple Inc. Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a key member of our team, you will play a critical role in supporting and scaling cloud services for thousands of development and operations engineers.Key ResponsibilitiesCloud Service Maintenance: Automate deployment and orchestration of...


  • London, Greater London, United Kingdom Apple Inc. Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a key member of our team, you will play a critical role in supporting and scaling cloud services for thousands of development and operations engineers.Key ResponsibilitiesAutomate Deployment and Orchestration: Automate the deployment and...


  • London, Greater London, United Kingdom Apple Inc. Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a key member of our team, you will play a critical role in supporting and scaling cloud services for thousands of development and operations engineers.Key ResponsibilitiesAutomate Deployment and Orchestration: Automate the deployment and...


  • London, Greater London, United Kingdom Mondrian Alpha Full time

    About Mondrian AlphaMondrian Alpha is a renowned hedge fund with a global presence, seeking a seasoned Site Reliability Engineer to join their London team.Job SummaryWe are looking for a highly skilled Site Reliability Engineer to play a pivotal role in maintaining the technology infrastructure that drives our operations. As part of this team, you will be...


  • London, Greater London, United Kingdom Mondrian Alpha Full time

    About Mondrian AlphaMondrian Alpha is a renowned hedge fund with a global presence, seeking a seasoned Site Reliability Engineer to join their London team.Job SummaryWe are looking for a highly skilled Site Reliability Engineer to play a pivotal role in maintaining the technology infrastructure that drives our operations. As part of this team, you will be...


  • London, Greater London, United Kingdom Mondrian Alpha Full time

    About Mondrian AlphaMondrian Alpha is a renowned hedge fund with a global presence, seeking a seasoned Site Reliability Engineer to join their London team.Job SummaryWe are looking for a highly skilled Site Reliability Engineer to play a pivotal role in maintaining the technology infrastructure that drives our operations, directly contributing to our...


  • London, Greater London, United Kingdom Mondrian Alpha Full time

    About Mondrian AlphaMondrian Alpha is a renowned hedge fund with a global presence, seeking a seasoned Site Reliability Engineer to join their London team.Job SummaryWe are looking for a highly skilled Site Reliability Engineer to play a pivotal role in maintaining the technology infrastructure that drives our operations, directly contributing to our...


  • London, Greater London, United Kingdom Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our services.Key ResponsibilitiesDesign, implement, and maintain large-scale distributed systems and...


  • London, Greater London, United Kingdom Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our services.Key ResponsibilitiesDesign, implement, and maintain large-scale distributed systems and...