Consulting/Principal Site Reliability Engineer

2 weeks ago


London, Greater London, United Kingdom LexisNexis Risk Solutions Full time
Technology
Consulting/Principal Site Reliability Engineer
  • Location: HOME BASED, London, United Kingdom
  • Contract Type: Regular
  • Schedule: 40
  • Job ID: R78731

Principal/Consultant Site Reliability Engineer

Location: The position is open to applicants based in the region of Cardiff, United Kingdom, or looking for Home-Based/Fully-Remote based in the United Kingdom.

LexisNexis Risk Solutions is seeking a Principal/Consultant Site Reliability Engineer with proven industry experience to join our global engineering team.

Our teams are collaborative and forward-thinking; the successful candidate will help shape the operations and support for critical applications, customers and projects, working closely with Development, QA, IT Operations and Customer Operations teams. You will be required to communicate and solve problems effectively, whilst handling a fast-paced working environment.

This will be combined with a transition to DevOps practices, agile support and deployment processes. You will be a leading member of our team working with a diverse range of technologies. You will enjoy working in a friendly environment and benefit from our investment in staff. The role also requires On-Call rotation for off peak hours to maintain 24/7, 365 system availability

Role Definition
This is an advanced professional level role for an SRE. Individuals may be responsible for one or more complex reliability and toil reduction projects. At this level, SREs operate as a subject matter expert in the discipline and will provide guidance to others including product and development teams to define and improve reliability within a product group. Principle SRE requires a deep understanding of system and application code, and will make data-driven recommendations which balance customer, development and operational needs. They are champions for shared services, platforms and architectural standards. Individuals in this role train and/or mentor junior staff.

Scope and Key Responsibilities

  • Recommends service level objectives in partnership Product and Dev teams
  • Master in observability tools and techniques
  • Acts as an escalation during incidents
  • Collaborates with dev to troubleshoot systems and app performance issues
  • Improves the SRE framework
  • Champions shared services and platforms to drive reliability
  • Can create disaster recovery plans including advanced fault injection
  • Advises on SRE training curriculum and content
  • Delivery of resilient application stacks via "Infrastructure as Code" and other DevOps practices
  • Monitoring and on-going support of critical, high revenue business applications
  • Diagnosis and resolution of complex system and application issues
  • Working with diverse technical and non-technical teams, including Development, QA, IT Operations, Customer Operations and Project Management teams
  • Write and maintain systems / application documentation for technical and non-technical
  • Migration of existing applications to Cloud environments

Essential Skills and Attributes

  • Professional experience of working within the public cloud – AWS, Azure
  • Use of orchestration tools such as Terraform, CloudFormation
  • Continuous Integration/Delivery Tools such as - Gitlab, Github, Jenkins
  • Coding and scripting experience such as - PowerShell, Bash, Python or equivalent
  • Configuration management tools such as - Ansible, Puppet, Chef or equivalents
  • Hands-on experience of Windows and Linux servers, including support and troubleshooting.
  • Previous analytic and troubleshooting experience is required
  • Cloud architecture and system design to solve key business problems and facilitate team goals.
  • Experience migrating application from on-premises to public cloud.
  • Experience working with containerised workloads such as Docker and Kubernetes.
  • System and application monitoring such as - Prometheus, Grafana, CloudWatch
  • Familiarity with Log Management tools such as - Elastic Stack, Graylog or Splunk
  • Experience working with relational databases such as MySQL, MS SQL Server or similar
  • Use of Secret Management services such as - Hashicorp Vault
  • Knowledge of change control and associated procedures.
  • Hands-on experience performing application static/dynamic security and penetration assessment with tools such as – SonarQube, CheckMarx, AppScan, BurpSuite, OWASP ZAP Proxy, WebInspect, Fortify, Veracode, Nessus etc.
  • Familiarity with different types of security vulnerabilities and tools for countermeasure
  • Experience with any high-level programming language.

Technical Skills

  • Observability
    • Has a deep technical understanding of observability techniques across the full stack and can bring clarity to complex incidents or performance issues.
    • Able to create templated observability dashboards and configuration using code so that others can implement quickly for their products.
    • Can influence the setting of appropriate SLOs and Error Budgets.
  • Can prepare and assess other SREs for on-call readiness. Can act as a mentor and identifies what training is required.
  • Can conduct a post-mortem so that participants feel safe to contribute. Encourages a culture of learning from failure and shares post-mortem learning with a wider audience.
  • Can act as a senior on-call escalation point for SREs and engineers and provide guidance on restoration.
  • Can work with SRE and Development leads in identifying and rectifying the cause of excessive alerting and pager load caused by bugs, alerting or human processes.
  • Design for Reliability:
    • Has an advanced understanding of systems design including high availability, software deployment and recovery techniques. Advanced understanding of failure modes and systems behaviours.
    • Specialises in one of more technical areas like denial-of-service protection or containerisation. Can provide consulting to others on some specialist topics.
    • Can make decisions on resilience and recoverability of a system by through load testing data and results from fault injection experiment.
    • Can document good SRE design practices and contributes to the SRE Framework.
  • Disaster Recovery.
    • Able to contribute to DR plans including setting recovery priorities, procedures and helping others to carry out specific runbook.
    • Able to create and lead DR practice scenarios and incident response procedures. Skilled in assessing gaps in knowledge and recommendations to improve.
    • Able to plan fault injection (chaos engineering) scenarios to simulate faults on components of system (memory, network, IO) or distributed system
  • Platforms and Automation:
    • Seeks engineering and architecture consensus for new standard components and services for inclusion into the Paved Road, including Platforms, CI/CD.
    • Promotes the adoption and standardisation, and broader contributions via inner sourcing. Has a good understanding of how it benefits the SDLC.
    • Champions the use of Paved Road and drives efficiency by removing rework and silos across technology.
  • Reliability Culture:
    • Creates and shares SRE good practices and training material within the team.
    • Can coach other SREs on specific practices. Able to identify gaps in knowledge and provide guidance for personal development.
    • Can make recommendations to eliminate larger toil projects.
  • Collaboration and Teamwork
  • Customer & External Focus
  • Solves Problems and Analyses Issues
  • Learning Agility
  • Builds Relationships
  • Develops Others

What is it like to work here?

Outstanding - you have probably already got a feel for what we do and the technology we are involved with but what is really stands us out from the crowd is our culture. We are an agile, dynamic, and forward-thinking organisation who understands the importance of looking after our staff. We pride ourselves on delivering high-quality products, providing our employees with interesting challenges for their personal and career development whilst also striking the right balance between work and family life.

Why Work for LexisNexis Risk Solutions (RSG)

Explore our passion for discovery.

Global companies and governmental entities rely on us to solve their most complex data challenges. Our employees collaborate to reduce risks and create opportunities for customers in more than 100 countries. We are adaptable, curious, and ambitious. That is why here, you will have the freedom to drive change, the trust to find your own path, and the space to explore more.

Women in technology:

LexisNexis Risk Solutions Group (RSG) is very supportive of women in Technology and has been a founding signature for the Tech Talent Charter.

Currently, 27% of our Technology workforce are women which is much higher than the UK average of 17%. We have the following initiatives in place to support women in technology:

  • Mentoring scheme for women in technology
  • Women's network forum
  • Regularly run events for schoolgirls about careers in technology to inspire the next generation of girls in tech.

About LexisNexis Risk Solutions Group

LexisNexis Risk Solutions Group is a portfolio of brands that span multiple industries providing customers with innovative technologies, information-based analytics and decision tools and data services that help businesses and governmental entities reduce risk and improve decisions to benefit people around the globe. Headquartered in metro Atlanta, Georgia, we have offices throughout the world and are part of RELX (LSE: REL/NYSE: RELX), a global provider of information and analytics for professional and business customers across industries.

At Lexis Nexis Risk Solutions Group having diverse employees with different perspectives is key to creating innovative new products for our global customers. We have 35 diversity employee networks globally and prioritise ensuring inclusive leadership is part of our culture. Our aim is for every employee to be the best version of themselves. We would actively welcome applications from candidates of diverse backgrounds and underrepresented groups.

We encourage applicants and employees to tell us about any health issues they may have to allow us to put in place reasonable adjustments that may support applicants in the application process and support employees to succeed in their role.

Please read our Candidate Privacy Policy

We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law. We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact EEO is the Law Supplement . Pay Transparency .

#J-18808-Ljbffr

  • London, Greater London, United Kingdom Plutus Full time

    BPP Education is entering a new phase of its growth and evolution, attracting thousands more students each year and expanding into new verticals and new markets globally. The BPP Product & Technology (P&T) organisation is evolving rapidly, and driving transformation of its platforms, digital products and experiences, in order to help BPP Education scale and...


  • London, Greater London, United Kingdom Kbeventservices Full time $10

    About The Role Nexperia aims to achieve a $10 billion turnover by 2030. This goal relies on next-generation technologies, sustainable manufacturing practices, modern facilities, and above all, investing in its workforce. We are currently seeking a Principal GaN Reliability Engineer to join our team.What our Principal GaN Reliability Engineers do- Work within...


  • London, Greater London, United Kingdom J Bandy Consulting Full time

    Our client, a next generation software provider is looking for a Site Reliability Engineer to join their incredible team of experienced, talented and enthusiastic multi-platform engineers. This role is for an site reliability engineer to work on a next generation cloud-agnostic, micro-service network management platform. Remote Working - Based in the UK...


  • London, Greater London, United Kingdom Workingmums Full time £55,450 - £85,000

    For nearly 70 years, AWE has been at the forefront of nuclear weapons research and development. We also use our unique skills to provide wider UK government with counter-terrorism and nuclear threat reduction solutions. The UK Atomic Weapons Establishment is embarking on a Replacement Nuclear Warhead programme, to ensure the UK continuous at sea deterrent. ...


  • London, Greater London, United Kingdom Prism Digital Full time

    Site Reliability Engineer | GCP OR AWS & Kubernetes | SaaS HealthTechThe local headcount currently is 35 in Ireland and 45 in the UK (remote sys admins, tech engineers, field engineers, project managers, programme managers and sales) and expanding the UK office - feels like a start-up with start-up good energy.Our client is around 50% through their GCP...


  • London, Greater London, United Kingdom McDonald's Limited Full time

    The Opportunity:The OpportunityAn exciting opportunity to work as part of the Service Operations Team, the Site Reliability Officer will be responsible for improving the value of IT to the business by reducing the occurrence of systematic issues within our services. These improvements could be technical, procedural or behavioural and will require working...

  • Principal Consultant

    2 weeks ago


    London, Greater London, United Kingdom Bramwith Consulting Full time

    ** Principal Procurement Consultant - Indirect Generalist ** Leading UK Consultancy for Care Sector - Principal Consultant (FM) - London x1 a week - £60k + travel expensed + 10% bonus and other perks Make a significant impact on the care sector with this leading UK consultancy My client is the UK's leading procurement specialist consultancy dedicated to...


  • London, Greater London, United Kingdom Kbeventservices Full time $10

    This role gives you the opportunity to be part of a dynamic team that is driving innovation and sustainability in GaN technology.Work in collaboration with the GaN team and report to the Principal Engineer to ensure the reliability and durability of our products.Location options in Manchester, Hamburg, or Munich.Analyze reliability test results to extract...


  • London, Greater London, United Kingdom Aldwych Consulting Full time £60,000

    Principal Civil Engineer - Exciting Opportunity in Southwark, London A premier civil engineering and transport planning consultancy headquartered in London is seeking a talented Principal Civil Engineer to join their dynamic team of 10 professionals in Southwark. Specializing in customized solutions for the rail, marine, infrastructure, and renewables...


  • London, Greater London, United Kingdom Spencer Ogden Full time

    My client is currently resourcing for a Principal HVDC Engineering Consultant. They are a global energy consultancy, powered by the expertise and experience of it's unique and diverse people. Within the Renewables Team, my client is at the forefront of major international projects where they bring a unique, integrated perspective, specialist expertise and...


  • London, Greater London, United Kingdom NonStop Consulting Full time

    Principal Geotechnical Engineer Award Winning Independent ConsultancyWe are seeking an accomplished Principal Geotechnical Engineer with a track record of excellence to join our esteemed client, an award winning independent consultancy. You'll play a pivotal role within their expanding Southeast-based team.Distinguished for their work with a diverse...


  • London, Greater London, United Kingdom Bramwith Consulting Full time

    Principal Procurement Consultant (indirects) Boutique Procurement & Supply Chain Consultancy London very flexi £60k + Travel Expensed + 10% bonus + Private Medical Operating as the UK's leading procurement & supply chain consultancy for the care sector for over 10 years, they have made a significant impact in the industry and are now looking to expand...


  • London, Greater London, United Kingdom Bramwith Consulting Full time

    Principal Procurement Consultant (indirects) London very flexi £60k + Travel Expensed + 10% bonus + Private Medical Operating as the UK's leading procurement & supply chain consultancy for the care sector for over 10 years, they have made a significant impact in the industry and are now looking to expand their team to accommodate for new and existing...


  • London, Greater London, United Kingdom NonStop Consulting Ltd Full time

    Hi all, we are currently recruiting for Digital Site Reliability Engineer to join Government Department on a contract for 6 months, fully remote work.Essentials skills:- experience with Terraform, CI, CD;- leading assessments;- programming;- eligibility for SC Clearance.Don't miss this


  • London, Greater London, United Kingdom Aldwych Consulting Full time £58,000 - £68,000

    We are looking for a talented Principal Civil Engineer! Are you a skilled Principal Civil Engineer interested in being part of impactful projects that you can take pride in? Do you fancy joining a company that puts a strong emphasis on top-notch engineering, values its employees, and prioritizes work-life balance? Let me tell you about our client - an...


  • London, Greater London, United Kingdom Anson McCade Full time

    Principal Data Engineer London - £90k - £100k + Package2 days per week in Office/ Client SiteUnfortuantely due to the nature of this role we cannot accept candidates who hold a VisaMy client is seeking a Principal Data Engineer to join their dynamic team. As a Principal Data Engineer, you'll work in multidisciplinary teams to create, support, and maintain...


  • London, Greater London, United Kingdom Conrad Consulting Ltd. Full time

    Principal Structural Design EngineerLondon£65k-£70k plus benefitsAre you an experienced structural design engineer looking to work at a principal level? Our client has an industry leading team and are looking to bring in a principal engineer who is hard working and driven. They specialise in interesting projects across a range of sectors including...

  • Reliability Engineer

    2 weeks ago


    London, Greater London, United Kingdom Lenzing Full time

    On a retained and exclusive basis, Hays Engineering is delighted to be partnering with Lenzing in Grimsby to appoint their Reliability Engineer. The newly Appointed Reliability Engineer will be joining an established Engineering team, solving complex engineering / process problems to maximise plant reliability, applying industry standard techniques in order...


  • London, Greater London, United Kingdom Bramwith Consulting Full time

    Job Details: Location:London Posting date:19 Mar 2023 Job type:Permanent Sector:Technology Salary£ £80000Title:Principal Consultant - Strategic ProcurementIndustries: IT and/or MarketingLocation: Home-based contractSalary: £70k-80K + Large BonusSector: Indirect ProcurementThe OpportunityAre you an experienced procurement professional with a track record...

  • Principal Engineer

    2 weeks ago


    London, Greater London, United Kingdom Eames Consulting Full time

    Java Principal EngineerIf you think you are the right match for the following opportunity, apply after reading the complete description.Hybrid - London£95-120,000 base with bonusEames consulting are delighted to be working with an established bank that are on the lookout for an experienced . You will be providing technical leadership and guidance to a team...