Director Site Reliability Engineer

5 days ago


Greater London, United Kingdom Encinos Kapital Full time

Responsibilities

  • Lead engineers who design, implement, and provide operational support for technical infrastructure.
  • Work within and across multi-geography teams to design, develop, test, implement, and support technical solutions across a full stack of development tools and technologies.
  • Translate business requirements into technical designs, considering automation, availability, performance, scale, and cost.
  • Ensure that technical and security best practices, along with Broadridge standards, are adhered to in the design of technical infrastructure.
  • Take part in technical design sessions and work closely with multiple teams, including application development teams, infrastructure teams, vendors, and clients if needed, to review the infrastructure designs for new projects.
  • Deliver high-quality technical infrastructure on time, following Broadridge processes.
  • Lead ongoing automation initiatives for the implementation and operational support of the infrastructure.
  • Provide project planning and oversight, including cost and schedule estimates, scope, and deployment scheduling, taking into account resource and capacity constraints and business priorities.
  • Lead technical implementations, ensuring the quality of the infrastructure and compliance with Broadridge best practice and policy standards.
  • Provide leadership and strategic direction to the SRE team.
  • Manage and track SLIs, SLAs, SLOs, and NFRs to maintain and adhere to established operational standards.
  • Conduct preventative maintenance to ensure capacity, scaling, security, and availability of Broadridge services.
  • Understand hardware and software dependencies between infrastructure components across the processing stack supporting Broadridge services.
  • Collaborate with peers and other technical teams, such as development teams, architecture, and subject matter experts, to reduce or eliminate incident recovery times.
  • Define Service Level Objectives (SLOs) for Broadridge services.
  • Implement operational improvements through automation, monitoring, and incident management to increase the reliability of Broadridge services.
  • Work with technology vendors to understand their product roadmaps and how these align with Broadridge’s Target State Architecture.
  • Manage the SRE team, including hiring, performance reviews, direction, and day-to-day workloads, while identifying areas for improvement and leading by example.
  • Continuously inspire, mentor, and train the SRE team, providing direction on modern practices and technologies.
  • Work with senior leaders to architect solutions with technical vision, maintainability, and total cost of ownership in mind.
  • Participate in and contribute to strategic planning discussions with technical, business, and client stakeholders.
  • Establish the design patterns used by the technical infrastructure.

Qualifications

  • 10+ years of experience with commercial service infrastructure at both a software and infrastructure level.
  • 3+ years of experience managing engineers or technical teams.
  • Strong, proven track record in supporting and delivering production services, preferably in a financial services environment.
  • Functional skills: system design and architecture, DevOps and deployment automation, troubleshooting, and service monitoring.
  • Passionate leader who understands and respects personal and cultural differences.
  • Knowledge of next-generation design patterns and architectures such as microservices, layered patterns, and cloud.
  • Ability to work under pressure and be highly adaptable, with an aptitude for learning new skills and technologies.
  • Strong writing and communication skills for collaboration with various teams and upper management.
  • Solid analytical skills, especially in translating business requirements into technical design, with a continuous focus on aligning the technical roadmap with the immediate and long-term business strategy.
  • Able to adapt to and embrace change, and to support business strategy and vision.

We are dedicated to fostering a collaborative, engaging, and inclusive environment and are committed to providing a workplace that empowers associates to be authentic and bring their best to work. We believe that associates do their best when they feel safe, understood, and valued, and we work diligently and collaboratively to ensure Broadridge is a company, and ultimately a community, that recognizes and celebrates everyone's unique perspective.

Use of AI in Hiring

As part of the recruiting process, Broadridge may use technology, including artificial intelligence (AI)-based tools, to help review and evaluate applications. These tools are used only to support our recruiters and hiring managers, and all employment decisions include human review to ensure fairness, accuracy, and compliance with applicable laws. Please note that honesty and transparency are critical to our hiring process. Any attempt to falsify, misrepresent, or disguise information in an application, resume, assessment, or interview will result in disqualification from consideration.



  • Greater London, United Kingdom Encinos Kapital Full time

    A leading financial services firm in Greater London seeks a Director Site Reliability Engineer to lead technical infrastructure efforts. The role entails managing a team of engineers, designing solutions, and ensuring high availability, security, and performance of services. Candidates should have over 10 years of experience in service infrastructure and...


  • Greater London, United Kingdom Arrows Full time

    This range is provided by Arrows. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range Site Reliability Engineer | Contract | London | Up to £600/day Inside IR35 | Hybrid - Up to £650 per day (Inside IR35) - 2 days per week onsite in London I'm working with a leading media and technology...


  • Greater London, United Kingdom Trades Workforce Solutions Full time

    Site Reliability Engineer (SC Cleared) Duration: 12 Months Rate: £675 per day Location: London or Manchester & remote (hybrid working) IR35 Status: Inside Start: ASAP Role Overview: A Site Reliability Engineer (SC Cleared) is required for our government department to be part of a multidisciplinary team developing and supporting the client's data hub which...


  • Greater London, United Kingdom J Bandy Consulting Full time

    We are hiring for a next generation telecoms software company who are seeking a Network Autonomy Engineer to join their expanding team. Primary Function of the Position Reporting to the Site Reliability Engineer Team Lead, the Site Reliability Engineer will be responsible for ensuring the reliability, scalability and performance of our systems....


  • Greater London, United Kingdom Charles Simon Associates Ltd Full time

    Site Reliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote. THIS IS AN AZURE-FOCUSED ROLE. IF YOU APPLY AND DO NOT WORK EITHER SOLELY OR MAINLY ON AZURE, YOU WILL NOT BE CONSIDERED. Location: Remote (occasional travel to Nottinghamshire HQ). Salary: Up to £95,000 per annum...


  • Greater London, United Kingdom TP ICAP Full time

    Join to apply for the Site Reliability Engineer role at TP ICAP. The TP ICAP Group is a world leading provider of market infrastructure. Our purpose is to provide clients with access to global financial and commodities markets, improving price discovery, liquidity, and distribution of data, through responsible and innovative solutions. Through our people and...


  • Greater London, United Kingdom Stratospherec Ltd Full time

    Overview Senior DevOps Engineer / Senior Site Reliability Engineer Fully Remote working for candidates based in the UK – Salary to £90k + Benefits We are looking for a Senior DevOps Engineer that has strong C# code knowledge combined with strong knowledge of DevOps tools like Kubernetes (EKS or AKS) and Azure or AWS Cloud platforms. We are looking for a...


  • Wigan, Greater Manchester, United Kingdom Searchability® Full time

    SITE RELIABILITY ENGINEER £70,000 p/a Join a growing, technology-driven business operating at scale within the online gaming and sports sector. Opportunity to shape the SRE strategy. ABOUT THE CLIENT Our client is a fast-growing digital technology company at the forefront of delivering high-availability platforms for the sports and gaming industry. They...


  • City of London, Greater London, United Kingdom Amelco Limited Full time

    Role: Site Reliability Engineer Type: Full-time permanent role Location: Hybrid/ Shoreditch, London 3 days per week About Us Amelco Ltd are a leading gaming and gambling solution software provider with a strong presence in the USA, UK, and Europe. Through partnerships with global gaming companies, we build cutting-edge technical platforms across sportsbooks,...


  • London, United Kingdom Prism Digital Full time

    **Site Reliability Engineer | DevOps, Kubernetes | Prestigious Retailer** A prestigious fashion retailer, well known and respected within the industry, is looking to expand their engineering team with a talented Site Reliability Engineer. Our client, a well-funded scale-up, has offices across the globe, and they are entering an exciting and innovative period...