Director Machine Learning
2 weeks ago
Summary
The Director of IT Incident and Problem Management is a senior leader responsible for shaping and transforming incident and problem management into a predictive and proactive discipline. You will drive a proactive, agile approach to incident response, building and leveraging AI-driven insights to enhance responsiveness and operational efficiency. Your leadership will underpin our pivot from a product to a platform-focused service, ensuring seamless, resilient service delivery that meets our high standards for reliability and customer satisfaction.
As a forward-thinking leader, you will balance traditional ITIL frameworks with modern tools and practices, such as incident.io and FireHydrant, and embed accountability across engineering and operational teams. You will work closely with cross-functional stakeholders including Engineering, Product, and Customer Support to ensure that incidents are resolved promptly and root causes are addressed comprehensively, with the overarching goal of minimizing business impact.
How will you contribute?
Strategic Leadership: Provide visionary leadership to evolve our incident and problem management practices, embedding modern approaches that use AI and automation and predictive capabilities to reduce response times and predict potential issues before they impact service.
Accountability and Performance: Foster a culture of accountability, holding engineering teams and incident responders to high standards for incident resolution. Ensure robust tracking and reporting of incident response metrics, creating transparency and setting clear performance expectations.
Platform-Centric Incident Management: Drive alignment between incident/problem management and the organization's shift towards a unified platform model, ensuring that incident management processes are scalable, adaptable, and aligned with platform objectives.
Modern Tool Proficiency: Deploy and optimize advanced incident management platforms such as incident.io and FireHydrant, utilizing these tools to enhance visibility, speed, and effectiveness of response across our platform. Adapt methodologies beyond traditional ITIL to remain agile and customer-focused.
Root Cause Analysis and Prevention: Lead comprehensive root cause analysis for major incidents, advocating a preventative stance through continuous improvement and resilience-focused practices. Apply SRE principles and drive actionable outcomes to prevent recurrence.
Data-Driven Insights and Reporting: Utilize data-driven insights to inform incident response strategies. Present trends, risk factors, and improvement opportunities to senior executives and stakeholders, supporting business decisions with clear, actionable metrics.
Typical Tasks:
Define and implement strategic roadmaps for incident and problem management, ensuring alignment with business objectives and platform goals. Regularly update practices to incorporate the latest in AI, automation, and predictive analytics.
Oversee major incident response efforts, ensuring fast, effective containment, resolution, and customer impact mitigation. Lead executive-level post-mortems and ensure comprehensive follow-ups.
Conduct and oversee in-depth root cause analyses for recurring or high-impact incidents, developing and deploying preventive measures across the platform to reduce recurrence.
Collaborate closely with IT operations, engineering, product, and support teams to ensure a unified approach to incident and problem resolution, with a focus on consistent customer experience.
Define, monitor, and optimise KPIs and performance metrics related to incident and problem management. Lead continuous improvement initiatives to ensure process agility and alignment with evolving business requirements.
Lead continuous improvement initiatives, including evaluating and refining AI algorithms and predictive models to align with evolving business needs and platform scalability.
Drive modular and scalable incident management practices, adaptable to the complexities of a multi-service platform architecture.
Develop and deliver reports on incident and problem management metrics for stakeholders, including executive leadership, product management, and customer success teams, to provide insights into trends, risks, and opportunities for improvement.
What will you bring?
Strategic Incident and Problem Management Expertise: 10-15 years of experience in IT incident and problem management, ideally within SaaS and platform-based environments, with a minimum of 5 years in a senior leadership capacity.
Modern Practices in Incident Management: Demonstrated expertise in using cutting-edge incident management tools (e.g., incident.io , FireHydrant) and AI-driven solutions to streamline processes, drive rapid response, and enhance service reliability.
Problem Management: Expertise in leading comprehensive root cause analysis and problem resolution efforts, incorporating Google SRE principles for preventive actions.
Google SRE Methodologies: In-depth knowledge of Google SRE philosophies, including error budget management, service level indicators/objectives (SLIs/SLOs), and effective incident response strategies.
Platform and SaaS Experience: Strong understanding of platform-oriented operations within B2B SaaS, ideally with experience in supporting a pivot from product to platform. FinTech experience is advantageous but not required.
Leadership and Accountability: Proven record of building and leading high-performing teams, with an emphasis on holding teams accountable to clear standards and ensuring consistency in incident response and resolution.
Collaborative Communication Skills: Excellent ability to influence and collaborate with cross-functional teams and executive-level stakeholders. Skilled in delivering complex insights to both technical and non-technical audiences.
Innovation and Continuous Improvement: Ability to drive continuous improvement through innovative practices, data insights, and strategic thinking. An advocate for evolving incident/problem management to proactively support business goals.
Cross-cloud environments: Experience managing incident and problem resolution in cross-cloud environments, ideally with a focus on seamless integration of diverse platforms.
Preferred Qualifications:
Bachelor’s degree in Computer Science, Information Systems, or a related field; a Master’s degree is preferred.
ITIL Expert certification and familiarity with Google SRE principles; advanced certifications in cloud platforms (AWS, GCP, Azure) or incident management tools are highly advantageous.
Familiarity with leveraging AI and machine learning within incident and problem management to predict incidents, automate responses, or identify root causes, showcasing an ability to bring innovative solutions to the role.
#J-18808-Ljbffr
-
Machine Learning Strategist
1 month ago
Belfast, United Kingdom Ocho Full timeAbout the RoleOcho is a leading tech recruitment agency connecting top talent with innovative organizations. We're seeking a seasoned Principal Machine Learning Architect to lead our client's Machine Learning Engineering team. The ideal candidate will have a strong background in statistical modeling, data mining, and machine learning algorithms, with...
-
Machine Learning Specialist
3 weeks ago
Belfast, United Kingdom Ocho Full timeR&D Engineer AI/ML Specialist - VFX/Gaming NEW US FDI...Visual FX/CGI /Entertainment tech industry - Belfast - maybe this is the role to come back home to?Ocho is excited to partner with an new US FDI who are looking for an experienced R&D Engineer AI/ML Specialist About the Role: Lead the development of animation diffusion models, utilizing existing...
-
Machine Learning Specialist
3 weeks ago
Belfast, United Kingdom Ocho Full time €70,000R&D Engineer AI/ML Specialist - VFX/Gaming NEW US FDI... Visual FX/CGI /Entertainment tech industry - Belfast - maybe this is the role to come back home to? Ocho is excited to partner with an new US FDI who are looking for an experienced R&D Engineer AI/ML Specialist About the Role: Lead the development of animation diffusion models, utilizing existing...
-
Machine Learning Specialist
3 weeks ago
Belfast, United Kingdom Ocho Full timeR&D Engineer AI/ML Specialist - VFX/GamingNEW US FDI... Visual FX/CGI /Entertainment tech industry - Belfast - maybe this is the role to come back home to?Ocho is excited to partner with an new US FDI who are looking for an experienced R&D Engineer AI/ML SpecialistAbout the Role:Lead the development of animation diffusion models, utilizing existing animation...
-
Machine Learning Specialist
3 weeks ago
Belfast, United Kingdom Ocho Full timeR&D Engineer AI/ML Specialist - VFX/Gaming NEW US FDI... Visual FX/CGI /Entertainment tech industry - Belfast - maybe this is the role to come back home to? Ocho is excited to partner with an new US FDI who are looking for an experienced R&D Engineer AI/ML Specialist About the Role: Lead the development of animation diffusion models, utilizing existing...
-
Machine Learning Specialist
3 weeks ago
Belfast, United Kingdom Ocho Full time €70,000R&D Engineer AI/ML Specialist - VFX/Gaming NEW US FDI... Visual FX/CGI /Entertainment tech industry - Belfast - maybe this is the role to come back home to? Ocho is excited to partner with an new US FDI who are looking for an experienced R&D Engineer AI/ML Specialist About the Role: Lead the development of animation diffusion models, utilizing existing...
-
Machine Learning Specialist
3 weeks ago
Belfast, United Kingdom Ocho Full timeR&D Engineer AI/ML Specialist - VFX/GamingNEW US FDI... Visual FX/CGI /Entertainment tech industry - Belfast - maybe this is the role to come back home to?Ocho is excited to partner with an new US FDI who are looking for an experienced R&D Engineer AI/ML SpecialistAbout the Role:Lead the development of animation diffusion models, utilizing existing animation...
-
Machine Learning Specialist
3 weeks ago
Belfast, United Kingdom Ocho Full timeR&D Engineer AI/ML Specialist - VFX/GamingNEW US FDI... Visual FX/CGI /Entertainment tech industry - Belfast - maybe this is the role to come back home to?Ocho is excited to partner with an new US FDI who are looking for an experienced R&D Engineer AI/ML SpecialistAbout the Role:Lead the development of animation diffusion models, utilizing existing animation...
-
Machine Learning Specialist
3 weeks ago
Belfast, United Kingdom Ocho Full timeR&D Engineer AI/ML Specialist - VFX/Gaming NEW US FDI... Visual FX/CGI /Entertainment tech industry - Belfast - maybe this is the role to come back home to? Ocho is excited to partner with an new US FDI who are looking for an experienced R&D Engineer AI/ML Specialist About the Role: Lead the development of animation diffusion models,...
-
Machine Learning Architect
3 weeks ago
Belfast, United Kingdom Hayward Hawk Full timeCompany OverviewHayward Hawk is an innovative technology company dedicated to building transformative solutions that make a real-world impact. Our mission is to leverage technology for good and contribute to projects that benefit society.SalaryThe estimated salary for this position is $120,000 - $180,000 per year, depending on experience.Job DescriptionWe...
-
Machine Learning Platform Architect
3 weeks ago
Belfast, United Kingdom Intapp Full timeAbout the RoleAs a Senior MLOps Engineer at Intapp, you will play a pivotal role in accelerating applied AI. Your primary focus will be on designing, building, and maintaining secure, scalable, and efficient ML platforms that automate the end-to-end life cycle for traditional ML models and LLM models within the Cloud Platforms Engineering (CPE)...
-
Machine Learning Platform Architect
3 weeks ago
Belfast, United Kingdom iO Associates Full timeWe are seeking a highly skilled AI Engineering Team Lead to join our team in Belfast. This is an exceptional opportunity to drive the development of a cutting-edge machine learning platform.As a key member of our team, you will be responsible for leading the development and deployment of machine learning models and systems, ensuring best practices in model...
-
AWS Machine Learning Engineer
3 weeks ago
Belfast, United Kingdom Divvy Cloud Corp. Full timeAbout Divvy Cloud Corp.We are a leading innovator in the field of cloud security, dedicated to delivering cutting-edge solutions to our customers. As an AWS Machine Learning Engineer - Security, you will be part of a talented team that is passionate about using machine learning to improve the security of our customers' digital assets.Your primary...
-
Machine Learning Researcher
2 weeks ago
Belfast, United Kingdom Divvy Cloud Corp. Full timeDetecting threats before they happen is a critical aspect of cybersecurity, and AI plays a key role in making it possible. At Rapid7, we're leveraging AI to supercharge our cybersecurity detections and triage alerts quickly.We're seeking talented AI Engineers to join our team and help us stay ahead of attackers. As an AI Engineer II in Model R&D, you will...
-
Data Scientist with Machine Learning Expertise
4 weeks ago
Belfast, United Kingdom VanRath Full timeSoftware Architect - AI/MLVANRATH is partnering with a prominent global software company to find a skilled Software Architect with expertise in Machine Learning principles.Key Responsibilities:Collaborate with data scientists, software engineers, and stakeholders to design and implement software solutions focusing on Machine Learning principles.Architect...
-
Machine Learning Innovation Lead
3 weeks ago
Belfast, United Kingdom Ocho Full timeAbout OchoOcho is a cutting-edge organization that is pushing the boundaries of what is possible with machine learning. We are passionate about delivering innovative AI solutions and managing AWS infrastructure to support scalable AI deployment.Job Description:We are seeking a highly skilled Machine Learning Innovation Lead to join our team as an R&D...
-
Machine Learning Specialist
2 weeks ago
Belfast, Northern Ireland, Northern Ireland, United Kingdom Ocho Full timeR&D Engineer AI/ML Specialist - VFX/GamingNEW US FDI... Visual FX/CGI /Entertainment tech industry - Belfast - maybe this is the role to come back home to?Ocho is excited to partner with an new US FDI who are looking for an experienced R&D Engineer AI/ML SpecialistAbout the Role:Lead the development of animation diffusion models, utilizing existing animation...
-
Machine Learning Software Engineer
3 weeks ago
Belfast, United Kingdom Divvy Cloud Corp. Full timeResponsibilitiesDesign and implement AI-powered systems for detecting patterns and anomalies in data.Collaborate with cross-functional teams to develop and deploy AI models.Work with senior engineers and researchers to integrate AI technology into existing products.Develop and maintain high-quality code, following best practices and standards.Participate in...
-
Belfast, United Kingdom iO Associates - UKEU Full timeAre you a seasoned Senior Machine Learning Engineer looking to make a meaningful impact in the mental health space?iO Associates - UK/EU is supporting a pioneering Belfast-based scale-up that's harnessing the power of complex technology for good. Their innovative solutions focus on enhancing user performance, guiding individuals and high-performing groups...
-
Machining Technician
3 weeks ago
Belfast, United Kingdom Hunter Savage Full timeJob SummaryThis is an exciting opportunity to join Hunter Savage as a skilled Machining Technician. We are seeking a highly experienced individual to work on our cutting-edge equipment, producing high-quality automotive components. The ideal candidate will have a strong background in manual machining and CNC programming, with experience in setting up and...