Senior AI Infrastructure Engineer

3 weeks ago


London, Greater London, United Kingdom microTECH Global LTD Full time
Job Title: Senior AI Infrastructure Engineer

Job Type: Fixed Term Contract

We are seeking a highly skilled Senior AI Infrastructure Engineer to join our team at microTECH Global LTD. As a key member of our AI Infrastructure Team, you will be responsible for managing large-scale AI development and training infrastructure.

Key Responsibilities:

* Oversee GPU servers, Kubernetes clusters (Rancher), and storage systems to ensure seamless operations and optimized performance.

* Collaborate with development teams to ensure they have the resources and support needed to run their projects efficiently.

* Configure, scale, and maintain Kubernetes clusters and Rancher for multi-cluster management, ensuring optimal performance and resource allocation.

* Manage GPU resources and servers, ensuring efficient resource scheduling, load balancing, and performance optimization for AI workloads.

* Maintain and optimize large storage systems, ensuring high availability, performance, and data persistence.

* Implement and manage role-based access control (RBAC) and ensure data security, encryption, and backup procedures are in place.

Requirements:

* Strong understanding of GPU resource management and optimization for AI workloads.

* Expertise in managing large storage systems and implementing data persistence strategies.

* Proficiency in scripting and automation (Python, Bash, Go), with experience in infrastructure as code (IaC) using Terraform, Ansible, or similar tools.

* Experience with monitoring and logging tools such as Prometheus, Grafana, and ELK.

* Experience in managing hybrid cloud environments.

Preferred Qualifications:

* Mandarin speaker.

  • London, Greater London, United Kingdom Chan Zuckerberg Initiative Full time

    About the Role: We are seeking a highly skilled Senior AI Infrastructure Engineer to join our team at the Chan Zuckerberg Initiative. As a key member of our AI/ML and Data Infrastructure organization, you will play a critical role in building and scaling our shared tools and platforms to support our initiatives. As a Senior AI Infrastructure Engineer, you will...


  • London, Greater London, United Kingdom Chan Zuckerberg Initiative Full time

    The Chan Zuckerberg Initiative is a leading organization in the field of AI/ML and Data Infrastructure. We are seeking a highly skilled Senior AI Infrastructure Engineer to join our team. About the Role: The Senior AI Infrastructure Engineer will be responsible for designing and building efficient, stable, performant, scalable, and secure AI/ML and Data...


  • London, Greater London, United Kingdom Chan Zuckerberg Initiative Full time

    The Chan Zuckerberg Initiative is a pioneering organization that leverages technology to drive meaningful change. As a Senior AI Infrastructure Engineer, you will play a critical role in building and maintaining the technical foundation that enables our mission. The Opportunity: We are seeking a highly skilled engineer to join our AI/ML and Data Infrastructure...


  • London, Greater London, United Kingdom Chan Zuckerberg Initiative Full time

    About the Role: We are seeking a highly skilled Senior AI Infrastructure Engineer to join our team at the Chan Zuckerberg Initiative. As a key member of our AI/ML and Data Infrastructure organization, you will play a critical role in building shared tools and platforms to support our initiatives across the organization. Our team is responsible for designing,...


  • London, Greater London, United Kingdom Chan Zuckerberg Initiative Full time

    The Chan Zuckerberg Initiative is a leader in harnessing the power of technology to drive social impact. We are seeking a highly skilled Senior AI Infrastructure Engineer to join our team and help us build a more inclusive, just, and healthy future for everyone. About the Role: We are looking for a talented engineer to design, build, and scale software systems...


  • London, Greater London, United Kingdom Chan Zuckerberg Initiative Full time

    About the Role: The Chan Zuckerberg Initiative is seeking a highly skilled Senior AI Infrastructure Engineer to join our AI/ML and Data Infrastructure team. As a key member of our team, you will be responsible for designing, building, and scaling software systems to support our mission to build a more inclusive, just, and healthy future for everyone. We are...


  • London, Greater London, United Kingdom ZipRecruiter Full time

    Job Title: Senior MLOps Engineer. La Fosse is currently working with a cutting-edge AI start-up that utilises advanced robots to maximise human capacity and effectiveness. In this role, you will oversee the end-to-end lifecycle of AI/ML models, from development to deployment. You will ensure the reliability, scalability, and security of AI/ML infrastructure,...


  • London, Greater London, United Kingdom Gradient Labs AI Full time

    We are Gradient Labs, a pioneering AI company based in the UK. Our mission is to redefine customer support for the next decade by building a suite of LLM-based autonomous agents that can safely automate complex queries. We are looking for a skilled Backend Engineer to contribute to our "operating system" for future AI agents, ensuring it is safe, scalable, and...


  • London, Greater London, United Kingdom Xcede Full time

    Xcede is seeking an experienced AI Infrastructure Engineer to join our growing GenAI team. This role requires a strong background in Python and proficiency in AWS, with a bonus for experience with Kafka, Databricks, and RAG. Your primary responsibility will be to develop effective prompts for AI models while fine-tuning them, collaborating with Data Science...


  • London, Greater London, United Kingdom Aitopics Full time

    Job Title: Senior Infrastructure Engineer - AI Development and Training. Huawei R&D UK is seeking a highly skilled Senior IT Engineer to manage a large-scale AI development and training infrastructure. The role involves overseeing GPU servers, Kubernetes clusters (Rancher), and storage systems to ensure seamless operations and optimized performance. You will...


  • London, Greater London, United Kingdom Artifact AI Full time

    At Artifact AI, we're pushing the boundaries of accounting automation with intelligent, enterprise-grade AI agents. Our agentic workflows streamline complex, end-to-end accounting processes for businesses and accounting firms, enabling them to scale efficiently and focus on high-value tasks. Artifact AI empowers organizations by delivering automation with...


  • London, Greater London, United Kingdom Chan Zuckerberg Initiative Full time

    The Chan Zuckerberg Initiative is a pioneering organization that combines technology with grantmaking, impact investing, and collaboration to drive progress toward its mission of building a more inclusive, just, and healthy future for everyone. Our Central Operations & Partners team provides the support needed to push this work forward, and we are seeking a...

  • Senior AI Engineer

    2 weeks ago


    London, Greater London, United Kingdom FactSet Full time

    FactSet Senior AI Engineer - Cloud Infrastructure Expert. At FactSet, we are looking for a highly skilled Senior AI Engineer to join our team as a Cloud Infrastructure Expert. This exciting opportunity will involve developing and maintaining machine learning pipelines to support our cutting-edge models, ensuring seamless integration and maintenance of model...


  • London, Greater London, United Kingdom microTECH Global LTD Full time

    Job Title: DevOps Engineer. Job Type: Fixed Term Contract. Our client, microTECH Global LTD, is a global telecommunication company seeking a highly skilled Senior DevOps Engineer to manage their AI Infrastructure Team. We are looking for an experienced professional to oversee the large-scale AI development and training infrastructure, ensuring seamless...


  • London, Greater London, United Kingdom ZipRecruiter Full time

    Job Title: Senior MLOps Engineer (AWS). Job Type: Full-time. Location: Central London. Job Description: We are seeking a highly skilled Senior MLOps Engineer to join our team in Central London. As a key member of our AI/ML infrastructure team, you will be responsible for overseeing the end-to-end lifecycle of AI/ML models, from development to deployment. Key...

