Senior AI Infrastructure Manager

4 weeks ago


London, Greater London, United Kingdom microTECH Global LTD Full time
Job Title:
DevOps Engineer

Job Type:
Fixed Term Contract
Our client, microTECH Global LTD, is a global telecommunication company seeking a highly skilled Senior DevOps Engineer to manage their AI Infrastructure Team.

We are looking for an experienced professional to oversee the large-scale AI development and training infrastructure, ensuring seamless operations and optimized performance.

The successful candidate will collaborate with development teams, providing them with the necessary resources and support to run their projects efficiently.

Key Responsibilities:

  • Configure, scale, and maintain Kubernetes clusters and Rancher for multi-cluster management, ensuring optimal performance and resource allocation.
  • Manage GPU resources and servers, ensuring efficient resource scheduling, load balancing, and performance optimization for AI workloads.
  • Maintain and optimize large storage systems, ensuring high availability, performance, and data persistence.
  • Monitor and troubleshoot AI infrastructure, ensuring the smooth operation of AI workloads.
  • Collaborate with data scientists and AI developers to optimize AI frameworks and ensure data security, encryption, and backup procedures are in place.
  • Strong understanding of GPU resource management and optimization for AI workloads.
  • Expertise in managing large storage systems and implementing data persistence strategies.
  • Proficiency in scripting and automation (Python, Bash, Go), with experience in infrastructure as code (IaC) using Terraform, Ansible, or similar tools.
  • Experience with monitoring and logging tools such as Prometheus, Grafana, and ELK.
  • Experience in managing hybrid cloud environments.
  • Preferred Mandarin Speaker.

What We Offer:
A challenging and rewarding role with opportunities for professional growth and development in a dynamic and innovative company.

  • London, Greater London, United Kingdom AI Safety Institute Full time

    Role OverviewThe AI Safety Institute is seeking a highly skilled Senior AI Safety Researcher to join its Safeguard Analysis Team. The successful candidate will play a key role in researching and developing interventions that secure systems from abuse by bad actors.About the RoleThis is a challenging and rewarding opportunity for an experienced researcher to...


  • London, Greater London, United Kingdom Sacher AI Full time

    **Job Summary:**Sacher AI is looking for a skilled Senior Generative AI Specialist to join our team. As a key member of our AI Research and Innovation Lab, you will lead the advancement of LLM and generative AI projects, managing the entire lifecycle from concept to deployment.**About Us:**We are a small but fast-growing team at an exciting stage of...


  • London, Greater London, United Kingdom Aitopics Full time

    Job Title: Senior Infrastructure Engineer - AI Development and TrainingHuawei R&D UK is seeking a highly skilled Senior IT Engineer to manage a large-scale AI development and training infrastructure.The role involves overseeing GPU servers, Kubernetes clusters (Rancher), and storage systems to ensure seamless operations and optimized performance.You will...


  • London, Greater London, United Kingdom Ai Brainer Full time

    Senior Product Manager - AI DivisionWe are seeking an experienced Senior Product Manager to lead the development of our AI-powered learning solutions. The ideal candidate will have a strong background in product management, with a focus on AI technologies and education.About PreplyPreply is a leading online language learning platform that connects learners...


  • London, Greater London, United Kingdom Signal AI Full time

    About the Reputation TeamThe Reputation Team at Signal AI is dedicated to delivering exceptional customer experiences in the Reputation space. Our mission is to provide innovative tools and solutions that help PR executives and Chief Communications Officers navigate the vast volume of world media data.As a key member of our team, you will be responsible for...

  • AI Expert

    3 days ago


    London, Greater London, United Kingdom Engine AI Full time

    Senior AI EngineerWe're expanding our AI capabilities at Engine AI and seeking a seasoned Senior AI Engineer to spearhead the development of Data Agents. This role involves crafting tools that translate natural language queries into actionable insights, including SQL query generation, entity matching, and data visualizations.As a key member of our team,...

  • Senior AI Architect

    1 week ago


    London, Greater London, United Kingdom Stealth AI Startup Full time

    We are a stealth-mode startup revolutionizing AI orchestration with our groundbreaking Agentic platform, enabling seamless collaboration between human and AI agents. Backed by visionary founders with a proven track record, this role offers the opportunity to pioneer AI-driven innovation.About the OpportunityOur cutting-edge Agentic platform requires a...


  • London, Greater London, United Kingdom Recursion Full time

    About the RoleWe are seeking a Senior AI/HPC Storage Engineer to join our innovative team at Recursion, a pioneering TechBio company that leverages AI and machine learning to decode biology and accelerate drug discovery.In this role, you will be instrumental in designing, implementing, and managing advanced AI/HPC data systems that propel our groundbreaking...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    AI Safety Institute Job OverviewThe Department for Science, Innovation and Technology is seeking a highly skilled Senior AI Research Specialist to join its esteemed team at the forefront of artificial intelligence safety research. This role offers an exceptional opportunity for individuals with expertise in machine learning, large language models, and...


  • London, Greater London, United Kingdom Encord Full time

    About UsAt Encord, we're pushing the boundaries of AI infrastructure. Our biggest challenge is ensuring data quality, which is crucial for AI applications. We're expanding our Product team to build a better product for our customers.The RoleAs a Product Manager at Encord, you'll drive the development and growth of our AI infrastructure products. Your...


  • London, Greater London, United Kingdom Tag Full time

    AI Infrastructure SpecialistWe are seeking a highly skilled AI Infrastructure Specialist to join our team in London. In this role, you will be responsible for designing and implementing AI infrastructure that meets the needs of our data science teams.About the RoleYou will work closely with our data science teams to ensure seamless integration of machine...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    As advanced AI systems continue to evolve, the potential risks associated with their cyber capabilities pose a significant threat to organizational and individual security. These risks are particularly concerning when combined with other AI risk areas, such as harmful outcomes from biological and chemical capabilities, and autonomous systems.The AI Safety...


  • London, Greater London, United Kingdom Aitopics Full time

    About the JobHuawei R&D UK is seeking an experienced Senior AI Engineering Manager to lead the development and training of large-scale AI infrastructure. The ideal candidate will have a strong background in Kubernetes, hardware management, and automation.Key ResponsibilitiesOversee the setup and maintenance of GPU servers, Kubernetes clusters, and storage...


  • London, Greater London, United Kingdom Xcede Full time

    Xcede is seeking an experienced Ai Infrastructure Engineer to join our growing GenAI team. This role requires a strong background in Python and proficiency in AWS, with a bonus for experience with Kafka, Databricks, and RAG. Your primary responsibility will be to develop effective prompts for AI models while fine-tuning them, collaborating with Data Science...


  • London, Greater London, United Kingdom Genie AI Full time

    About the Role:We are seeking a highly skilled Senior Product Designer to join our team at Genie AI. As a Senior Product Designer, you will play a key role in shaping the future of law with AI.Key Responsibilities:Develop and implement user-centered design strategies to improve the user experience of our AI-powered legal document drafting...


  • London, Greater London, United Kingdom Writer Full time

    About WriterWriter is a leading full-stack generative AI platform that delivers transformative ROI for top-tier enterprises. Named one of the top 50 companies in AI by Forbes, Writer empowers hundreds of customers like Accenture, Intuit, L'Oreal, and Vanguard to revolutionize their workflow.We offer an all-in-one solution that simplifies deploying...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    We're pushing the boundaries of AI safety research at the AI Safety Institute. As a research scientist, you'll be part of a dynamic team exploring the risks of autonomous AI systems. Your expertise will help us advance the state of the science in risk modeling, incorporating insights from safety-critical and adversarial domains. You'll work closely with...


  • London, Greater London, United Kingdom Microsoft Full time

    We are looking for a highly skilled and motivated AI Infrastructure Specialist to join our team at Microsoft AI.About UsAt Microsoft AI, we are on a mission to create the leading pretraining platform to develop the world's most capable AI frontier models. This platform will span one of the world's foremost GPU clusters, pushing the boundaries of scale,...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Estimated Salary: £80,000 - £110,000 per annumAbout the RoleWe are seeking an exceptional Senior AI Safety Researcher to join our team at the AI Safety Institute. This is a unique opportunity to contribute to the development of safety cases and advance the field of AI governance.Key ResponsibilitiesConduct foundational research on safety cases to help...

  • Senior Data Scientist

    3 weeks ago


    London, Greater London, United Kingdom Higher - AI recruitment Full time

    About the RoleOVO Group is a leading energy technology company driven to create a world with clean, affordable energy for everyone. Since its launch in 2009, the company has welcomed over a million members and planted a million trees. They aim to change energy for the better by driving progress towards net zero carbon living.The Tech org at OVO is entering...