AI Infrastructure Specialist

2 weeks ago


London, Greater London, United Kingdom Microsoft Full time

We are looking for a highly skilled and motivated AI Infrastructure Specialist to join our team at Microsoft AI.

About Us

At Microsoft AI, we are on a mission to create the leading pretraining platform to develop the world's most capable AI frontier models. This platform will span one of the world's foremost GPU clusters, pushing the boundaries of scale, performance, and reliability.

The AI Pre-training Platform team is responsible for all aspects of infrastructure including scalability, benchmarking, kernel development, performance optimizations, communications, and fault tolerance to support our model pre-training operations.

Job Description

You will design and develop Python and CUDA/HIP C++ code that enables distributed training of multimodal LLMs ingesting text, audio, images, or video data. Additionally, you will build and maintain cutting-edge infrastructure that can store and process the petabytes of data needed to power models.

You will also partner with the pretraining and post-training teams to improve our data recipe by rigorous and careful experimentation. Furthermore, you will collaborate with the product team and other engineers and researchers across Microsoft AI to identify gaps in the current generation of models.

Requirements
  • A Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modeling, or data engineering work;
  • OR A Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, or data engineering work.
Estimated Salary Range: $140,000 - $200,000 per year

  • London, Greater London, United Kingdom Tag Full time

    AI Infrastructure SpecialistWe are seeking a highly skilled AI Infrastructure Specialist to join our team in London. In this role, you will be responsible for designing and implementing AI infrastructure that meets the needs of our data science teams.About the RoleYou will work closely with our data science teams to ensure seamless integration of machine...


  • London, Greater London, United Kingdom Mesh-AI Full time

    About Us: Mesh-AI is a forward-thinking company dedicated to harnessing the power of artificial intelligence to drive business innovation and efficiency.Job Title: AI Business Growth SpecialistAbout the Role:We are seeking an experienced AI Business Growth Specialist to join our team. As a key member of our organization, you will be responsible for...


  • London, Greater London, United Kingdom Sacher AI Full time

    **Job Summary:**Sacher AI is looking for a skilled Senior Generative AI Specialist to join our team. As a key member of our AI Research and Innovation Lab, you will lead the advancement of LLM and generative AI projects, managing the entire lifecycle from concept to deployment.**About Us:**We are a small but fast-growing team at an exciting stage of...

  • AI Safety Specialist

    3 weeks ago


    London, Greater London, United Kingdom Mistral AI Full time

    Our team at Mistral AI is looking for an AI Safety Specialist to join our ranks. As an AI Safety Specialist, you will be responsible for evaluating, enhancing, and building safety mechanisms for our large language models. The ideal candidate will have a strong background in AI, computer science, or a related field. They will be familiar with Python and...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    AI Safety Institute Job OverviewThe Department for Science, Innovation and Technology is seeking a highly skilled Senior AI Research Specialist to join its esteemed team at the forefront of artificial intelligence safety research. This role offers an exceptional opportunity for individuals with expertise in machine learning, large language models, and...


  • London, Greater London, United Kingdom Refonte Learning AI Full time

    About Refonte Learning AIRefonte Learning AI is a leading IT corporation globally recognized for its expertise in providing top-notch IT and Ed-tech services. We specialize in Artificial intelligence (AI), digital marketing, data science, data analytics, UI-UX design, web development, and app development.We are dedicated to innovation, excellence, and...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    As advanced AI systems continue to evolve, the potential risks associated with their cyber capabilities pose a significant threat to organizational and individual security. These risks are particularly concerning when combined with other AI risk areas, such as harmful outcomes from biological and chemical capabilities, and autonomous systems.The AI Safety...


  • London, Greater London, United Kingdom Xcede Full time

    Xcede is seeking an experienced Ai Infrastructure Engineer to join our growing GenAI team. This role requires a strong background in Python and proficiency in AWS, with a bonus for experience with Kafka, Databricks, and RAG. Your primary responsibility will be to develop effective prompts for AI models while fine-tuning them, collaborating with Data Science...


  • London, Greater London, United Kingdom Higher - AI recruitment Full time

    About the Job DescriptionThis Data Engineer position is an exciting opportunity to join our team at Higher - AI recruitment and contribute to the development of sophisticated data-driven products that support our clients' journey towards Net Zero.As a mid-senior level Data Engineer, you will have the opportunity to work with cutting-edge technologies and...


  • London, Greater London, United Kingdom Napier AI Full time

    **Compliance Technology Specialists at Napier AI**We are a leading provider of innovative compliance solutions, leveraging the power of artificial intelligence and machine learning to minimize risk and increase efficiency.Job Summary:We are seeking an experienced AI Compliance Integration Specialist to join our team in the UK. As a key member of our...


  • London, Greater London, United Kingdom Signal AI Full time

    About the Reputation TeamThe Reputation Team at Signal AI is dedicated to delivering exceptional customer experiences in the Reputation space. Our mission is to provide innovative tools and solutions that help PR executives and Chief Communications Officers navigate the vast volume of world media data.As a key member of our team, you will be responsible for...


  • London, Greater London, United Kingdom Mistral AI Full time

    About the RoleWe are seeking an experienced AI Solutions Sales Executive to join our team at Mistral AI. In this role, you will be instrumental in driving digital transformation with tech startups, scale-ups, and larger software companies.Key Responsibilities:Sales Strategy and Execution: Develop and execute strategic sales plans to convert leads into valued...


  • London, Greater London, United Kingdom Mesh-AI Full time

    About Mesh-AIWe are a leading specialist Data and AI consultancy that works with major enterprise customers across the Financial Services and Energy Industries. Our projects range from strategy to implementation and deliver groundbreaking results from Data and AI faster than our competitors.As a fast-growing start-up organisation, we offer a huge opportunity...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    Role OverviewThe AI Safety Institute is seeking a highly skilled Senior AI Safety Researcher to join its Safeguard Analysis Team. The successful candidate will play a key role in researching and developing interventions that secure systems from abuse by bad actors.About the RoleThis is a challenging and rewarding opportunity for an experienced researcher to...


  • London, Greater London, United Kingdom Refonte Learning AI Full time

    We are seeking ambitious individuals for our AI & Data Science Study and Internship Program. This initiative offers a unique opportunity to collaborate closely with our seasoned AI & data science team on diverse and impactful projects.As an AI, Data Science, DevOps, and Cloud professional, you will have the chance to gain hands-on experience in these dynamic...


  • London, Greater London, United Kingdom microTECH Global LTD Full time

    Job Title: DevOps EngineerJob Type: Fixed Term Contract Our client, microTECH Global LTD, is a global telecommunication company seeking a highly skilled Senior DevOps Engineer to manage their AI Infrastructure Team. We are looking for an experienced professional to oversee the large-scale AI development and training infrastructure, ensuring seamless...


  • London, Greater London, United Kingdom Writer Full time

    About WriterWriter is a leading full-stack generative AI platform that delivers transformative ROI for top-tier enterprises. Named one of the top 50 companies in AI by Forbes, Writer empowers hundreds of customers like Accenture, Intuit, L'Oreal, and Vanguard to revolutionize their workflow.We offer an all-in-one solution that simplifies deploying...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    We are seeking an exceptional Cybersecurity Research Engineer to join our team at the AI Safety Institute. Our goal is to develop first-of-its-kind government-run infrastructure to benchmark the progress of advanced AI capabilities in cyber security. The selected candidate will work closely with a cross-functional team of cybersecurity researchers, machine...


  • London, Greater London, United Kingdom AI Safety Institute Full time

    About the RoleWe are seeking a highly motivated and talented Research Scientist to join our Societal Impacts team at the AI Safety Institute. The successful candidate will work with our team to design and run studies that answer important questions about the effect AI will have on society.Key ResponsibilitiesDesign and run studies to evaluate the impact of...


  • London, Greater London, United Kingdom Photon Full time

    Cloud Infrastructure Specialist Job DescriptionAt Photon, we are looking for a skilled Cloud Infrastructure Specialist to join our team. As a Cloud Infrastructure Specialist, you will be responsible for designing, implementing, and managing infrastructure as code using Terraform for GCP environments.Responsibilities:Design and implement scalable, reliable,...