Sr. MLOps Engineer, GenAI
1 day ago
Since launching in Kuwait in 2004, talabat, the leading on-demand food and Q-commerce app for everyday deliveries, has been offering convenience and reliability to its customers. talabat's local roots run deep, offering a real understanding of the needs of the communities we serve in eight countries across the region.
We harness innovative technology and knowledge to simplify everyday life for our customers, optimize operations for our restaurants and local shops, and provide our riders with reliable earning opportunities daily.
Here at talabat, we are building a high-performance culture through an engaged workforce and growing talent density. We're all about keeping it real and making a difference. Our 6,000+ strong talabaty are on an awesome mission to spread positive vibes. We are proud to be a multiple-time Great Place to Work award winner.
*Summary*
*Job Description*
As the leading delivery platform in the region, we have a unique responsibility and opportunity to positively impact millions of customers, restaurant partners, and riders. To achieve our mission, we must scale and continuously evolve our machine learning capabilities, including cutting-edge Generative AI (genAI) initiatives. This demands robust, efficient, and scalable ML platforms that empower our teams to rapidly develop, deploy, and operate intelligent systems.
As an ML Platform Engineer, your mission is to design, build, and enhance the infrastructure and tooling that accelerates the development, deployment, and monitoring of traditional ML and genAI models at scale. You'll collaborate closely with data scientists, ML engineers, genAI specialists, and product teams to deliver seamless ML workflows—from experimentation to production serving—ensuring operational excellence across our ML and genAI systems.
*Responsibilities*
- Design, build, and maintain scalable, reusable, and reliable ML platforms and tooling that support the entire ML lifecycle, including data ingestion, model training, evaluation, deployment, and monitoring for both traditional and generative AI models.
- Develop standardized ML workflows and templates using MLflow and other platforms, enabling rapid experimentation and deployment cycles.
- Implement robust CI/CD pipelines, Docker containerization, model registries, and experiment tracking to support reproducibility, scalability, and governance in ML and genAI.
- Collaborate closely with genAI experts to integrate and optimize genAI technologies, including transformers, embeddings, vector databases (e.g., Pinecone, Redis, Weaviate), and real-time retrieval-augmented generation (RAG) systems.
- Automate and streamline ML and genAI model training, inference, deployment, and versioning workflows, ensuring consistency, reliability, and adherence to industry best practices.
- Ensure reliability, observability, and scalability of production ML and genAI workloads by implementing comprehensive monitoring, alerting, and continuous performance evaluation.
- Integrate infrastructure components such as real-time model serving frameworks (e.g., TensorFlow Serving, NVIDIA Triton, Seldon), Kubernetes orchestration, and cloud solutions (AWS/GCP) for robust production environments.
- Drive infrastructure optimization for generative AI use-cases, including efficient inference techniques (batching, caching, quantization), fine-tuning, prompt management, and model updates at scale.
- Partner with data engineering, product, infrastructure, and genAI teams to align ML platform initiatives with broader company goals, infrastructure strategy, and innovation roadmap.
- Contribute actively to internal documentation, onboarding, and training programs, promoting platform adoption and continuous improvement.
*Requirements*
*Technical Experience*
- Strong software engineering background with experience in building distributed systems or platforms designed for machine learning and AI workloads.
- Expert-level proficiency in Python and familiarity with ML frameworks (TensorFlow, PyTorch), infrastructure tooling (MLflow, Kubeflow, Ray), and popular APIs (Hugging Face, OpenAI, LangChain).
- Experience implementing modern MLOps practices, including model lifecycle management, CI/CD, Docker, Kubernetes, model registries, and infrastructure-as-code tools (Terraform, Helm).
- Demonstrated experience working with cloud infrastructure, ideally AWS or GCP, including Kubernetes clusters (GKE/EKS), serverless architectures, and managed ML services (e.g., Vertex AI, SageMaker).
- Proven experience with generative AI technologies: transformers, embeddings, prompt engineering strategies, fine-tuning vs. prompt-tuning, vector databases, and retrieval-augmented generation (RAG) systems.
- Experience designing and maintaining real-time inference pipelines, including integrations with feature stores, streaming data platforms (Kafka, Kinesis), and observability platforms.
- Familiarity with SQL and data warehouse modeling; capable of managing complex data queries, joins, aggregations, and transformations.
- Solid understanding of ML monitoring, including identifying model drift, decay, latency optimization, cost management, and scaling API-based genAI applications efficiently.
*Qualifications*
- Bachelor's degree in Computer Science, Engineering, or a related field; advanced degree is a plus.
- 3+ years of experience in ML platform engineering, ML infrastructure, generative AI, or closely related roles.
- Proven track record of successfully building and operating ML infrastructure at scale, ideally supporting generative AI use-cases and complex inference scenarios.
- Strategic mindset with strong problem-solving skills and effective technical decision-making abilities.
- Excellent communication and collaboration skills, comfortable working cross-functionally across diverse teams and stakeholders.
- Strong sense of ownership, accountability, pragmatism, and proactive bias for action.
-
Senior MLOps/GenAI Infrastructure Engineer
2 weeks ago
London, Greater London, United Kingdom hackajob Full time £50,000 - £70,000 per year
hackajob is collaborating with BBC to connect them with exceptional tech professionals for this role. Job Details. Job Title: Senior MLOps/GenAI Infrastructure Engineer. Location: London / Salford / Glasgow / Newcastle / Cardiff (this is a hybrid role and the successful candidate will balance office working with home working). Band: D. Salary: up to £59,600 -...
-
Sr. Data Engineer – Industry 4.0
3 days ago
London, Greater London, United Kingdom Cognizant Full time £80,000 - £150,000 per year
JD: Sr. Data Engineer – Industry 4.0. We are hiring a senior Data Engineer to lead the development of intelligent, scalable data platforms for Industry 4.0 initiatives. This role will drive integration across OT/IT systems, enable real-time analytics, and ensure robust data governance and quality frameworks. The engineer will collaborate with...
-
Sr. Data Engineer – Industry 4.0
2 days ago
London, Greater London, United Kingdom Cognizant Technology Solutions Full time £80,000 - £120,000 per year
JD: Sr. Data Engineer – Industry 4.0. We are hiring a senior Data Engineer to lead the development of intelligent, scalable data platforms for Industry 4.0 initiatives. This role will drive integration across OT/IT systems, enable real-time analytics, and ensure robust data governance and quality frameworks. The engineer will collaborate with...
-
MLOps Engineer
1 week ago
London, Greater London, United Kingdom FitNext Co. Full time £45,000 - £90,000 per year
On-Site MLOps Engineer (Kubernetes, Cloud, ML Workflows). London, UK – Soho (next to Tottenham Court Road) | Contractor | Start: ASAP. About the Role: Strong MLOps engineer with exposure in high-volume systems to help implement best practices and scale MLOps practice for a global brand. Must have strong on-prem and Kubernetes experience. This role includes an...
-
GenAI Platform Engineer II
3 days ago
London, Greater London, United Kingdom GSK Full time £80,000 - £120,000 per year
The Onyx Research Data Tech organization is GSK's Research data ecosystem which has the capability to bring together, analyze, and power the exploration of data at scale. We partner with scientists across GSK to define and understand their challenges and develop tailored solutions that meet their needs. The goal is to ensure scientists have the right data...
-
Sr. Engineer, ML Platform
4 days ago
London, Greater London, United Kingdom talabat Full time 180,000 - 250,000 per year
Company Description: Since launching in Kuwait in 2004, talabat, the leading on-demand food and Q-commerce app for everyday deliveries, has been offering convenience and reliability to its customers. talabat's local roots run deep, offering a real understanding of the needs of the communities we serve in eight countries across the region. We harness innovative...
-
GenAI Platform Engineer II
2 days ago
London, Greater London, United Kingdom GSK Full time £120,000 - £180,000 per year
The Onyx Research Data Tech organization is GSK's Research data ecosystem which has the capability to bring together, analyze, and power the exploration of data at scale. We partner with scientists across GSK to define and understand their challenges and develop tailored solutions that meet their needs. The goal is to ensure scientists have the right data...
-
Senior MLOps Engineer
2 weeks ago
London, Greater London, United Kingdom Marks and Spencer Full time £60,000 - £100,000 per year
Summary: We are seeking a passionate Senior MLOps Engineer to join our team and work alongside data scientists to deliver Data Science solutions. This role will help discover and implement state-of-the-art solutions for our ML projects to enable rapid experimentation, increase accuracy of our models and ensure high quality of our products. This role...
-
MLOps Engineer
5 days ago
London, Greater London, United Kingdom Ash by Slingshot AI Full time £80,000 - £120,000 per year
Slingshot AI
Slingshot AI is the team behind Ash, the first AI designed for mental health. Our mission is to make support more accessible and help people change their lives in the ways they want. We're building a world-class team by empowering individuals with the autonomy, flexibility, and support they need to do their best work. We dream big, iterate fast,...