AI Engineer

2 weeks ago


London, Greater London, United Kingdom | Quadrivia AI | Full time | £60,000 - £120,000 per year

About Us
Quadrivia is the health technology company behind Qu, a comprehensive, controllable, and customizable AI assistant built by clinicians, for clinicians. Addressing the urgent shortage of healthcare professionals, Qu provides real-time, personal, and reliable support for clinical tasks across the care continuum. Designed for providers, payers, and pharmaceutical companies, Qu integrates seamlessly into their workflows and delivers precise assistance.

The Role
Own and evolve the core "brain" service that powers Qu. Design, build, and operate multi-agent LLM systems that communicate in real time over text and voice. Ship fast Python services with FastAPI, keep latency low, quality high, and evaluation continuous.

What You'll Do

  • Own Qu's brain service end to end: architecture, SLAs, latency budgets, error modes, rollouts.
  • Low-latency comms: streaming text and voice, VAD, barge-in, turn-taking, interruption handling. WebRTC, SIP, and LiveKit experience is a strong plus.
  • Multi-agent orchestration: planner–executor–critic patterns, role routing, shared memory, tool routers, coordination protocols, and evaluation.
  • Reasoning & optimization: ReAct, Chain-of-Thought, plus Tree-/Graph-of-Thoughts when useful.
  • Programmatic prompt optimization: DSPy for prompt/program compilation; integrate MIPRO and GEPA for iterative prompt evolution under eval constraints.
  • RAG engineering: high-signal retrieval (chunking, hybrid search, re-ranking), query rewriting, compression, caching, freshness, and strong grounding; evaluate faithfulness, context precision/recall, and answer relevancy.
  • Evaluation & observability: Before each call, validate inputs, enforce safety, and verify retrieval quality for RAG. During the call, trace prompts, tool calls, and token/latency/cost, and enforce streaming guardrails. After the call, run automated task evals (faithfulness, relevancy, hallucination, safety), regression checks, red-teaming, and CI/CD gates. Instrument with structured logs and OpenTelemetry, surface dashboards and alerts, and feed live traffic slices into shadow evals for drift detection.

Minimum Qualifications

  • 5+ years in ML or backend engineering in product environments; recent focus on LLM systems.
  • Expert Python. Strong FastAPI, asyncio, pydantic, and production observability.
  • Real-time systems: you've built or integrated low-latency text/voice and have used LiveKit, Pipecat, or similar tech.
  • Working knowledge of agent patterns and eval-driven development.
  • Hands-on with ReAct and CoT; pragmatic with ToT/GoT tradeoffs.
  • Prior startup experience.

Nice To Have

  • DSPy for compilation and self-improving workflows; MIPRO/GEPA integration.
  • Experience with evaluation tooling and LLM-as-judge setups.
  • WebRTC/SRTP, jitter buffers, SIP basics; LiveKit a plus.
  • LiveKit Agents, SIP–WebRTC gateways, TURN/SFU tuning.
  • GCP: Cloud Run/GKE, Pub/Sub, Vertex AI, GCS, Secret Manager, Cloud Logging/Trace.
  • Healthcare data familiarity.

Example Problems You'll Tackle

  • Push median voice round-trip under 2 seconds while preserving turn-taking and barge-in.
  • Set up OTEL-first tracing for the agent graph with automated eval triggers on production traffic slices.
  • Improve our RAG pipeline with hybrid retrieval and re-ranking, then prove gains via faithfulness and context metrics with regression harnesses.
  • Turn EHR integrations into LLM tools.

Tech Stack
Python, FastAPI, pydantic, asyncio, Redis, Postgres, vector stores, WebRTC stacks, LiveKit, SIP gateways, STT/TTS, Docker, Terraform, K8s, OTEL, DeepEval.

What You Get

  • Work on cutting-edge real-time agent tech with a best-in-class team in healthtech.
  • Fun off-sites in Barcelona.
  • High-tech laptop and solid dev ergonomics.
  • Flexibility: work from home or hybrid in Barcelona/London.
