Senior/Principal ML Systems Architect
21 hours ago
We are seeking a highly experienced ML Systems Architect to design and implement a scalable, production-grade architecture for our machine learning solver. This role bridges research prototypes and commercial deployment, ensuring reliability, maintainability, and performance in a mixed technology stack.
Responsibilities- Architect the ML Solver Platform:
- Define modular architecture for data preprocessing, model execution, and post-processing.
- Establish clear API contracts between Python/TensorFlow and C# services.
- Productionize ML Workflows:
- Convert research code into robust, testable, and observable services.
- Implement CI/CD pipelines, automated testing, and reproducibility standards.
- Integration & Interoperability:
- Design REST/gRPC endpoints for cross-language communication.
- Ensure compatibility with C#/.NET services.
- Performance & Scalability:
- Optimize GPU/CPU utilization, batching strategies, and memory management.
- Plan for multi-model and multi-tenant scenarios.
- MLOps & Lifecycle Management:
- Implement model versioning, artifact registries, and deployment workflows.
- Set up monitoring, logging, and alerting for solver performance.
- Security & Compliance:
- Apply best practices for secrets management, dependency scanning, and secure artifact storage.
- ML Frameworks: Expert in TensorFlow (TF2/Keras), experience with ONNX Runtime for inference.
- Programming: Advanced Python for ML; strong understanding of packaging, type checking, and performance profiling.
- Architecture: Proven experience designing scalable ML systems for production.
- APIs: Proficiency in gRPC/Protobuf and REST for cross-language integration.
- MLOps: CI/CD pipelines, containerization (Docker/Kubernetes), model registries, reproducibility.
- Performance Optimization: GPU acceleration (CUDA/cuDNN), mixed precision, XLA, profiling.
- Observability: Metrics, tracing, structured logging, dashboards.
- Security: SBOM, image signing, role-based access, vulnerability scanning.
- Experience with ONNX Runtime Training, PyTorch, or hybrid ML architectures.
- Familiarity with distributed training strategies and multi-GPU setups.
- Knowledge of feature stores and data validation frameworks.
- Exposure to regulated environments and compliance frameworks.
- ML: TensorFlow, ONNX Runtime, tf2onnx.
- APIs: FastAPI, gRPC.
- DevOps: GitLab CI/GitHub Actions, Docker, Kubernetes.
- Monitoring: Prometheus, Grafana, OpenTelemetry.
- Security: HashiCorp Vault, Sigstore.
- Work on cutting-edge ML solutions integrated into commercial engineering software.
- Define architecture that scales across global deployments.
- Collaborate with a team of experts in ML, software engineering, and UI development.
To apply: Send your resume and a brief cover letter to
-
Principal Architect – GPU Shader
20 hours ago
Bristol, Bristol, United Kingdom AMD Full timeWHAT YOU DO AT AMD CHANGES EVERYTHINGAt AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create...
-
Senior ML Runtime Engineer
2 weeks ago
Bristol, Bristol, United Kingdom Fractile Full time £48,000 - £120,000 per yearAt Fractile, we're taking a revolutionary approach to computing to run the world's largest language models 100x faster than existing systems. Our fast-growing team is working at the cutting edge of the latest AI developments in both hardware and software. Want to get involved?We are looking for Senior ML Runtime Engineers with experience of key ML software...
-
Principal Enterprise Architect
17 hours ago
Bristol, Bristol, United Kingdom Logiq Full timePrincipal Enterprise ArchitectLocation:Hybrid; with travel expected to client sites and Logiq's offices in Bristol, Chippenham or Exeter.Salary:Negotiable, plus car allowance, plus up to 10% performance bonus*, plus excellent benefits package.Logiq is a fast-growing Technology and Consultancy Company, providing cutting-edge solutions to high-risk clients...
-
ML Ops Engineer
24 hours ago
Bristol, Bristol, United Kingdom Thales Full timeLocation: Building 550 - Bristol Business Park, United KingdomIn fast changing markets, customers worldwide rely on Thales. Thales is a business where brilliant people from all over the world come together to share ideas and inspire each other. In aerospace, transportation, defence, security and space, our architects design innovative solutions that make our...
-
Data Architect
2 weeks ago
Bristol, Bristol, United Kingdom Indotronix Avani UK Full time £70,000 - £120,000 per yearRole:Cloud Data ArchitectLocation:Bristol, London / Hybrid (2 days a week on site, with travel to client sites as required)Role Type:Permanent / Full-TimeSalary:Depends on experienceRole Summary:As a Senior/Principal Consultant Cloud Data Architect, you will lead the design and implementation of secure, scalable and resilient cloud data platforms in highly...
-
Sr ML Complier Engineer
2 weeks ago
Bristol, Bristol, United Kingdom Fractile Full time £80,000 - £120,000 per yearAt Fractile, we're taking a revolutionary approach to computing to run the world's largest language models 100x faster than existing systems. Our fast-growing team is working at the cutting edge of the latest AI developments in both hardware and software. Want to get involved?We are looking for Senior ML Compiler Engineers with experience in machine learning...
-
Sr ML Complier Engineer
4 days ago
Bristol, Bristol, United Kingdom Fractile Full timeAt Fractile, we're taking a revolutionary approach to computing to run the world's largest language models 100x faster than existing systems. Our fast-growing team is working at the cutting edge of the latest AI developments in both hardware and software. Want to get involved?We are looking for Senior ML Compiler Engineers with experience of machine learning...
-
Software QA – ML Kernels
10 hours ago
Bristol, Bristol, United Kingdom Graphcore Full timeAbout Graphcore Graphcore is one of the world's leading innovators in Artificial Intelligence compute. It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry. As part of the SoftBank Group, Graphcore is a...
-
Senior Systems Engineer
2 weeks ago
Bristol, Bristol, United Kingdom iO Associates Full time £60,000 - £100,000 per year**Job Title: Senior Systems EngineerLocation: BristolJob Type: Permanent**Organisation OverviewOur Client is a dynamic and innovative systems engineering consultancy dedicated to delivering cutting-edge solutions across defence and civil sectors. Renowned for its collaborative culture, commitment to excellence, and emphasis on personal development, the...
-
Senior Hardware Systems Engineer
5 days ago
Bristol, Bristol, United Kingdom Thales Full timeLocation: Cheadle, United KingdomThales people provide armed forces customers with operational advantage at every decisive moment throughout the mission. Defence and armed forces customers rely on us to deliver the full range of defence mission systems solutions at land, sea, and air. Our platforms extend across the battlespace including Above and Sonar,...