Senior HPC Engineer
7 days ago
Millennium's Infrastructure organization is dedicated to designing, engineering, supporting, and managing a robust server estate, systems virtualization, and core enterprise services. We are seeking a Senior HPC Engineer for a hands-on technical leadership position to support WorldQuant's initiative of maintaining financial research leadership. This role is pivotal in designing, building, and maintaining our cutting-edge High-Performance Computing (HPC) and GPU clusters, which are essential for our AI and Machine Learning initiatives. The ideal candidate will have a strong background in HPC environments, with specific expertise in GPU-accelerated computing and advanced storage solutions. You will be responsible for ensuring the reliability, scalability, and performance of our computational infrastructure.
You will join a highly specialized team of exceptionally talented yet refreshingly humble individuals from diverse disciplines. We believe that delivering exceptional services requires the ability to make meaningful changes across the entire stack. Our mission is to solve real business challenges, reduce operational complexities, and foster a collaborative, team-driven environment that promotes mutual growth and success.
Key Responsibilities:
Design and Implementation: Lead the architectural design, implementation, and maintenance of large-scale HPC and GPU clusters.
Storage Management: In collaboration with the storage team, architect and manage high-performance storage solutions tailored for GPU-intensive workloads, ensuring low-latency data access and high throughput.
System Optimization: Monitor, analyze, and tune the performance of the HPC environment, including compute nodes, networking fabrics, and parallel file systems.
Automation: Develop and maintain automation scripts and tools for provisioning, configuration management, and monitoring of the HPC infrastructure.
Collaboration: Work closely with researchers, data scientists, and software engineers to understand their computational needs and provide a robust and efficient platform to accelerate their work.
Troubleshooting: Provide expert-level support for complex issues related to hardware, software, and networking within the HPC ecosystem.
Technology Evaluation: Stay current with emerging technologies and industry trends in HPC, GPU computing, and storage, and conduct evaluations to recommend new solutions.
Contribute to organizational knowledge through documentation, education, and writing maintainable code. Provide guidance to the team in your subject matter expertise.
Qualifications/Skills:
A Bachelor's degree in Computer Science, Engineering, or a related field.
A minimum of 7 years of progressive experience in designing, building, and managing complex HPC environments.
Proven experience with GPU-accelerated computing, including NVIDIA GPUs and associated software (e.g., CUDA).
Deep expertise in high-performance storage systems and parallel file systems (e.g., Lustre, GPFS/Spectrum Scale).
Strong proficiency in Linux/Unix operating systems, scripting languages, and configuration management platforms.
Experience with cluster management and scheduling software (e.g., Kubernetes), with a strong preference for Slurm.
Familiarity with high-speed interconnects like InfiniBand or RoCE.
An understanding of AI technologies and their applications in infrastructure automation and management. Experience with, or a strong interest in, implementing AI/ML solutions for infrastructure optimization, anomaly detection, or predictive analytics.
A passion for technology and automation, with a deep sense of curiosity and ownership.
A hands-on approach to problem-solving and a demonstrable enthusiasm for technology.
Excellent verbal and written communication skills.
Preferred Qualifications:
Master's or Ph.D. in a relevant technical field.
Experience in a buy-side financial organization.
Experience with cloud-based HPC, preferably with GCP.
Knowledge of containerization technologies such as Docker and Singularity.
-
HPC Engineer
16 hours ago
London, Greater London, United Kingdom Linux Recruit Full time £45,000 - £52,500 per year
Specialism: Linux Engineering
Job type: Permanent
Location: London
Salary: £45,000 - £52,500 per annum
Join an internationally renowned institute as it establishes a new High Performance Computing function to support world leading research. Your experience across HPC, Storage and GPUs will allow you to contribute to this innovative team building out a hybrid setup to...
-
HPC Engineer
2 weeks ago
London, Greater London, United Kingdom RED Global Full time £60,000 - £90,000 per year
We are seeking an experienced and highly motivated High-Performance Computing (HPC) Engineer to join our team. The successful candidate will have a proven record of delivering robust HPC services and infrastructure, combined with the ability to work closely with the scientific and research community to optimise computational workflows. The role requires an...
-
Senior HPC Infrastructure Engineer
2 weeks ago
London, Greater London, United Kingdom Hays Full time £90,000 - £120,000 per year
Your new company
Join a pioneering organisation at the forefront of AI and High Performance Computing (HPC) infrastructure. With a strong focus on innovation and ethical computing, this company is building scalable, GPU-optimised environments that support cutting-edge research and enterprise workloads.
Your new role
This is a fully remote, hands-on technical...
-
HPC Systems Engineer
2 weeks ago
London, Greater London, United Kingdom Nscale Full time £60,000 - £90,000 per year
Join Nscale as an HPC Systems Engineer
Are you passionate about Data Centre builds and large scale GPU infrastructure projects? Do you thrive in a fast-paced, high-growth environment where your work has a direct impact on business outcomes? If so, this could be the role for you.
Nscale is the GPU cloud engineered for AI. We provide cost-effective,...
-
Welder (HPC)
6 days ago
London, Greater London, United Kingdom EDF Full time £40,000 - £55,000 per year
The HPC Jobs Service supports local people into exciting, long-term careers across our Project.
Welder (HPC)
APPLY HERE: Job details - Welder (HPC) Severfield Nuclear & Infrastructure Limited
Location: Bridgwater, Somerset, England, United Kingdom
Contract Type: Permanent
Contract: Weekly hours: 38 Hours per Week
Vacancy Summary
At Severfield, we're creating...
-
HPC Operations Engineer
4 days ago
London, Greater London, United Kingdom Jump Trading Full time $150,000 - $175,000
Jump Trading Group is committed to world class research. We empower exceptional talents in Mathematics, Physics, and Computer Science to seek scientific boundaries, push through them, and apply cutting edge research to global financial markets. Our culture is unique. Constant innovation requires fearlessness, creativity, intellectual honesty, and a...
-
Linux - HPC Engineer
2 weeks ago
London, Greater London, United Kingdom Cognizant Full time £65,000 - £90,000 per year
This is an excellent opportunity for Senior Linux HPC Systems Administrator/Engineer professionals to be part of leading-edge technology projects. Cognizant's Cloud, Infrastructure & Security Services Practice provides end-to-end solutions covering architecture, design, implementation, management, and on-going support across the entire enterprise technology...
-
Linux - HPC Engineer
1 week ago
London, Greater London, United Kingdom Cognizant Technology Solutions Full time £60,000 - £120,000 per year
This is an excellent opportunity for Senior Linux HPC Systems Administrator/Engineer professionals to be part of leading-edge technology projects. Cognizant's Cloud, Infrastructure & Security Services Practice provides end-to-end solutions covering architecture, design, implementation, management, and on-going support across the entire enterprise technology...
-
HPC Sales Solutions
6 days ago
London, Greater London, United Kingdom AMAX Full time £40,000 - £80,000 per year
We are seeking a highly motivated HPC Sales Account Manager to expand AMAX's footprint in the HPC and AI market. This individual will be responsible for maintaining and nurturing relationships with our existing and new clients. They will be the main point of contact to ensure customer satisfaction, resolve issues, and identify new opportunities to upsell,...
-
HPC Production Engineer
5 days ago
London, Greater London, United Kingdom Jump Trading Full time $150,000 - $200,000
Jump Trading Group is committed to world class research. We empower exceptional talents in Mathematics, Physics, and Computer Science to seek scientific boundaries, push through them, and apply cutting edge research to global financial markets. Our culture is unique. Constant innovation requires fearlessness, creativity, intellectual honesty, and a...