CUDA Engineer

2 weeks ago


London, United Kingdom KX Full time

About KX

KX software powers the time-aware data-driven decisions that enable fast-moving companies to outpace competitors, realizing the full potential of their AI investments. The KX platform delivers transformational value by addressing data challenges related to completeness, timeliness and efficiency, ensuring companies understand change over time and can achieve faster, more accurate insights at any scale, cost-effectively. KX is essential to the operations of the world's top investment banks, aerospace and defence, high-tech manufacturing, healthcare and life sciences, automotive and fleet telematics organizations. The company has established offices and a robust customer base across North America, Europe, and Asia Pacific.

Overview Of The Role

KX is hiring a Senior CUDA Developer to design, optimise, and deliver high-throughput GPU compute components within the KX platform. This role is primarily focused on CUDA development, but we welcome candidates with experience in any GPU ecosystem, including OpenCL, HIP, or SYCL. You will help drive high-performance computing initiatives across our data and analytics platform.

Key Responsibilities

• Design and implement high-throughput GPU algorithms using CUDA.
• Optimise GPU kernels for memory efficiency, occupancy, and large-scale throughput.
• Contribute to GPU compute modules written primarily in C, with some C++ where applicable.
• Use profiling tools such as NVIDIA Nsight Systems, Nsight Compute, and CUDA profiling tools to identify and resolve bottlenecks.
• Integrate GPU workloads into high-performance data pipelines and HPC environments.
• Collaborate with cross-functional engineering teams to enhance GPU acceleration capabilities across the platform.
• Mentor junior engineers and contribute to internal GPU development standards.
• Participate in architectural planning and long-term GPU development strategy.

Skills

• Strong hands-on experience with CUDA and GPU kernel development.
• Programming experience in C (C++ helpful but not required).
• Understanding of GPU architecture, including SMs, memory hierarchy, and warp execution.
• Experience with any GPU ecosystem such as OpenCL, HIP, or SYCL.
• Knowledge of high-throughput computation, HPC workloads, and parallel algorithms.
• Experience with profiling, debugging, and performance optimisation.

Essential Experience

• Proven experience developing CUDA-based GPU applications.
• Hands-on experience working with high-throughput compute or HPC systems.
• Experience optimising GPU kernels using profiling tools.
• Experience integrating GPU components into production systems.

Preferred Experience

• Experience with OpenCL, HIP, or SYCL.
• Experience mentoring junior engineers.
• Familiarity with distributed computing or large-scale HPC environments.

Location & Workplace Type

This role can be based out of our Dublin, Newry, Belfast or London office and follows a hybrid model.

Why Choose KX

• Data Driven: We lead with instinct and follow fact.
• Naturally Curious: We lean in, listen and learn fast.
• All In: We take ownership, take on challenges and give it our all.

Benefits

• Competitive Salary
• Individually tailored training and skills development
• Private healthcare package and Employee Assistance Programme
• Enhanced maternity and paternity package
• Wellness Days and Volunteer Days


  • CUDA Engineer

    1 week ago


    London, United Kingdom KX Full time

    About KX: KX software powers the time-aware data-driven decisions that enable fast-moving companies to outpace competitors, realizing the full potential of their AI investments. The KX platform delivers transformational value by addressing data challenges related to completeness, timeliness and efficiency, ensuring companies understand change over time and can...

  • CUDA Engineer

    1 week ago


    London, United Kingdom KX Full time

    Job Description: About KX: KX software powers the time-aware data-driven decisions that enable fast-moving companies to outpace competitors, realizing the full potential of their AI investments. The KX platform delivers transformational value by addressing data challenges related to completeness, timeliness and efficiency, ensuring companies understand change...

  • CUDA Engineer

    2 weeks ago


    London, United Kingdom KX Full time

    About KX: KX software powers the time-aware data-driven decisions that enable fast-moving companies to outpace competitors, realizing the full potential of their AI investments. The KX platform delivers transformational value by addressing data challenges related to completeness, timeliness and efficiency, ensuring companies understand change over time and can...

  • CUDA Engineer

    2 weeks ago


    London Area, United Kingdom KX Full time

    About KX: KX software powers the time-aware data-driven decisions that enable fast-moving companies to outpace competitors, realizing the full potential of their AI investments. The KX platform delivers transformational value by addressing data challenges related to completeness, timeliness and efficiency, ensuring companies understand change over time and can...

  • CUDA Engineer

    1 week ago


    Greater London, United Kingdom FD Technologies Full time

    About KX: KX software powers the time-aware data-driven decisions that enable fast-moving companies to outpace competitors, realizing the full potential of their AI investments. The KX platform delivers transformational value by addressing data challenges related to completeness, timeliness and efficiency, ensuring companies understand change over time and can...

  • CUDA Kernel Optimizer

    2 weeks ago


    Greater London, United Kingdom Mercor Full time

    CUDA Kernel Optimizer - ML Engineer at Mercor Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts...


  • London, United Kingdom Mercor Full time

    Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize...


  • Greater London, United Kingdom Mercor Full time

    A leading technology consultancy is seeking a CUDA Kernel Optimizer - ML Engineer to develop and benchmark CUDA kernels, focusing on performance optimization and profiling. This role is ideal for independent contractors who excel in systems-level work with a compensation range of $120-$250/hour based on deliverables. Candidates should have deep expertise in...


  • London Area, United Kingdom Tiro Partners Limited Full time

    ML / Machine Learning / C++ / Python / CUDA. Machine Learning Engineer – AI & Foundation Models (Future CTO Track). West London | 3–4 days onsite. Up to £150k + equity. Company: AI StartUp. We're hiring one of the first technical engineers for a pioneering AI startup building a foundation model that fully automates development. This is a founding role with...

  • AI Research Engineer

    6 hours ago


    London, United Kingdom Harnham Full time

    Do you want to build frontier-level LLM models from scratch? Have you worked on large-scale GPU training, Triton/CUDA, or MoE systems? Are you ready to join one of Europe's most technical deep-learning teams? A Europe-based deep learning company is building the next generation of foundation models. Think of a smaller, faster, highly technical version of the...