Neural Network Optimization Engineer

2 weeks ago


London, Greater London, United Kingdom Recraft Full time £60,000 - £1,050,000 per year
About Us

Founded in the US in 2022 and now based in London, UK, Recraft is an AI tool for professional designers, illustrators, and marketers, setting a new standard for excellence in image generation.

We designed a tool that lets creators quickly generate and iterate original images, vector art, illustrations, icons, and 3D graphics with AI. Over 3 million users across 200 countries have produced hundreds of millions of images using Recraft, and we're just getting started.

Join a universe of professional opportunities, develop and support large-scale projects, and shape the future of creativity. We are committed to making Recraft an essential, daily tool for every designer and setting the industry standard. Our mission is to ensure that creators can fully control their creative process with AI, providing them with innovative tools to turn ideas into reality.

If you're passionate about pushing the boundaries of AI, we want you on board

Job Description

We are seeking an experienced Neural Network Optimization Engineer who will specialize in enhancing the performance, latency, and throughput of neural network inference workflows. The ideal candidate will have substantial hands-on experience optimizing inference workloads using technologies such as TensorRT, Triton language, and model quantization techniques. You will collaborate closely with ML researchers to ensure that our machine learning models run at peak efficiency and reliability in production environments.

Key Responsibilities
  • Optimize neural network models for inference performance and latency reduction

  • Implement model quantization methods (e.g., INT8, FP8) to maximize computational efficiency.

  • Benchmark, analyze, and improve inference performance on targeted hardware platforms.

  • Collaborate with the ML researchers to deploy optimized models in production environments.

  • Stay updated with the latest developments in model optimization, inference engines, quantization methods, and related technologies.

Requirements
  • Proven professional experience optimizing neural network inference workloads.

  • Strong expertise with TensorRT, Triton language, CUDA programming.

  • Experience with neural network quantization techniques.

  • Proficiency in Python and PyTorch.

  • Deep understanding of GPU architectures and performance optimization.

  • Excellent problem-solving skills and ability to analyze performance bottlenecks.

What We Offer
  • Competitive salary.

  • We're able to offer Skilled Worker visa sponsorship in the UK for qualified candidates.

  • Opportunities for professional growth and development.

  • A collaborative and user-focused work environment.

  • The chance to shape the future of AI-powered creativity through research.

  • Exciting projects where your insights will directly impact product development.



  • London, Greater London, United Kingdom Apple Full time £40,000 - £80,000 per year

    Application Deadline: Friday 24th October 2025 Shape the future of real-time 3D animation Join our Animation Research team in London and explore brand new neural network approaches for character animation. We're seeking passionate and driven students to redefine how animation is developed and experienced across Apple platforms. As an intern, you'll explore...


  • London, Greater London, United Kingdom Nokia Global Full time £60,000 - £120,000 per year

    DescriptionAs a Network Optimization Engineer, you'll play a key role in designing and improving medium-to-high complexity network solutions. You'll develop HLD/LLD designs, plan capacity, analyze network interfaces, and ensure performance through ongoing optimization. This role gives you the opportunity to work with advanced planning tools, design...


  • London, Greater London, United Kingdom Nokia Global Full time £60,000 - £140,000 per year

    DescriptionAs a Network Optimization Engineer at Nokia, you'll enhance mature wireless networks across Europe alongside a dynamic team of experts. Your focus will be on end-to-end optimization, including capacity, architecture, and performance analysis. Collaborating closely, you'll conduct root cause analyses and implement best practices post-launch to meet...


  • London, Greater London, United Kingdom Nokia Global Full time £40,000 - £80,000 per year

    DescriptionYour focus will be on end-to-end optimization, including capacity, architecture, and performance analysis. Collaborating closely, you'll conduct root cause analyses and implement best practices post-launch to meet evolving customer needs. You'll directly engage with customers, offering consultative insights on 4G and 5G technologies, using...


  • London, Greater London, United Kingdom PulsePoint Full time £60,000 - £120,000 per year

    Description Function: Engineering, R&D → Data Science / Machine Learning / Operations Research About PulsePoint: PulsePoint is a fast-growing healthcare technology company (with adtech roots) using real-time data to transform healthcare. We help brands and agencies interpret the hard-to-read signals across the health journey and unify these digital...


  • London, Greater London, United Kingdom 7f4271bd-5244-445b-9c25-3101c781d616 Full time £60,000 - £120,000 per year

    Company DescriptionWe suggest you enter details here.Role DescriptionThis is a full-time Hybrid role based in London for a Full Stack/AI Engineer. The role involves designing, developing, and maintaining sophisticated AI-driven applications and full stack solutions. Day-to-day tasks include building and optimizing machine learning models, implementing...


  • London, Greater London, United Kingdom Anza Full time $150,000 - $275,000 per year

    Software Engineer, Networking - Anza Who We AreAnza is a Solana R&D lab pushing the boundaries of blockchain performance and scalability. Anza was founded by experienced executives and core engineers solving the toughest problems in Web3. Crypto ecosystems rely on robust protocols, and we believe those are best built out in the open, with multiple...


  • London, Greater London, United Kingdom Anza Full time £32,000 - £80,000 per year

    Software Engineer, Networking - Anza Who We AreAt Anza, we're at the forefront of blockchain technology, developing the Agave client to enhance the Solana ecosystem — a blockchain designed for rapid growth without compromising security or scalability. We pioneer advanced solutions to meet the evolving demands of decentralized applications.The RoleAs a...


  • London, Greater London, United Kingdom Aalyria Full time £60,000 - £120,000 per year

    Role Overview:We are seeking a highly skilled engineer with strong expertise in wireless communications theory, with a strong mathematical background in linear algebra, matrix theory, signal processing and relevant industry experience. This engineer will work with a team of highly skilled software engineers, systems engineers and system architects on the...


  • London, Greater London, United Kingdom Aalyria Full time £80,000 - £120,000 per year

    About AalyriaAalyria is a leading technology company that supplies laser communications technology and temporospatial software-defined networking platforms to the aerospace industry. With technology acquired from Google, Aalyria is at the forefront of innovation in satellite and airborne mesh networks, as well as cislunar and deep-space communications. We...