Data Engineer – Spark Specialist
4 days ago
Data Engineer – Spark Specialist Join to apply for the Data Engineer – Spark Specialist role at Dataiku Get AI-powered advice on this job and more exclusive features. Dataiku is The Universal AI Platform™, giving organizations control over their AI talent, processes, and technologies to unleash the creation of analytics, models, and agents. Providing no-, low-, and full-code capabilities, Dataiku meets teams where they are today, allowing them to begin building with AI using their existing skills and knowledge. About The Role Dataiku is looking for a Data Engineer specialized in Spark (PySpark) to join our Field Engineering team. In this role, you will work closely with our clients to troubleshoot and optimize complex data pipelines within the Dataiku platform. This includes both reactive support (advanced issues reported via the support portal) and proactive services (performance reviews and architecture advisory missions we propose to clients). You will serve as a technical expert in data processing, leveraging SQL and Python frameworks. You will specialize in Spark-based distributed data processing and lakehouse architecture. You will help our clients succeed, whether working with SQL-based workflows, processing data on Kubernetes, Databricks, or other modern data platforms. What You'll Do Help customers design, build, and optimize Flows in Dataiku, improving overall project performance and maintainability Debug and enhance complex Spark code and data pipelines for better performance and reliability. Guide clients in tuning and scaling Spark environments, such as Kubernetes and Databricks, including providing architectural guidance and best practices to enhance performance and reliability. Optimize SQL-based data pipelines to ensure efficient and robust data workflows within Dataiku. Advise clients on integrating different data pipelines (Spark, SQL, Python) into optimized solutions Collaborate with internal teams to resolve technical issues and contribute to the knowledge base. Who You Are You have deep hands-on experience building, debugging, and tuning Spark pipelines in production environments. Specifically, you have: Spark & PySpark Expertise Proficiency in writing and debugging PySpark code for large-scale data processing. Experience with Parquet, Delta Lake, and columnar file formats. Understanding of Spark’s interaction with metastores (e.g., Hive, Unity Catalog). Deep understanding of resource management: Spark executors, cores, memory, and relevant configurations (e.g., spark.executor.memory, spark.sql.shuffle.partitions). Expertise in tuning Spark jobs: partitioning, caching, broadcast joins, and avoiding unnecessary shuffles. Lakehouse & Orchestration Familiarity with lakehouse architectures and ACID-compliant data layers (Delta Lake, Iceberg, Hudi). Experience working with Databricks, including Databricks Connect and Databricks Workflows. Experience automating and scheduling Spark jobs using tools like Apache Airflow or native orchestration tools. Core Data Engineering Skills Proven experience developing, optimizing, and troubleshooting SQL-based data pipelines for efficient ETL and data transformation processes. Proficiency in building and managing data transformation workflows in Python, leveraging frameworks such as pandas. Familiarity with data modeling concepts and data quality best practices. Experience integrating data from a variety of sources, including databases, APIs, and cloud storages. Ability to communicate technical concepts effectively to both technical and non-technical stakeholders. What does the hiring process look like? Initial call with a member of our Technical Recruiting team Video call with the Field Engineer Hiring Manager Technical Assessment to show your skills (Home Test) Debrief of your Tech Assessment with FE Team members Final Interview with the VP Field Engineering What are you waiting for At Dataiku, you'll be part of a journey to shape the ever-evolving world of AI. We're not just building a product; we're crafting the future of AI. If you're ready to make a significant impact in a company that values innovation, collaboration, and your personal growth, we can't wait to welcome you to Dataiku And if you’d like to learn even more about working here, you can visit our Dataiku LinkedIn page. Our practices are rooted in the idea that everyone should be treated with dignity, decency and fairness. Dataiku also believes that a diverse identity is a source of strength and allows us to optimize across the many dimensions that are needed for our success. Therefore, we are proud to be an equal opportunity employer. All employment practices are based on business needs, without regard to race, ethnicity, gender identity or expression, sexual orientation, religion, age, neurodiversity, disability status, citizenship, veteran status or any other aspect which makes an individual unique or protected by laws and regulations in the locations where we operate. This applies to all policies and procedures related to recruitment and hiring, compensation, benefits, performance, promotion and termination and all other conditions and terms of employment. If you need assistance or an accommodation, please contact us at: reasonable-accommodations@dataiku.com Protect yourself from fraudulent recruitment activity Dataiku will never ask you for payment of any type during the interview or hiring process. Other than our video-conference application, Zoom, we will also never ask you to make purchases or download third-party applications during the process. If you experience something out of the ordinary or suspect fraudulent activity, please review our page on identifying and reporting fraudulent activity here. Seniority level Mid-Senior level Employment type Full-time Job function & Industries Information Technology Software Development Referrals increase your chances of interviewing at Dataiku by 2x Get notified about new Data Engineer jobs in Ledbury, England, United Kingdom. #J-18808-Ljbffr
-
Spark Data Engineer — Scale Pipelines
4 days ago
Ledbury, United Kingdom Dataiku Full timeA leading AI platform company is seeking a Data Engineer – Spark Specialist to work closely with clients in optimizing complex data pipelines using Spark and SQL. The ideal candidate will have deep experience in building and debugging Spark pipelines, proficiency in PySpark, and expertise in data processing within environments like Databricks. This role...
-
Senior Data Engineer
2 weeks ago
Ledbury, United Kingdom Methods Business and Digital Technology Full timeSenior Data Engineer On-site Full time Methods Business and Digital Technology Limited Methods is a £100M+ IT Services Consultancy who has partnered with a range of central government departments and agencies to transform the way the public sector operates in the UK. Established over 30 years ago and UK-based, we apply our skills in transformation,...
-
Senior Data Engineer
1 week ago
Ledbury, United Kingdom Methods Full timeMethods is a £100M+ IT Services Consultancy who has partnered with a range of central government departments and agencies to transform the way the public sector operates in the UK. Established over 30 years ago and UK-based, we apply our skills in transformation, delivery, and collaboration from across the Methods Group, to create end-to-end business and...
-
Hybrid Cloud Data Engineer
2 weeks ago
Ledbury, United Kingdom Methods Business and Digital Technology Full timeA leading IT Services Consultancy in the UK is looking for a Senior Data Engineer to design and manage sophisticated data infrastructure systems across on-premises and Azure environments. The role requires active Security Clearance and expertise in Python, ETL/ELT workflows, Docker, Kubernetes, and Azure Data Factory. Applicants should have a minimum of 5...
-
Data Scientist
5 days ago
Ledbury, United Kingdom Anson McCade Full timeSenior Data Scientist – Defence & National Security £60,000 – £100,000 + Benefits Hybrid: 2–3 Days/Week in Hereford, UK Eligibility: SC Cleared or Eligible (British Citizenship Required) Overview A leading AI consultancy, recognized for its work across national security and defence, is seeking a Senior Data Scientist to lead high‑impact projects in...
-
Maintenance Engineer
6 days ago
Ledbury, United Kingdom Omega Full timeJob Title: Maintenance Engineer Location: Ledbury, Gloucestershire Pay Range/details: Competitive Salary + Benefits Contract Type: Permanent Our client, is recruiting a Multi Skilled Maintenance Engineer to join its experienced engineering team. Key Responsibilities - Maintenance Engineer Perform scheduled and unscheduled maintenance on all production...
-
Maintenance Engineer
7 days ago
Ledbury, United Kingdom Omega Full timeJob Title: Maintenance EngineerLocation: Ledbury, GloucestershirePay Range/details: Competitive Salary + BenefitsContract Type: PermanentOur client, is recruiting a Multi Skilled Maintenance Engineer to join its experienced engineering team.Key Responsibilities - Maintenance EngineerPerform scheduled and unscheduled maintenance on all production equipment to...
-
Maintenance Engineer
6 days ago
Ledbury, United Kingdom Omega Full timeJob Title: Maintenance EngineerLocation: Ledbury, GloucestershirePay Range/details: Competitive Salary + BenefitsContract Type: PermanentOur client, is recruiting a Multi Skilled Maintenance Engineer to join its experienced engineering team.Key Responsibilities - Maintenance EngineerPerform scheduled and unscheduled maintenance on all production equipment to...
-
Senior Backend Engineer
1 week ago
Ledbury, United Kingdom Methods Business and Digital Technology Full time £80,000 - £120,000 per yearSenior Backend Engineer (Contractor)On-siteFull timeMethods Business and Digital Technology LimitedMethods is a £100M+ IT Services Consultancy who has partnered with a range of central government departments and agencies to transform the way the public sector operates in the UK. Established over 30 years ago and UK-based, we apply our skills in...
-
Cyber Security Engineer SoC/SIEM
7 days ago
Ledbury, United Kingdom Methods Business and Digital Technology Full timeOverviewMethods Business and Digital Technology LimitedMethods is a £100M+ IT Services Consultancy who has partnered with a range of central government departments and agencies to transform the way the public sector operates in the UK. Established over 30 years ago and UK-based, we apply our skills in transformation, delivery, and collaboration from across...