Senior Data Engineer
6 days ago
Posted Date: Dec 4 2025
The Onyx Research Data Platform organization represents a major investment by GSK R&D and Digital & Tech, designed to deliver a step change in our ability to leverage data, knowledge, and prediction to find new medicines. We are a full-stack shop consisting of product and portfolio leadership, data engineering, infrastructure and DevOps, data / metadata / knowledge platforms, and AI/ML and analysis platforms, all geared toward:
· Building an unified, automated, next-generation data experience for GSK's scientists, engineers, and decision-makers, increasing productivity, and reducing data friction
· Providing best-in-class AI/ML, GenAI and data analysis environments to accelerate our predictive capabilities and attract top-tier talent
· Aggressively engineering our data at scale to unlock the value of our combined data assets and predictions in real-time
Data Engineering is responsible for the design, delivery, support, and maintenance of industrialized automated end to end data services and pipelines. They apply standardized data models and mapping to ensure data is accessible for end users in end-to-end user tools through use of APIs. They define and embed best practices and ensure compliance with Quality Management practices and alignment to automated data governance. They also acquire and process internal and external, structure and unstructured data in line with Product requirements.
As a Senior Data Engineer, you are a leading technical contributor who turns ambiguous scientific or technical challenges into well-specified data solutions. You bring deep expertise in distributed systems, data processing, cloud platforms, and modern software engineering. You champion best practices, lead technical design, mentor engineers and drive high-impact work across the data ecosystem. You ensure robustness of our services and serve as an escalation point in the operation of existing services, pipelines, and workflows. You should be deeply familiar with the tools of modern data engineering (e.g. Spark, Kafka, Storm, …) and of our customers and engaged with the open-source community surrounding them – potentially, even to the level of contributing pull requests.
You operate with a strong engineering mindset, prioritizing automation, reliability, metrics and well-instrumented pipelines. You also support emerging capabilities such as GenAI powered data services, LLM-enabled agents, vectorized feature pipelines and RAG workflows.
Key responsibilities include:Designs, builds, and operates data tools, services, workflows, etc that deliver high value through the solution to key business problems by leveraging modern data engineering tools (e.g. Spark, Kafka, Storm, …) and orchestration tools (e.g. Google Workflow, AirFlow Composer)
Confidently optimizes design and execution of complex solutions in data ingestion and data transformation
Enables data products optimized for AI/ML and GenAI workloads—high throughput, observable, feature-ready and governed
Produces well-engineered software, including appropriate automated test suites, technical documentation, and operational strategy
Implements modular, reusable components and microservices that accelerate development and reduce operational overhead
Provides input into the roadmaps of upstream teams (e.g. Data Platforms, DataOps, DevOps) to help improve the overall program of work
Ensure consistent application of platform abstractions to ensure quality and consistency with respect to logging and lineage
Fully versed in coding best practices and ways of working, and participates in code reviews and partnering to improve the team's standards
Adhere to QMS framework and CI/CD best practices and helps to guide improvements to them that improve ways of working
Provides technical leadership, code reviews, architectural guidance, and mentorship to junior engineers and serves as an escalation point for complex operational issues across pipeline and data services.
We are looking for professionals with these required skills to achieve our goals:
PhD + 2 years, Masters + 4 years or a Bachelors degree with 6+ years of Data engineering experience in industry
Software engineering experience
Experience overcoming high volume, high compute challenges
Familiarity with orchestrating tooling
Cloud experience
Experience in automated testing and design
Experience with DevOps-forward ways of working
If you have the following characteristics, it would be a plus:
Deep knowledge and use of at least one common programming language: e.g., Python, Scala, Java, including toolchains for documentation, testing, and operations / observability
Deep expertise in modern software development tools / ways of working (e.g. git/GitHub, devops tools, metrics / monitoring, …)
Cloud experience (e.g., AWS, Google Cloud, Azure, Kubernetes), including infrastructure-as-code
Application experience of CI/CD implementations using git and a common CI/CD stack (e.g. Jenkins, CircleCI, GitLab, Azure DevOps)
Demonstrated excellence with agile software development environments using tools like Jira and Confluence
Deep familiarity with the tools, techniques, etc of modern data engineering (e.g. Spark, Kafka, Storm, …) and orchestration (e.g. Google Workflow, AirFlow Composer), including engagement with the open source community (and potentially making contributions to such tools)
Strong experience in data modelling, database concepts and SQL
Prior experience building GenAI-related pipelines (embeddings, RAG, LLM data prep, scalable inference data flows)
#GSK-LI
• If you are based in Cambridge, MA; Waltham, MA; Rockville, MD; or San Francisco, CA, the annual base salary for new hires in this position ranges $136,950 to $228,250.
The US salary ranges take into account a number of factors including work location within the US market, the candidate's skills, experience, education level and the market rate for the role. In addition, this position offers an annual bonus and eligibility to participate in our share based long term incentive program which is dependent on the level of the role. Available benefits include health care and other insurance benefits (for employee and family), retirement benefits, paid holidays, vacation, and paid caregiver/parental and medical leave.
If salary ranges are not displayed in the job posting for a specific country, the relevant compensation will be discussed during the recruitment process.
Please visit GSK US Benefits Summary to learn more about the comprehensive benefits program GSK offers US employees.
Why GSK?
Uniting science, technology and talent to get ahead of disease together.
GSK is a global biopharma company with a purpose to unite science, technology and talent to get ahead of disease together. We aim to positively impact the health of 2.5 billion people by the end of the decade, as a successful, growing company where people can thrive. We get ahead of disease by preventing and treating it with innovation in specialty medicines and vaccines. We focus on four therapeutic areas: respiratory, immunology and inflammation; oncology; HIV; and infectious diseases – to impact health at scale.
People and patients around the world count on the medicines and vaccines we make, so we're committed to creating an environment where our people can thrive and focus on what matters most. Our culture of being ambitious for patients, accountable for impact and doing the right thing is the foundation for how, together, we deliver for patients, shareholders and our people.
If you require an accommodation or other assistance to apply for a job at GSK, please contact the GSK Service Centre at US Toll Free) or outside US).
GSK is an Equal Opportunity Employer. This ensures that all qualified applicants will receive equal consideration for employment without regard to race, color, religion, sex (including pregnancy, gender identity, and sexual orientation), parental status, national origin, age, disability, genetic information (including family medical history), military service or any basis prohibited under federal, state or local law.
Important notice to Employment businesses/ Agencies
GSK does not accept referrals from employment businesses and/or employment agencies in respect of the vacancies posted on this site. All employment businesses/agencies are required to contact GSK's commercial and general procurement/human resources department to obtain prior written authorization before referring any candidates to GSK. The obtaining of prior written authorization is a condition precedent to any agreement (verbal or written) between the employment business/ agency and GSK. In the absence of such written authorization being obtained any actions undertaken by the employment business/agency shall be deemed to have been performed without the consent or contractual agreement of GSK. GSK shall therefore not be liable for any fees arising from such actions or any fees arising from any referrals by employment businesses/agencies in respect of the vacancies posted on this site.
Please note that if you are a US Licensed Healthcare Professional or Healthcare Professional as defined by the laws of the state issuing your license, GSK may be required to capture and report expenses GSK incurs, on your behalf, in the event you are afforded an interview for employment. This capture of applicable transfers of value is necessary to ensure GSK's compliance to all federal and state US Transparency requirements. For more information, please visit the Centers for Medicare and Medicaid Services (CMS) website
-
Senior Data Engineer
1 day ago
Cambridge, Cambridgeshire, United Kingdom SoCode Recruitment Full time £60,000 - £100,000 per yearDo you you enjoy working closely with a tight-knit team?Do you want to work in a business where making a difference is at the heart of their goals?I'm supporting a rapidly scaling medical technology innovator in their search for a Senior Data Engineer to help design and build a next-generation unified lakehouse platform on Databricks. This is a fantastic...
-
Senior Data Engineer
2 weeks ago
Cambridge, Cambridgeshire, United Kingdom KDR Talent Solutions Full time £70,000 - £90,000 per yearRole:Senior Data Engineer (Databricks / AWS / Lakehouse)Location:Cambridge, UK (Flexible Hybrid Working)Salary:£70,000 - £90,000 basic + Comprehensive Benefits PackageAre you a Data Engineer who wants to build systems thattrulymatter? Are you an expert in Databricks, looking for a challenge beyond just operating an existing platform?I'm hiring for a...
-
Senior Staff Data Engineer
1 week ago
Cambridge, Cambridgeshire, United Kingdom Visa Full time £60,000 - £100,000 per yearCompany Description Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure...
-
Senior Staff Data Engineer
7 days ago
Cambridge, Cambridgeshire, United Kingdom Visa Full time £80,000 - £120,000 per yearVisa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure payments network,...
-
Stairlift Engineer
7 days ago
Cambridge, Cambridgeshire, United Kingdom Senior Stairlifts Full time £31,000 - £60,000 per yearJob Title: Stairlift Engineer Location: Cambridge (Various locations available)MINIMUM 1 year Experience in stairlifts or Mobility products requiredJoin Us in Making a DifferenceSenior Stairlifts is on a mission to become the UK's leading independent stairlift provider, and we need passionate, skilled engineers like you to help us reach that goal If...
-
Senior Data Engineer
2 weeks ago
Cambridge, Cambridgeshire, United Kingdom Roku Full time £60,000 - £110,000 per yearTeamwork makes the stream work.Roku is changing how the world watches TVRoku is the #1 TV streaming platform in the U.S., Canada, and Mexico, and we've set our sights on powering every television in the world. Roku pioneered streaming to the TV. Our mission is to be the TV streaming platform that connects the entire TV ecosystem. We connect consumers to the...
-
Data Engineer
2 weeks ago
Cambridge, Cambridgeshire, United Kingdom Mackenzie Jones Full time £40,000 - £80,000 per yearData Engineer. Permanent. T6/MN/ Hybrid - 2 Days Onsite Weekly - Cambridgeshire.Must be Eligible to work in the UK.International Manufacturing organisation is seeking to secure a Data Engineer. Member of a small Data Engineering Team which is part of a much larger IT function.Role:Data Movement & Transformation processes between...
-
Data Engineer
1 week ago
Cambridge, Cambridgeshire, United Kingdom Axiom Software Solutions Limited Full time £60,000 - £100,000 per yearPosition: Data EngineerLocation: Cambridge / Luton, UK (Hybrid 2-3 days onsite in a week)Duration: Long Term B2B ContractJob Description:The ideal candidate with a minimum of 5 +years of experience having strong experience working with Snowflake, DBT, Python, and AWS to deliver ETL/ELT Pipelines using different resources. • Proficiency in Snowflake data...
-
Legal and Data Engineer
2 weeks ago
Cambridge, Cambridgeshire, United Kingdom Simmons & Simmons Full time £60,000 - £120,000 per yearThe role: We are looking for a Legal & Data Engineer to join our growing team. The role of the Legal & Data Engineer is a commercially-focused, client-facing position supporting and developing services for our clients. The Legal & Data Engineer will work closely with our Senior Legal & Data Engineer to identify, scope, price and implement services that bring...
-
Data Engineer
6 days ago
Cambridge, Cambridgeshire, United Kingdom Cyted Health Full time £45,000 - £65,000 per yearJob Summary As a Data Engineer at Cyted, you'll build the data infrastructure that powers our diagnostics and research. You'll transform experimental workflows into reliable, production-grade data pipelines, implementing reproducible ingestion and analysis processes (primarily using Nextflow) and developing automation and orchestration for both operational...