
Enable job alerts via email!
Generate a tailored resume in minutes
Land an interview and earn more. Learn more
A leading pharmaceutical company is seeking a Data Engineer II to further develop their Onyx Research Data Platform. This role requires expertise in data engineering and software development, focusing on creating automated data services. Candidates should have at least a Bachelor’s degree and 4+ years in data engineering, with skills in Python and cloud systems. The company offers competitive compensation and a supportive working environment.
Job description
Site Name: South San Francisco 611 Gateway Blvd, Cambridge 300 Technology Square, London The Stanley Building | Posted Date: Dec 4 2025
The Onyx Research Data Platform organization represents a major investment by GSK R&D and Digital & Tech, designed to deliver a step change in our ability to leveraged data, knowledge, and prediction to find new medicines. We are a full‑stack shop consisting of product and portfolio leadership, data engineering, infrastructure and DevOps, data/metadata/knowledge platforms, and AI/ML and analysis platforms, all geared toward:
Data Engineering is responsible for the design, delivery, support, and maintenance of industrialised automated end‑to‑end data services and pipelines. They apply standardised data models and mapping to ensure data is accessible for end users in end‑to‑end user tools through use of APIs. They define and embed best practices and ensure compliance with Quality Management practices and alignment to automated data governance. They also acquire and process internal and external, structured and unstructured data in line with Product requirements.
As a Data Engineer II, you are a technical contributor who can take a well‑defined specification for a function, pipeline, service, or other sort of component, devise a technical solution, and deliver it at a high level. You are aware of, and adhere to, best practice for software development in general (and data engineering in particular), including code quality, documentation, DevOps practices, and testing. You ensure robustness of our services and serve as an escalation point in the operation of existing services, pipelines, and workflows. You will work across structured, unstructured, and scientific data domains, applying modern engineering and automation best practices to deliver reliable, scalable, and governed data products. You will also contribute to emerging GenAI‑enabled data capabilities, such as embedding pipelines, vectorised data flows, and LLM‑ready data products.
You should be deeply familiar with the most common tools (languages, libraries, etc.) in the data space, such as Spark, Kafka, Storm, etc., and aware of the open‑source communities that revolve around these tools. You have a strong focus on operability of your tools and services, and develop, measure, and monitor key metrics for their work to seek opportunities to improve those metrics.
Why GSK?
Uniting science, technology and talent to get ahead of disease together.
• If you are based in Cambridge, MA; Waltham, MA; Rockville, MD; or San Francisco, CA, the annual base salary for new hires in this position ranges $116,325 to $193,875. The US salary ranges take into account a number of factors including work location within the US market, the candidate's skills, experience, education level and the market rate for the role. In addition, this position offers an annual bonus and eligibility to participate in our share‑based long‑term incentive program which is dependent on the level of the role. Available benefits include health care and other insurance benefits (for employee and family), retirement benefits, paid holidays, vacation, and paid caregiver/parental and medical leave. If salary ranges are not displayed in the job posting for a specific country, the relevant compensation will be discussed during the recruitment process.
Please visit GSK US Benefits Summary to learn more about the comprehensive benefits program GSK offers US employees.
GSK is an Equal Opportunity Employer. This ensures that all qualified applicants will receive equal consideration for employment without regard to race, color, religion, sex (including pregnancy, gender identity, and sexual orientation), parental status, national origin, age, disability, genetic information (including family medical history), military service or any basis prohibited under federal, state or local law.
If you require an accommodation or other assistance to apply for a job at GSK, please contact the GSK Service Centre at 1‑877‑694‑7547 (US Toll Free) or +1 801 567 5155 (outside US).
Important notice to Employment businesses/ Agencies
GSK does not accept referrals from employment businesses and/or employment agencies in respect of the vacancies posted on this site. All employment businesses/agencies are required to contact GSK’s commercial and general procurement/human resources department to obtain prior written authorization before referring any candidates to GSK. The obtaining of prior written authorization is a condition precedent to any agreement (verbal or written) between the employment business/ agency and GSK. In the absence of such written authorization being obtained any actions undertaken by the employment business/agency shall be deemed to have been performed without the consent or contractual agreement of GSK. GSK shall therefore not be liable for any fees arising from such actions or any fees arising from any referrals by employment businesses/agencies in respect of the vacancies posted on this site.
Please note that if you are a US Licensed Healthcare Professional or Healthcare Professional as defined by the laws of the state issuing your license, GSK may be required to capture and report expenses GSK incurs, on your behalf, in the event you are afforded an interview for employment. This capture of applicable transfers of value is necessary to ensure GSK’s compliance to all federal and state US Transparency requirements. For more information, please visit the Centers for Medicare and Medicaid Services (CMS) website at https://openpaymentsdata.cms.gov/