
A healthcare technology company in Cambridge seeks a Data Engineer to build and maintain data infrastructure for diagnostics and research. This full-time role involves developing reliable data pipelines, ensuring data governance, and collaborating closely with computational biologists and software engineers. Ideal candidates will have 2-3 years of relevant experience, a degree in a related field, and proficiency in Python and cloud services. The position offers a salary between £45,000 and £65,000, along with comprehensive benefits.
As a Data Engineer at Cyted, you'll build the data infrastructure that powers our diagnostics and research. You'll transform experimental workflows into reliable, production-grade data pipelines, implementing reproducible ingestion and analysis processes (primarily using Nextflow) and developing automation and orchestration for both operational and research workloads.
You'll establish strong data governance and observability practices, ensuring datasets are versioned, catalogued, and fully traceable from source to output. Security and compliance will be embedded in everything you design, meeting the standards required for regulated healthcare and diagnostics environments.
You'll work closely with computational biologists in R&D and software engineers in the Technology team to translate scientific and product requirements into scalable, maintainable solutions. Alongside delivery, you'll maintain clear technical documentation, contribute to code reviews, and help raise engineering standards across the team.
This is a full-time position with a standard 37.5-hour working week. The role holder may be required to work flexibly.
The Data Engineer will be based at Cyted's Head Office, Ground Floor Building 3 Old Swiss, 149 Cherry Hinton Road, Cambridge, United Kingdom, CB1 7BX.
At Cyted, how we work is just as important as what we're building. Our values shape how we collaborate, innovate, and deliver for patients and partners. As our Data Engineer, you'll bring these values to life from day one.
We care deeply about data integrity, patient outcomes, and the clinicians who rely on our insights. In this role, care means building systems that are accurate, traceable, and resilient - because real people depend on the results we generate. You'll take pride in clean code, reproducible pipelines, and the knowledge that every dataset you shape contributes to earlier, better diagnosis.
We expect you to own your work and your contributions to your function with confidence and curiosity. You'll be responsible for designing and maintaining the infrastructure that connects our science, operations, and technology. You'll take initiative, move with purpose, and be trusted to make critical decisions that keep our data ecosystem secure, scalable, and compliant.
We aim high. We're scaling fast, working across complex regulated environments, and pushing boundaries in how data accelerates diagnostics. You'll be empowered to build with ambition - optimising workflows, streamlining automation, and helping define what great data engineering looks like in healthcare.
You'll be expected to dive deep into the science, the systems, and the standards. You'll understand the technical and regulatory nuance behind every workflow, and you'll be just as comfortable debugging a Nextflow pipeline as you are explaining architecture decisions to cross-functional teams. You won't just maintain systems; you'll actively improve them.
We encourage everyone to challenge and commit. You'll help shape how we work as a data-led company, questioning assumptions, sharing ideas, and being open to better ways. But once we align, you'll deliver with clarity, ownership, and precision.
And most of all, we deliver. This is a role for someone who thrives on progress, who builds with intent and sees impact in every successful workflow run, every insight delivered, and every patient outcome improved.
This is how we work at Cyted, and if this sounds like the environment where you'll do your best work, we'd love to speak with you.
We're looking for a skilled, proactive Data Engineer who's ready to build and scale the infrastructure that powers our scientific and operational insights. The ideal candidate will bring experience working with complex, regulated datasets, a strong grasp of modern data engineering tools and best practices, and the curiosity to solve problems at the intersection of biology and technology. You'll be hands-on, adaptable, and motivated to design systems that are reliable, compliant, and built to grow in a fast-paced, purpose-driven environment.
To succeed in this role, you'll bring: