Enable job alerts via email!

Clinical Data and AI Integration Analyst

King's College London

London

On-site

GBP 40,000 - 60,000

Full time

8 days ago

Job summary

A leading UK educational institution is seeking a Clinical Data Linkage Service professional to support data processing and integration tasks within healthcare. The candidate should have a Master's degree and experience in data engineering with Python or Java. This is a full-time role offering a fixed term contract until 30/11/2027.

Qualifications

  • MSc or equivalent experience in a relevant area.
  • Experience with Python and/or Java for data engineering tasks.
  • Familiarity with ETL workflows and database systems.
  • Knowledge of data security in healthcare.

Responsibilities

  • Support and maintain CogStack for EHR data processing.
  • Implement ETL workflows for data mapping.
  • Contribute to troubleshooting in data provisioning.
  • Assess data quality and develop quality control tools.

Skills

Python
Java
ETL workflows
Database systems
Data security
Git
DevOps practices
Communication skills

Education

MSc in computer science or related field

Tools

PostgreSQL
SQL Server
Docker

Job description

About us:

CogStack ( https://cogstack.org/ ) is an award winning ecosystem of tools and workflows that facilitate the ingestion, structuring, organising and visualisation of Electronic Health Record (EHR) data built by a multidisciplinary team of software developers, machine learning engineers, clinical researchers and health informaticians. The CogStack team is at the forefront of building impactful solutions and partnering with NHS Trusts and healthcare providers, tackling real world clinical problems supporting use cases from state-of-the-art clinical research through to translational research delivering innovative solutions for direct patient care (How Elastic improves patient outcomes with valuable healthcare data; https://doi.org/10.1101/123299 ).

The CogStack team benefits from sitting within a leading programme of clinical, health and bioinformatics at the South London and Maudsley (SLaM) Biomedical Research Centre (BRC) and forms a key component of both the Centre for Translational Informatics (www.ctiuk.org) and actionable analytics theme of the recently awarded Health Data Research UK (HDR UK) London site.

Major funding has been awarded by the Office for Life Sciences, InnovateUK and recently a Stage 3 AI for Health and Social Care Award from NHSx. The ecosystem has already been recognised in Government reports to the Chief Medical Officer, NHSx AI report, NHS Tech Plan and keynote speeches by the Health Secretary.

About the role:

The Clinical Data Linkage Service (CDLS), hosted by the NIHR Maudsley Biomedical Research Centre (BRC), provides secure and ethical linkage between datasets from King's College Hospital (KCH), Guy's and St Thomas' NHS Foundation Trust (GSTT), and the Clinical Record Interactive Search (CRIS) platform at South London and Maudsley NHS Foundation Trust (SLaM).

The postholder will support and maintain the use of CogStack to extract and process clinical data at KCH and GSTT for linkage to CRIS via the CDLS. This includes working closely with CogStack colleagues, data controllers, research teams and operational stakeholders across Trusts to ensure high-quality, auditable and timely data processing pipelines.

The post holder will be expected to be able to contribute to the following areas:
  • Operational support for running and maintaining CogStack pipelines at KCH and GSTT, with a focus on secure, high-quality, and auditable EHR data extraction for the CDLS.
  • Implementation and documentation of ETL workflows to map data to CDLS and CRIS data structures.
  • Contribution to the technical specification and troubleshooting of issues arising in clinical data provisioning, NLP processing, or linkage preparation.
  • Extension of CogStack-NiFi or other internal modules for custom data routing, transformation or enrichment tasks (e.g. MedCAT NER+L).
  • Data quality assessment and contribution to the development of automated or manual quality control tools for clinical datasets.
  • Communication of requirements and constraints to clinical and non-technical audiences, especially in relation to information governance and linkage protocols.
  • Collaboration with the broader CogStack team to ensure architectural alignment, reusable components, and long-term platform sustainability.
This is a full time post (35 hours per week), and you will be offered a fixed term contract until 30/11/2027.

About you:

To be successful in this role, we are looking for candidates to have the following skills and experience:

Essential criteria
  1. MSc or equivalent experience in a relevant area such as computer science, health informatics, software engineering, or data science
  2. Experience with Python and/or Java for data engineering or EHR processing tasks
  3. Experience with ETL workflows, database systems (e.g. PostgreSQL, SQL Server), and API-based data integration
  4. Knowledge of data security, audit logging and information governance in healthcare settings
  5. Experience working with version control (e.g. Git), DevOps practices and container technologies such as Docker
  6. Strong communication skills and ability to work across multi-disciplinary and cross-organisational teams

Desirable criteria
  1. Experience working in or with NHS Trusts or other health data environments
  2. Experience with MedCAT, CogStack-NiFi or similar clinical NLP tools
  3. Understanding of CRIS or similar de-identified research platforms
  4. Experience managing complex or high-volume data pipelines in production environments
  5. Knowledge of FHIR, SNOMED CT or other healthcare interoperability and ontology standards

Downloading a copy of our Job Description

Full details of the role and the skills, knowledge and experience required can be found in the Job Description document, provided at the bottom of the page. This document will provide information of what criteria will be assessed at each stage of the recruitment process.

Further information:

We ask all candidates to submit a copy of their CV, and a supporting statement, detailing how they meet the essential criteria listed in the advert. If we receive a strong field of candidates, we may use the desirable criteria to choose our final shortlist, so please include your evidence against these where possible.

To find out how our managers will review your application, please take a look at our 'How we Recruit' pages.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.