Job Summary
This role supports the development of a whole-of-government Synthetic Data Generation (SDG) product. You will work across data science, applied machine learning, and software engineering to design models, build features, support users, and improve the platform as it grows.
About the Company
A leading government technology team focused on building secure, privacy-enhancing solutions and advanced data platforms used across many public sector agencies. The organisation develops tools that strengthen data privacy, drive innovation, and support national digital initiatives.
Key Responsibilities
- Design and run experiments to evaluate emerging SDG models such as DDPM, ARF, and Gaussian Copula.
- Investigate model limitations related to data types, size, or cardinality.
- Tune hyperparameters, refine model architectures, and propose improved modelling approaches.
- Collaborate with software engineers to build product features that require ML or data science inputs.
- Develop approaches for preprocessing, constraints handling, imputation, and privacy metrics.
- Work with users and engineering teams to diagnose training issues, poor outputs, or integration problems.
- Provide clear explanations and actionable fixes in a user-friendly manner.
- Write user-facing documentation explaining model choices, metrics, and technical concepts in simple language.
- Help translate complex data science topics into clear, accessible explanations.
- Work closely with the engineering team (Next.js, FastAPI, AWS) to integrate engines into production systems.
- Participate in Agile discussions, reviews, and collaborative design work.
Technical Skills
- Strong foundation in machine learning model development and experimentation.
- Proficiency in Python and ML frameworks such as PyTorch, TensorFlow, and scikit-learn.
- Ability to analyze model behavior, troubleshoot training issues, and design experiments.
- Working knowledge of backend development using REST APIs (FastAPI, Flask or similar).
- Comfortable working with cloud environments, preferably AWS.
- Familiarity with Git, CI/CD, and collaborative coding practices.
Nice-to-Haves
- Experience with privacy-enhancing technologies, anonymisation, synthetic data generation, or differential privacy.
- Understanding of frontend integration workflows (Next.js / React).
- Experience working in multi-disciplinary product teams.
Mindset & Soft Skills
- Curious, eager to learn, and open to exploring new domains, especially data privacy.
- Strong communication skills with the ability to simplify technical concepts.
- Enjoys working in a collaborative, fast-paced Agile environment.