Research Engineer - Datasets

Canva

City Of London

Hybrid

GBP 60,000 - 80,000

Full time

17 days ago

Job summary

A leading design technology company in London is seeking a Data-focused Research Engineer to enhance their generative AI models. The role involves managing datasets, ensuring high-quality synthetic data generation, and conducting analyses to align models with human design preferences. Ideal candidates should excel in Python, have a strong aesthetic sense, and be familiar with ML frameworks. Enjoy equity packages, inclusive parental leave, and flexible work options.

Benefits

Equity packages
Inclusive parental leave policy
Annual Vibe & Thrive allowance
Flexible leave options

Qualifications

  • Strong proficiency in Python and ML frameworks is essential.
  • Experience with generative AI is highly desirable.
  • Background in visual design or human-computer interaction is a plus.

Responsibilities

  • Design and build scalable pipelines for processing and curating datasets.
  • Develop ML models to generate high-quality synthetic data.
  • Own the design and implementation of human evaluation workflows.

Skills

Strong proficiency in Python
Proficiency in ML frameworks (e.g., PyTorch, TensorFlow)
Strong aesthetic sense for visual design or human-computer interaction
Experience with generative AI and synthetic data generation
Solid understanding of statistical methods

Tools

Data annotation tools

Job description

Company Description

Join the team redefining how the world experiences design.

Our flagship campus is in Sydney, Australia, but London is home to part of our European operations. You have a choice in where and how you work: we trust our Canvanauts to choose the balance that empowers them and their team to achieve their goals.

About the role

At Canva, our mission is to empower the world to design. To ensure our generative AI models are truly helpful, we are seeking a talented, data-focused Research Engineer to build our next-generation design and vision models.

In this foundational role, you will be the expert on what fuels our models: data. You will be responsible for the end-to-end lifecycle of our datasets, from curating nuanced human feedback on design quality to pioneering the use of machine learning for high-quality synthetic data generation. You will design the sophisticated human evaluation studies and experiments that measure our success, and you will be the key analyst who translates signals from automated evaluators into a deep understanding of our models' alignment with human taste and intent. This is a unique role for a data-first thinker with a strong design sensibility, who is passionate about building the ground truth that will define the future of creativity.

At the moment, this role is focused on:

  • Human Feedback Data Curation: Owning the processing, cleaning, and strategic curation of large-scale, subjective human feedback on design quality, which is the lifeblood of our models.
  • Synthetic Data Generation: Using generative AI and machine learning techniques to create novel, high-quality synthetic data that augments our training sets and improves model capabilities.
  • Alignment Analysis & Evaluation Design: Designing methods to analyze outputs from both human and automated systems to deeply understand and measure our models' alignment with user preferences.

Primary Responsibilities

  • Design and build scalable pipelines for processing and curating large datasets of human design feedback.
  • Develop ML models to generate high-quality synthetic data for training and fine-tuning.
  • Own the design and implementation of human evaluation workflows, including creating guidelines and quality rubrics.
  • Prepare datasets for automated evaluation systems and analyze their outputs to provide a robust signal on model performance and human alignment.
  • Design and analyze experiments (including A/B tests) to measure the real-world impact of our models on design quality.
  • Conduct deep-dive analyses into model performance to identify failure modes and guide future development.

You're probably a match if you have:

  • A strong aesthetic sense, with a background or demonstrated passion for visual design or human-computer interaction.
  • Strong proficiency in Python and ML frameworks (e.g., PyTorch, TensorFlow).
  • Solid understanding of statistical methods, including experimental design, A/B testing, and quality evaluation systems.
  • Experience with generative AI and synthetic data generation is highly desirable.
  • Familiarity with data annotation tools is a plus.

Additional Information

What's in it for you?

Achieving our crazy big goals motivates us to work hard - and we do - but you'll experience lots of moments of magic, connectivity and fun woven throughout life at Canva, too. We also offer a stack of benefits to set you up for every success in and outside of work.

Here's a taste of what's on offer:

  • Equity packages - we want our success to be yours too
  • Inclusive parental leave policy that supports all parents & carers
  • An annual Vibe & Thrive allowance to support your wellbeing, social connection, home office setup & more
  • Flexible leave options that empower you to be a force for good, give you time to recharge and support you personally

Check out lifeatcanva.com for more info.

Other stuff to know

We make hiring decisions based on your experience, skills and passion, as well as how you can enhance Canva and our culture. When you apply, please tell us the pronouns you use and any reasonable adjustments you may need during the interview process.

Please note that interviews are predominantly conducted virtually.
