Enable job alerts via email!

Senior Data Engineer, Discovery & AI

Tubi Tv

Toronto

Hybrid

CAD 90,000 - 120,000

Full time

8 days ago

Job summary

A leading streaming service is seeking a Data Engineer to develop and maintain data processing pipelines to support reporting and analytics. You will manage data needs for product verticals and collaborate with teams to ensure data quality. Candidates should have over 7 years of experience including strong skills in SQL, Spark, and Databricks. This position offers an opportunity to work with large datasets and be part of an innovative team.

Qualifications

  • 7+ years of industry experience, at least 5 in a data engineering role.
  • Experience with scalable, flexible, and always-on data pipelines.
  • Strong experience with Databricks and cloud providers.

Responsibilities

  • Own the data needs of a specific product vertical.
  • Build high quality datasets using Spark and DBT.
  • Participate in on-call rotation for data issues.

Skills

Data manipulation
SQL
Spark
Python
Databricks

Education

BSc/MSc in Computer Science or related field

Tools

DBT
Airflow
AWS
Job description

Tubi is a free streaming service that entertains over 100 million monthly active users. Tubi offers the world's largest collection of Hollywood movies and TV shows, thousands of creator-led stories and hundreds of Tubi Originals made for the most passionate fans.

About the Role:

As a Data Engineer at Tubi, you'll be instrumental in developing and maintaining robust data processing pipelines that underpin our reporting, analytics, and performance insights. You'll operate as an embedded data engineer within a specific data product vertical, Discovery & AI, shaping the culture, best practices, and overall approach to data engineering within the team.

In this role, you will also be responsible for supporting your vertical with data-related automation, mentoring junior engineers if need be, and contributing to the strategic direction of data engineering across the company. Each product team presents unique big data challenges, requiring you to be adaptable, curious, and possess a strong foundation in big data and engineering principles.

What You'll Do:

  • Be the primary owner of the data needs of a specific product vertical. That means everything from raw-data ingestion to end-user analysis.
  • Build intuitive, easy-to-use, and high quality datasets using Spark and DBT
  • Track down data quality issues when they arise, and then set appropriate data quality monitors and alerts to help prevent future incidents
  • Participate in the occasional on-call rotation (12-hr day time shifts, ~1-2/month)
  • Understand your product vertical's datasets, with an ability to document and educate your team on how certain tables / fields are meant to be used
  • Be the liaison between your product vertical and the core data infrastructure team to make sure business needs are met in a computationally efficient manner

Your Background:

  • BSc/MSc in Computer Science or related field
  • 7+ years of industry experience, at least 5 of which are in a data engineering (or data-centric software engineering) role.
  • Track record of building and operating scalable, flexible, and always-on data pipelines.
  • A desire and ability to truly understand and serve the business problems and use-cases you will be working with.
  • Fluent in data manipulation and SQL
  • Strong knowledge of Spark, Python (libraries like pandas and polars, helpful)
  • Strong experience with Databricks (SQL Warehouses, Jobs Clusters, and Serverless)
  • Experience in Databricks Feature Store usage with ML offline training
  • Experience in setting up experiments and A/B testing
  • Nice to have MLFlow Python library experience
  • Familiarity with DBT and Airflow in particular is helpful
  • Familiarity with StatSig and Experiment setup is helpful
  • Nice to have experience bringing data in from RDS/PostgreSQL instances to Databricks
  • Experience with cloud providers and cloud storage (AWS preferred)
  • Experience with efficiently working with datasets at TB scale
  • Service and data quality oriented!
  • A passion for shipping production quality code with good test coverage
  • Ability to prioritize tasks and self-motivate without constant supervision

We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, gender identity, disability, protected veteran status, or any other characteristic protected by law.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.