Job Search and Career Advice Platform

Enable job alerts via email!

Data Engineer

EnStream LP

Toronto

On-site

CAD 80,000 - 100,000

Full time

Today
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading digital identity firm in Toronto is seeking a hands-on Data Engineer/ML Engineer to build and scale their data platform and machine learning pipelines. The role includes designing robust data processes, implementing ETL/ELT pipelines, and establishing observability for data quality. The ideal candidate has strong AWS experience, proficiency in Python and SQL, and is capable of ensuring production readiness for data workflows. Competitive salary and professional growth opportunities are offered.

Benefits

National-scale initiative contribution
Cutting-edge applications
Collaborative work environment

Qualifications

  • Hands-on experience with AWS data engineering and ML tools.
  • Strong knowledge of Python and SQL for pipeline development.
  • Experience in implementing observability for production services.

Responsibilities

  • Design and implement data platform on AWS including data management.
  • Build and maintain scalable ETL/ELT pipelines with data quality controls.
  • Develop production-grade data pipelines for various ML workflows.

Skills

AWS experience
Python (PySpark, pandas)
SQL
Data pipeline implementation
Observability for pipelines

Tools

S3
Glue/Athena/EMR
Redshift
SageMaker
Job description
Job Title: Data Engineer/ML Engineer

Department: Applied AI & Data Engineering

Type: Full-time (FTE)

Reports to: Head of Applied AI & Data Engineering

This role requires a minimum of four (4) days per week working onsite at EnStream’s head office in Toronto; this requirement may be changed at management’s discretion.

Who is EnStream

EnStream is a leader in secure digital identity and mobile data intelligence, working to advance the future of digital trust in Canada. We build innovative data-driven models that enhance the integrity, reliability, and safety of digital identity ecosystems. Our latest initiative leverages advanced data science, machine learning, and deep learning to further grow and sustain digital trust across Canada.

Our mission is to empower frictionless trust in every interaction. EnStream is dedicated to increasing trust and convenience for Canadians using real-life, verified identities and network data held by trusted telco networks. At EnStream, every team member plays a critical role in shaping our strategy and delivering meaningful impact across industries.

About the Role

We’re hiring a hands‑on Data & ML Engineer to help build and scale the EnStream Trust Platform’s data platform and machine learning pipelines. You’ll design robust data and ML pipelines across internal and partner data sources, with a strong focus on production readiness, observability, and repeatability. The data and ML pipelines you’ll build and support span tabular and graph features and AI/ML models, using unsupervised and semi‑supervised approaches for anomaly detection, clustering, and risk scoring.

What You’ll Do
  • Design and implement the EnStream Trust Platform’s data platform on AWS, including ingestion, data quality, error management, data value, data flow, data security design patterns, curated/feature‑ready datasets, and governed access layers
  • Build and maintain scalable ETL/ELT pipelines (batch and/or streaming as needed) with strong data quality controls (schema checks, validation rules, reconciliations) and clear lineage/metadata
  • Develop production‑grade data pipelines for both tabular and graph signals, supporting unsupervised and semi‑supervised learning workflows
  • Implement end‑to‑end observability for data and ML pipelines: logging, metrics, tracing, alerting, and dashboards for pipeline health, data quality, latency, and model performance/drift where applicable
  • Establish engineering best practices for reliability and handoff: versioned code and datasets, configuration‑driven runs, CI/CD for pipelines, and runbooks for operations and incident response
  • Partner with product and external partners to align on data contracts, delivery cadence, and measurable outcomes
What You Bring
Must-Have Skills & Experience
  • Hands‑on AWS experience across data engineering and ML engineering (e.g., S3, Glue/Athena/EMR, Redshift, SageMaker), including orchestration and monitoring
  • Strong Python (PySpark and/or pandas) and SQL, with a track record of building reliable, maintainable data pipelines and feature datasets
  • Hands‑on experience engineering data and ML pipelines on AWS (e.g., S3, Glue/Athena/EMR, Redshift, Step Functions, SageMaker), including orchestration and cost/performance considerations
  • Proven ability to implement observability for pipelines (data quality monitoring, metrics/logging, alerting, dashboarding) and operate services in production
  • Experience supporting ML workflows end‑to‑end (data/feature generation, training/scoring pipelines, reproducible environments, and configuration/parameter traceability)
  • Exposure to both tabular and graph data modeling contexts, including unsupervised and/or semi‑supervised approaches used to generate risk/anomaly/clustering signals
Nice-to-Have Skills & Experience
  • Prior data science experience (or strong applied analytics background) to help validate assumptions and interpret model outputs with stakeholders.
  • Familiarity with modern MLOps tooling and patterns (experiment tracking, model registry, CI/CD for ML, infrastructure as code).
  • Experience with graph analytics/graph ML frameworks (e.g., NetworkX, PyG, DGL) and/or graph databases (e.g., Neptune, Neo4j).
  • Experience with streaming data systems and event‑driven pipelines (e.g., Kinesis, Kafka).
  • Experience with containerized workloads and orchestration (Docker, Kubernetes/EKS) and infrastructure automation
Why Join Us?
  • Contribute to a national‑scale initiative defining the future of digital trust in Canada
  • Work on cutting‑edge fraud detection applications using real‑world identity data
  • Collaborate with a highly skilled, cross‑functional team
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.