Enable job alerts via email!

Data Engineer

EnStream LP

Toronto

On-site

CAD 80,000 - 100,000

Full time

Today

Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading digital identity firm in Toronto is seeking a hands-on Data Engineer/ML Engineer to build and scale their data platform and machine learning pipelines. The role includes designing robust data processes, implementing ETL/ELT pipelines, and establishing observability for data quality. The ideal candidate has strong AWS experience, proficiency in Python and SQL, and is capable of ensuring production readiness for data workflows. Competitive salary and professional growth opportunities are offered.

Benefits

National-scale initiative contribution

Cutting-edge applications

Collaborative work environment

Qualifications

Hands-on experience with AWS data engineering and ML tools.
Strong knowledge of Python and SQL for pipeline development.
Experience in implementing observability for production services.

Responsibilities

Design and implement data platform on AWS including data management.
Build and maintain scalable ETL/ELT pipelines with data quality controls.
Develop production-grade data pipelines for various ML workflows.

Skills

AWS experience

Python (PySpark, pandas)

SQL

Data pipeline implementation

Observability for pipelines

Tools

Glue/Athena/EMR

Redshift

SageMaker

Job Title: Data Engineer/ML Engineer

Department: Applied AI & Data Engineering

Type: Full-time (FTE)

Reports to: Head of Applied AI & Data Engineering

This role requires a minimum of four (4) days per week working onsite at EnStream’s head office in Toronto; this requirement may be changed at management’s discretion.

Who is EnStream

EnStream is a leader in secure digital identity and mobile data intelligence, working to advance the future of digital trust in Canada. We build innovative data-driven models that enhance the integrity, reliability, and safety of digital identity ecosystems. Our latest initiative leverages advanced data science, machine learning, and deep learning to further grow and sustain digital trust across Canada.

Our mission is to empower frictionless trust in every interaction. EnStream is dedicated to increasing trust and convenience for Canadians using real-life, verified identities and network data held by trusted telco networks. At EnStream, every team member plays a critical role in shaping our strategy and delivering meaningful impact across industries.

About the Role

We’re hiring a hands‑on Data & ML Engineer to help build and scale the EnStream Trust Platform’s data platform and machine learning pipelines. You’ll design robust data and ML pipelines across internal and partner data sources, with a strong focus on production readiness, observability, and repeatability. The data and ML pipelines you’ll build and support span tabular and graph features and AI/ML models, using unsupervised and semi‑supervised approaches for anomaly detection, clustering, and risk scoring.

What You’ll Do

Design and implement the EnStream Trust Platform’s data platform on AWS, including ingestion, data quality, error management, data value, data flow, data security design patterns, curated/feature‑ready datasets, and governed access layers
Build and maintain scalable ETL/ELT pipelines (batch and/or streaming as needed) with strong data quality controls (schema checks, validation rules, reconciliations) and clear lineage/metadata
Develop production‑grade data pipelines for both tabular and graph signals, supporting unsupervised and semi‑supervised learning workflows
Implement end‑to‑end observability for data and ML pipelines: logging, metrics, tracing, alerting, and dashboards for pipeline health, data quality, latency, and model performance/drift where applicable
Establish engineering best practices for reliability and handoff: versioned code and datasets, configuration‑driven runs, CI/CD for pipelines, and runbooks for operations and incident response
Partner with product and external partners to align on data contracts, delivery cadence, and measurable outcomes

What You Bring

Must-Have Skills & Experience

Hands‑on AWS experience across data engineering and ML engineering (e.g., S3, Glue/Athena/EMR, Redshift, SageMaker), including orchestration and monitoring
Strong Python (PySpark and/or pandas) and SQL, with a track record of building reliable, maintainable data pipelines and feature datasets
Hands‑on experience engineering data and ML pipelines on AWS (e.g., S3, Glue/Athena/EMR, Redshift, Step Functions, SageMaker), including orchestration and cost/performance considerations
Proven ability to implement observability for pipelines (data quality monitoring, metrics/logging, alerting, dashboarding) and operate services in production
Experience supporting ML workflows end‑to‑end (data/feature generation, training/scoring pipelines, reproducible environments, and configuration/parameter traceability)
Exposure to both tabular and graph data modeling contexts, including unsupervised and/or semi‑supervised approaches used to generate risk/anomaly/clustering signals

Nice-to-Have Skills & Experience

Prior data science experience (or strong applied analytics background) to help validate assumptions and interpret model outputs with stakeholders.
Familiarity with modern MLOps tooling and patterns (experiment tracking, model registry, CI/CD for ML, infrastructure as code).
Experience with graph analytics/graph ML frameworks (e.g., NetworkX, PyG, DGL) and/or graph databases (e.g., Neptune, Neo4j).
Experience with streaming data systems and event‑driven pipelines (e.g., Kinesis, Kafka).
Experience with containerized workloads and orchestration (Docker, Kubernetes/EKS) and infrastructure automation

Why Join Us?

Contribute to a national‑scale initiative defining the future of digital trust in Canada
Work on cutting‑edge fraud detection applications using real‑world identity data
Collaborate with a highly skilled, cross‑functional team

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Top locations

Top companies

Top positions