
Data Platform Engineer

HRB

Toronto

On-site

CAD 80,000 - 120,000

Full time

Today

Job summary

A technology company in Toronto seeks a skilled engineer to build scalable data platforms. You will design and optimize the data platform that powers our API, handling data streaming and storage. Candidates should have 3+ years in data engineering, expertise in Java and Python, and experience with tools such as Kafka and Flink. Join us to tackle complex backend challenges and contribute to impactful product development.

Qualifications

  • 3+ years of experience in platform engineering or data engineering.
  • 2+ years of experience designing and optimizing data pipelines at TB to PB scale.
  • Familiarity with lake-house architectures like Iceberg and Delta.
  • Experience with real-time data processing tools like Kafka, Flink and Spark.

Responsibilities

  • Own projects enhancing data replication, storage, and reporting capabilities.
  • Design scalable storage solutions for handling petabytes of data.
  • Develop and maintain real-time data systems.
  • Write clean, maintainable code in Java and Python.

Skills

Platform engineering
Data engineering
Java
Python
System design
Problem-solving

Tools

Apache Iceberg
Apache Kafka
Apache Flink
AWS S3
AWS DynamoDB
Apache Spark
AWS CDK

Job description

About the role

We’re looking for an engineer who thrives on building scalable data platforms and enjoys tackling complex backend challenges. This isn’t just a data engineering role: you’ll be designing and optimizing the data platform that powers our API, managing everything from data streaming and storage to analytics features at petabyte scale.

You should be comfortable navigating both data and backend engineering, with a solid foundation in software development. You’ll work with advanced data architectures, including Iceberg, Flink, and Kafka, taking on large‑scale challenges and contributing to core product development in Java and Python. If you’re excited by the opportunity to shape a high‑impact platform and tackle diverse engineering problems, we’d love to hear from you.
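
To give a concrete flavour of the streaming side, here is a minimal Flink‑plus‑Kafka sketch in Java. It only reads a topic and prints the records; the broker address, topic name, and plain‑string deserialization are illustrative placeholders, not our actual pipeline.

    // Minimal Flink streaming sketch (illustrative only): read device events
    // from a Kafka topic and print them. Broker, topic, and group id are
    // placeholder values.
    import org.apache.flink.api.common.eventtime.WatermarkStrategy;
    import org.apache.flink.api.common.serialization.SimpleStringSchema;
    import org.apache.flink.connector.kafka.source.KafkaSource;
    import org.apache.flink.connector.kafka.source.enumerator.initializer.OffsetsInitializer;
    import org.apache.flink.streaming.api.datastream.DataStream;
    import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

    public class DeviceEventJob {
        public static void main(String[] args) throws Exception {
            StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();

            KafkaSource<String> source = KafkaSource.<String>builder()
                    .setBootstrapServers("localhost:9092")           // placeholder broker
                    .setTopics("device-events")                      // placeholder topic
                    .setGroupId("device-event-job")
                    .setStartingOffsets(OffsetsInitializer.earliest())
                    .setValueOnlyDeserializer(new SimpleStringSchema())
                    .build();

            DataStream<String> events =
                    env.fromSource(source, WatermarkStrategy.noWatermarks(), "kafka-source");

            // A real job would parse, enrich, and write downstream; printing keeps the sketch short.
            events.print();

            env.execute("device-event-job");
        }
    }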

What you will do:
  • Own projects aimed at enhancing data replication, storage, enrichment, and reporting capabilities.
  • Build and optimize efficient streaming and batch data pipelines that support our core product and API.
  • Design scalable storage solutions for handling petabytes of IoT and time‑series data (see the storage sketch after this list).
  • Develop and maintain real‑time data systems to ingest growing data volumes.
  • Implement distributed tracing, data lineage and observability patterns to improve monitoring and troubleshooting.
  • Write clean, maintainable code in Java and Python for various platform components.
  • Shape architectural decisions to ensure scalability and reliability throughout the data platform.
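
For the storage side referenced above, the following sketch shows one way a day‑partitioned Iceberg table for time‑series events could be defined in Java. The warehouse path, namespace, and schema are hypothetical placeholders, not our production layout.

    // Illustrative only: define a day-partitioned Iceberg table for device
    // time-series data. Warehouse location, namespace, and schema are placeholders.
    import org.apache.hadoop.conf.Configuration;
    import org.apache.iceberg.PartitionSpec;
    import org.apache.iceberg.Schema;
    import org.apache.iceberg.catalog.TableIdentifier;
    import org.apache.iceberg.hadoop.HadoopCatalog;
    import org.apache.iceberg.types.Types;

    public class CreateEventsTable {
        public static void main(String[] args) {
            Schema schema = new Schema(
                    Types.NestedField.required(1, "device_id", Types.StringType.get()),
                    Types.NestedField.required(2, "event_time", Types.TimestampType.withZone()),
                    Types.NestedField.optional(3, "payload", Types.StringType.get()));

            // Partition by day on the event timestamp so time-range queries prune files.
            PartitionSpec spec = PartitionSpec.builderFor(schema)
                    .day("event_time")
                    .build();

            // HadoopCatalog against an S3 warehouse path (placeholder); other setups
            // might use a Glue or REST catalog instead.
            HadoopCatalog catalog = new HadoopCatalog(new Configuration(), "s3://example-bucket/warehouse");
            catalog.createTable(TableIdentifier.of("analytics", "device_events"), schema, spec);
        }
    }
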
The ideal candidate will have:
  • 3+ years of experience in platform engineering or data engineering.
  • 2+ years of experience designing and optimizing data pipelines at TB to PB scale.
  • Proficiency in Java, with a focus on clean, maintainable code.
  • Strong system design skills with a focus on big data and real‑time workflows.
  • Familiarity with lake‑house architectures (e.g., Iceberg, Delta, Paimon).
  • Experience with real‑time data processing tools like Kafka, Flink and Spark.
  • Knowledge of distributed systems and large‑scale data challenges.
  • Strong problem‑solving skills and a collaborative mindset.
  • Nice‑to‑have:
    • Experience with orchestration/workflow engines (e.g., Step Functions, Temporal).
    • Experience with serverless and/or event‑driven architectures (e.g., AWS Lambda, SQS).
    • Experience with JavaScript/TypeScript (for cross‑team work).
Tech Stack
  • Languages: Java, Python
  • Framework: Spring Boot
  • Storage: AWS S3, AWS DynamoDB, Apache Iceberg, Redis
  • Streaming: AWS Kinesis, Apache Kafka, Apache Flink
  • ETL: AWS Glue, Apache Spark
  • Serverless: AWS SQS, AWS EventBridge, AWS Lambda, AWS Step Functions
  • Infrastructure as Code: AWS CDK (sketched below)
  • CI/CD: GitHub Actions
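
As a small illustration of the infrastructure‑as‑code piece, a CDK (v2) stack in Java might look like the sketch below. Construct ids and settings are placeholders, not our real infrastructure.

    // Illustrative CDK (v2, Java) sketch: an S3 data bucket and an SQS queue.
    // Construct ids and settings are placeholders.
    import software.amazon.awscdk.App;
    import software.amazon.awscdk.Duration;
    import software.amazon.awscdk.Stack;
    import software.amazon.awscdk.services.s3.Bucket;
    import software.amazon.awscdk.services.sqs.Queue;
    import software.constructs.Construct;

    public class DataPlatformStack extends Stack {
        public DataPlatformStack(final Construct scope, final String id) {
            super(scope, id);

            // Versioned bucket for raw event data (placeholder name).
            Bucket.Builder.create(this, "RawEventsBucket")
                    .versioned(true)
                    .build();

            // Queue feeding downstream ingestion (placeholder settings).
            Queue.Builder.create(this, "IngestQueue")
                    .visibilityTimeout(Duration.seconds(300))
                    .build();
        }

        public static void main(final String[] args) {
            App app = new App();
            new DataPlatformStack(app, "DataPlatformStack");
            app.synth();
        }
    }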