Enable job alerts via email!

AI Data scientist + Data Engineer

spruceinfotech

Toronto

On-site

CAD 80,000 - 130,000

Full time

15 days ago

Job summary

A leading technology company in Toronto is seeking an AI data scientist and data engineer for a hybrid role. This position involves designing and implementing GenAI-driven systems, developing APIs, and leveraging cloud technologies. Candidates should possess strong backend skills, familiarity with data engineering tools, and experience in deploying applications at scale.

Qualifications

  • Must have strong backend development skills in Python and FastAPI.
  • Solid experience with Kubernetes and API deployment required.
  • Proven experience with LLM-based applications needed.

Responsibilities

  • Design and scale AI-driven systems for various applications.
  • Develop and productionize APIs backed by large language models.
  • Implement high-performance Spark workloads to support data flows.

Skills

Backend development
Python
FastAPI
async programming
Kubernetes
Docker
API deployment
Databricks
Delta Lake
PySpark
Cloud platforms
Azure

Tools

Apache Hive
Hadoop
AWS
Kafka
Scala

Job description

Hybrid role 2 to 4 days in Toronto downtown / week.

Combination role of AI data scientist Data Engineer Must have

JD below a mix of Data engineer and data scientist.

What will you do

  • Design build and scale GenAI-driven systems that power research digitization banking workflows global markets and monetization pipelines
  • Develop and productionize APIs and intelligent services backed by large language models (LLMs) and semantic search
  • Build and manage Kubernetes-deployed MCP servers using FastAPI supporting dynamic routing prompt orchestration and multi-source data access
  • Implement high-performance Spark workloads on Databricks and Delta Lake to support structured and unstructured data flows
  • Collaborate with platform teams AI scientists and business stakeholders to deliver context-aware AI-integrated tools
  • Drive CI / CD automation testing and infrastructure-as-code for scalable and secure releases

Must Have

  • Strong backend development skills in Python FastAPI and async programming
  • Solid hands-on experience with Kubernetes Docker and API deployment at scale
  • Deep understanding of Databricks Delta Lake PySpark and distributed data workflows
  • Proven experience building or integrating with LLM-based applications including prompt routing or semantic matching
  • Excellent debugging profiling and optimization skills in high-throughput environments
  • Comfort working with cloud platforms especially Azure

Nice to Have

  • Familiarity with model orchestration frameworks (LangChain LlamaIndex or similar)
  • Experience designing or contributing to MCP-style architectures (multi-modal intent-aware tool-executing systems)
  • Working knowledge of MLflow Airflow or Snowflake
  • Exposure to alternative data sources (web satellite social geospatial) and their AI use cases
  • Understanding of enterprise CI / CD secrets management and secure API gateways

Remote Work : Employment Type :

Contract

Key Skills

Apache Hive,S3,Hadoop,Redshift,Spark,AWS,Apache Pig,NoSQL,Big Data,Data Warehouse,Kafka,Scala

Experience : years

Vacancy : 1

Create a job alert for this search

Data Scientist • Toronto, Ontario, Canada

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.