Responsibilities
- Google Cloud Platform (GCP) services: AI Platform (Vertex AI), Cloud Storage, BigQuery, Cloud Functions, Cloud Pub / Sub, Cloud Build, Airflow, and Cloud Run.
- Understanding of ML concepts and LLMs (training, validation, hyperparameter tuning, evaluation).
- Experience with TensorFlow, Keras, PyTorch, and scikit-learn.
- Data preprocessing, ETL, and data pipelines using pyspark / Scala using serverless dataproc.
- CI / CD for ML (MLOps): Knowledge of CI / CD tools Looper pro / Jenkins; model versioning, continuous training, and deployment using Vertex AI pipelines.
- Automation & Scripting: Strong programming skills in Python, Bash, and SQL; automation of workflows and ML pipelines.
- DevOps & Containerization: Kubernetes (GKE) and Docker for containerization and orchestration; good to have Helm charts and YAML for Kubernetes deployments.
- Monitoring & Observability: Cloud Monitoring, Cloud Logging, Prometheus, and Grafana for monitoring and alerting; model performance monitoring with Vertex AI Model Monitoring.
- Security & Compliance: Understanding of VPC, firewall rules, and service accounts.
- Data Science: Must understand general data science methods and the development life cycle.
Qualifications
- Must understand general data science methods and the development life cycle (from the data science perspective to deployment).
- Experience with cloud-based ML platforms and modern ML tooling as listed above.
Seniority level
Employment type
Job function
Industries
- IT Services and IT Consulting
Toronto, Ontario, Canada CA$140,000 - CA$180,000