Data Engineer

Medeloop

Montreal

Hybrid

CAD 80,000 - 120,000

Full time

17 days ago

Job summary

Medeloop, an early-stage startup, seeks a Data Engineer specializing in healthcare data to develop and maintain data infrastructure in GCP. This role involves designing ETL pipelines and collaborating with data science teams to optimize data architecture, ensuring it meets business requirements for growth and innovation.

Qualifications

  • 3+ years of experience as a data engineer, preferably in healthcare.
  • Experience leading data infrastructure projects on GCP.
  • Knowledge of data governance and security best practices.

Responsibilities

  • Design, implement, and maintain data infrastructure on GCP.
  • Develop ETL pipelines using GCP services.
  • Collaborate with teams to align data architecture with business needs.

Skills

Python
Java
Scala
Analytical Skills
Problem Solving

Education

Bachelor's or Master's degree in Computer Science, Data Science, a related field, or equivalent experience

Tools

GCP
Terraform
Cloud SQL
Spark

Job description

We are seeking a Data Engineer with expertise in relational databases, healthcare data, medical terminology, ETLs, data exchange, and GCP to join Medeloop, an early-stage startup. As a Data Engineer at Medeloop, you will not only design, develop, and maintain our data infrastructure but also engage in a variety of engineering functions, offering a fantastic opportunity for exposure and growth. Working closely with our data science and product teams, you will ensure that our data architecture meets the evolving needs of the business. This role is ideal for those who are eager to help out wherever needed and are looking for great growth and leadership opportunities as we expand.

Key Responsibilities:

  • Drive the design, implementation, and maintenance of our data infrastructure on GCP
  • Develop ETL pipelines using GCP Dataflow and Cloud Storage to ingest, transform, and load data from various sources
  • Collaborate with data science and product teams to understand data requirements and develop solutions to meet those requirements
  • Develop data exchange protocols using GCP services like Cloud Functions, Cloud Workflows and Cloud Pub/Sub to facilitate the transfer of data between parties
  • Build vector databases on Cloud SQL to store and manage complex healthcare terminology
  • Ensure the security and integrity of our data infrastructure by implementing appropriate security measures and data governance policies
  • Design and optimize large-scale Spark workloads on Cloud Dataproc
  • Continuously evaluate and improve our data infrastructure so that it keeps pace with the evolving needs of the business and with industry trends

Who You Are:

  • Bachelor's or Master's degree in Computer Science, Data Science, a related field, or equivalent experience
  • 3+ years of experience as a data engineer, preferably in the healthcare industry
  • Experience with traditional programming languages such as Python, Java, or Scala
  • Experience leading the design and implementation of data infrastructure projects on GCP
  • Experience with ETL and Big Data services and frameworks such as Cloud Dataproc, BigQuery, and Apache Spark
  • Experience with GCP services like Cloud Storage, Cloud SQL, Cloud Functions, and Cloud Workflows
  • Experience developing cloud infrastructure using Terraform
  • Knowledge of data governance and security best practices
  • Strong analytical and problem-solving skills
  • Excellent communication and collaboration skills
  • Ability to work independently and in a team environment
  • Passion for using data to improve healthcare outcomes

Nice To Have:

  • Strong knowledge of vector databases, medical terminology, and healthcare data
  • Experience with healthcare common data models and data exchange protocols such as OMOP CDM, FHIR, and others
  • Experience with AWS services (EMR, Step Functions, Lambda, Aurora RDS, Glue)

Apply for this job

Are you currently based in Montreal and open to working in a hybrid model?

Are you willing to undergo a background check in accordance with local laws and regulations?

Describe a data pipeline that you have created using cloud infrastructure (GCP preferred).

Have you run Spark jobs on cloud infrastructure (GCP Dataproc or AWS EMR)? What was the use case, and how did you manage performance?

What is your level of experience with Terraform? Share a brief example of how you’ve used it in a project.

Describe your approach to building automated and orchestrated data workflows.

Have you worked on cross-cloud projects spanning AWS and GCP?

Have you worked with healthcare data subject to HIPAA compliance requirements? If not, do you have experience handling other types of sensitive or regulated data?
