Enable job alerts via email!

Lead Data Engineer

Randstad (Schweiz) AG

Los Angeles (CA)

Remote

USD 120,000 - 180,000

Full time

Today
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A well-funded Series A HRTech startup is seeking a Lead Data Infrastructure Architect. In this US-only remote role, you will be managing billions of data points and leading a team of data engineers. Responsibilities include designing data pipelines, implementing ETL processes using PySpark, and ensuring performance optimization across data warehouses. Ideal candidates will have extensive experience in data engineering and a strong understanding of AWS services.

Qualifications

  • 5-8 years of professional data engineering experience required.
  • Good proficiency in PySpark, AWS data services, and Docker.
  • Strong background in Big Data processing and data warehouse design.

Responsibilities

  • Design scalable data pipelines and architect ETL processes.
  • Distribute enriched data through medallion architecture.
  • Integrate new data sources into the main pipeline.

Skills

Data Engineering
AWS Data Services
Python
SQL
Big Data Processing
ETL Processes
Distributed Computing
Data Warehouse Design

Tools

PySpark
Docker
Postgres
OpenSearch
Pandas

Job description

Company Context

Series A, well-funded US startup in HRTech developing WorkHQ.com and an AI Recruiter product.

This is a US-only, Remote role (Mainland).

Role Overview

Lead data infrastructure architect managing billions of data points across 250M+ professional profiles.

Hire data engineers to aid you in that journey.

Core Responsibilities
  • Design scalable data pipelines processing massive record volumes

  • Architect ETL processes using PySpark on Amazon EMR (Open to shifting to other solutions like Data Bricks / Snowflake)

  • Distribute enriched data through medallion architecture across Postgres, Athena, OpenSearch

  • Integrate new data sources into the main pipeline

  • Implement advanced data matching using Splink

Technical Requirements
  • 5-8 years professional data engineering experience

  • Good proficiency in:

    • PySpark and distributed computing

    • AWS data services (EMR, Glue, Athena)

    • Docker

    • Pandas and DataFrame manipulation

    • Complex data format handling (JSONL, Parquet)

  • Strong background in:

    • Big data processing architectures

    • Data warehouse design

    • Performance optimization

  • Advanced Python, SQL skills

Nice to Have
  • Probabilistic record linking expertise

  • OpenSearch/elasticsearch technologies

  • Machine learning data pipeline design

  • Recruitment tech ecosystem knowledge

Technical Stack
  • Big Data: PySpark, EMR

  • Databases: Postgres, OpenSearch

  • Cloud: AWS

  • Containerization: Docker

  • Data Formats: JSONL, Parquet

  • Analytics: Metabase, Athena, Glue

  • Data Processing: Pandas, Splink

Other Considerations

While this role has specific requirements - if you lack a few technical skills, but motivated to learn and lead the platform, please apply for consideration.

If you are coming from Director/Head of/VP levels that is relevant to this job, you can apply as well.

You will need to apply directly on our platform.

Thank you for your time.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Lead Data Engineer

WorkHQ

Los Angeles null

Remote

Remote

USD 140.000 - 180.000

Full time

7 days ago
Be an early applicant

Lead Data Engineer

Jobot

Salt Lake City null

Remote

Remote

USD 150.000 - 190.000

Full time

Today
Be an early applicant

Lead Data Engineer

Jobot

Grand Prairie null

Remote

Remote

USD 150.000 - 190.000

Full time

Today
Be an early applicant

Lead Data Engineer

Leidos

null null

Remote

Remote

USD 89.000 - 163.000

Full time

2 days ago
Be an early applicant

Lead Data Engineer

Health-E Commerce

null null

Remote

Remote

USD 150.000 - 180.000

Full time

Today
Be an early applicant

Lead Data Engineer

Epoch Biodesign

San Francisco null

Remote

Remote

USD 120.000 - 180.000

Full time

2 days ago
Be an early applicant

Lead Data Engineer

Experteer Italy

null null

Remote

Remote

USD 120.000 - 160.000

Full time

Yesterday
Be an early applicant

Lead Data Engineer

Trust & Will

California null

Remote

Remote

USD 166.000 - 215.000

Full time

2 days ago
Be an early applicant

Lead Data Engineer

GE Vernova

null null

Remote

Remote

USD 98.000 - 140.000

Full time

Today
Be an early applicant