Remote Senior Data Engineer

Varwise

Remote

PLN 100,000 - 130,000

Full time

Yesterday

Job summary

A leading Adtech company is seeking a Remote Senior Data Engineer to manage and scale data processing systems and maintain a complex data lake. Candidates should have 8+ years of software engineering experience, particularly in big data and Scala-based systems. The role emphasizes building reliable data pipelines and optimizing large-scale data solutions. The ideal applicant has strong knowledge of AWS and Spark, along with experience in both SQL and NoSQL databases. This is a fully remote position open to candidates worldwide.

Qualifications

8+ years of professional software engineering experience, focusing on data engineering in big data environments.
4+ years developing and delivering Scala-based systems, plus familiarity with Python and at least one other language.
Proficiency using Spark (PySpark) or TensorFlow.
Hands-on experience developing data solutions on a major cloud platform.

Responsibilities

Create and maintain scalable distributed data processing systems.
Become a core maintainer of the data lake.
Ensure data pipelines run 24/7.
Lead technical discussions to improve tools or projects.

Skills

Data engineering

Big data environments

Distributed systems

Scala

Python

Machine Learning

AWS

Spark

Database management

ETL processes

Education

Bachelor’s degree in Computer Science or related discipline

Tools

AWS

Spark (PySpark)

Databricks

Kubeflow

SageMaker

PostgreSQL

Cassandra

Remote Senior Data Engineer @ Varwise

We are looking for Data Engineers to work remotely for an Adtech company that uses machine learning and data science to build an identity graph, which lets brands reach millions of users through programmatically selected households. The work includes scaling our Big Data asset, which combines billions of transaction data points (including intent, conversions, and first-party data) into an identity graph that must keep scaling in a future cookieless world.

This is a 100% remote position; you will work with team members in NYC.

We value technical excellence, and you will have both the resources and the time to deliver world-class code.

Responsibilities

Work on creating and maintaining reliable and scalable distributed data processing systems
Become a core maintainer of the data lake
Maintain our data lake by building searchable data sets for broader business uses
Scale, troubleshoot, and fix existing applications and services
Own a complex set of services and applications
Focus on ensuring that our data pipelines run 24/7
Lead technical discussions that drive improvements in tools, processes, or projects
Work on scaling our identity graph to deliver impactful advertising campaigns
Work on data sets exceeding billions of records
Work on AWS‑based infrastructure
Scale our MLOps platform across both traditional ML and LLM/Generative AI-based applications

Qualifications

8+ years of professional software engineering experience, with a focus on data engineering in big data environments.
4+ years of experience developing and delivering production-grade Scala-based systems, plus familiarity with Python and at least one other high-level programming language (e.g., Java, C++, C#).
Proficiency in all aspects of SDLC, from concept to running production systems.
Proficiency using Spark (PySpark) or TensorFlow.
Proven experience building and optimizing large‑scale data pipelines using Databricks and Spark.
Experience participating in ETL and ML pipeline projects based on Airflow, Kubeflow, MLeap, SageMaker, or similar.
Hands‑on experience developing and deploying data solutions in a major cloud platform (AWS, GCP, or Azure).
Experience working with AI, LLMs, Agents, and/or Generative AI technologies, both in product applications and for development productivity.
Large-scale database experience with both SQL and NoSQL systems such as PostgreSQL, Cassandra, Neo4j, Neptune, or similar.
Experience with large-scale data management formats and frameworks such as Parquet, ORC, Databricks/Delta Lake, Iceberg, or Hudi.
Bachelor’s degree in Computer Science or related discipline.
