Ativa os alertas de emprego por e-mail!

Data Engineer

Supersummary

Brasil

Teletrabalho

BRL 20.000 - 80.000

Tempo integral

Há 2 dias
Torna-te num dos primeiros candidatos

Melhora as tuas possibilidades de ir a entrevistas

Cria um currículo adaptado à oferta de emprego para teres uma taxa de sucesso superior.

Resumo da oferta

A leading EdTech company, SuperSummary, is seeking a Data Engineer to join their remote team. In this role, you will design and optimize data systems, collaborate with cross-functional teams, and ensure data quality. Ideal candidates will have 3-6 years of experience, strong skills in Python and SQL, and a background in cloud platforms like Azure.

Serviços

Competitive salary
Benefits
Vacation policy
Workspace improvement stipend
Professional development stipend

Qualificações

  • 3-6 years of experience in data engineering.
  • Strong foundation in Python and SQL.
  • Proven experience with cloud platforms like Azure.

Responsabilidades

  • Design and develop scalable data pipelines.
  • Integrate data from diverse sources.
  • Implement data quality monitoring systems.

Conhecimentos

Python
SQL
Data Modeling
ETL/ELT Development
Data Governance
Problem Solving

Formação académica

Bachelor’s degree in Mathematics, Computer Science, Engineering, Economics, Statistics

Ferramentas

Azure Data Factory
Azure Databricks
Azure SQL Database
Tableau
Power BI

Descrição da oferta de emprego

Join to apply for the Data Engineer role at SuperSummary

Join to apply for the Data Engineer role at SuperSummary

Lift Ventures, a remote-first startup studio whose portfolio of businesses has reached over 250 million consumers to date, is seeking a seasoned and talented Data Engineer for SuperSummary, our fast-growing EdTech business. SuperSummary is a subscription-based website and mobile app offering a library of professionally written study guides and other educational tools and resources on thousands of books for students, teachers, and readers of all types.

About the Job

We are looking for a Data Engineer to join our fully remote team and play a key role in designing, building, and optimizing our data systems. Reporting to the Data Engineering Coordinator, you will collaborate with product managers, department leaders, data scientists, and analysts to deliver high-quality, actionable data insights that shape product development, drive innovation, and support strategic decision-making across the company.

This position is 100% remote, with a preference for candidates based in Latin America. Our distributed team spans the U.S., Brazil, the Philippines, and beyond — we value diverse perspectives and an inclusive, collaborative work environment.

Key Responsibilities

Design and Develop Scalable Data Pipelines

  • Build, maintain, and optimize robust ETL/ELT pipelines using tools such as Azure Data Factory, Azure Databricks, or Synapse Analytics.
  • Ensure pipelines meet business requirements, are scalable, efficient, and well-documented.

Data Integration and Management

  • Integrate data from diverse sources, including APIs, data warehouses, cloud services, and more.
  • Manage and optimize data storage solutions like Azure Data Lake, Azure SQL Database, and Azure Blob Storage.

Ensure Data Quality and Governance

  • Implement data quality monitoring systems and checks to ensure accuracy, consistency, and reliability.
  • Uphold data governance policies and ensure compliance with data security and privacy standards.

Cross-Functional Collaboration

  • Work closely with cross-functional stakeholders to understand evolving data needs and translate them into scalable solutions.
  • Provide technical support for data-related issues and contribute to the continuous improvement of the data infrastructure.

Performance Optimization

  • Monitor, troubleshoot, and enhance the performance and availability of data pipelines.
  • Optimize data workflows to improve speed, efficiency, and cost-effectiveness.

Documentation and Best Practices

  • Document data architectures, workflows, and processes to ensure transparency and knowledge sharing.
  • Advocate for and apply best practices in data engineering, staying current with emerging tools and technologies.

Sample Projects

  • Unified Traffic & Session Pipelines

Built scalable ETL/ELT workflows (Airflow/Databricks) that merged clickstream and session data from Amplitude, Google Analytics, AWR, etc., into a central analytics layer—powering cross-platform marketing + product dashboards.

Developed Python-based scrapers and API orchestrations to pull competitive pricing, book metadata, and review data—fusing it into our recommendation engine for richer decision-making.

  • Data Lakehouse & Warehouse Architecture

Led rollout of a cloud-native Lakehouse (Delta on Databricks/Azure Synapse) alongside a star-schema enterprise warehouse—standardizing schemas, partitioning strategies, and CI/CD for SQL artifacts (dbt or Synapse pipelines).

  • Data Governance & Quality Framework

Established data ownership models, automated lineage tracking, and built monitoring jobs (Great Expectations) to catch schema drift, null spikes, and stale datasets before they hit BI tools.

  • Database Performance & Cost Optimization

Tuned Postgres/Azure SQL/MySQL clusters (indexing, query refactoring, partitioning), slashed query runt.

Job Requirements

Qualifications

  • 3–6 years of experience in data engineering, with a strong foundation in Python and SQL for building, optimizing, and maintaining data pipelines, transformations, and data-driven solutions.
  • Bachelor’s degree (or equivalent experience) in Mathematics, Computer Science, Engineering, Economics, Statistics, or a related technical field.
  • Proven experience working with cloud platforms such as Azure (strongly preferred), AWS, or Google Cloud Platform, including hands-on knowledge of cloud-based data tools and services.
  • Solid understanding of data modeling, ETL/ELT development, and data warehousing principles, with a track record of applying best practices to design scalable and efficient systems.
  • Advanced proficiency with Spark (or similar distributed data processing frameworks), demonstrating the ability to work with large, complex datasets and optimize performance.
  • Familiarity with integrating data into visualization and BI tools (e.g., Tableau, Power BI, QlikView) to support downstream analytics and insights.
  • Strong problem-solving skills with a systems-thinking approach — able to tackle complex technical challenges, design end-to-end solutions, and continuously improve processes.
  • Excellent communication and collaboration skills, with a proven ability to work effectively across cross-functional and international teams.
  • Comfortable working in a professional English-speaking environment (both written and verbal).
  • Work with a distributed, global team that has been remote-first since 2018
  • Competitive salary, benefits, and vacation policy
  • Workspace improvement stipend
  • Professional development and learning stipend

EEOC Statement

SuperSummary supports workplace diversity and does not discriminate on the basis of age, race, national origin, religion, gender identity or expression, sexual orientation, pregnancy, physical or mental disability, or any other protected class.

We welcome diverse perspectives and are dedicated to fostering an inclusive workplace where everyone can grow and thrive. We understand that candidates may not meet every requirement in the job description, but we strongly encourage individuals from all backgrounds to apply. If you’re passionate about this role and our mission, even if your experience doesn’t perfectly match, we’d love to hear from you and explore how you can contribute to our team.

Seniority level
  • Seniority level
    Mid-Senior level
Employment type
  • Employment type
    Full-time
Job function
  • Job function
    Information Technology
  • Industries
    Technology, Information and Internet

Referrals increase your chances of interviewing at SuperSummary by 2x

Sign in to set job alerts for “Data Engineer” roles.
Desenvolvedor Front-end | Front-end Developer - Remoto

Joinville, Santa Catarina, Brazil 1 month ago

Desenvolvedor(a) Fullstack (Python / Vue.js)

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Obtém a tua avaliação gratuita e confidencial do currículo.
ou arrasta um ficheiro em formato PDF, DOC, DOCX, ODT ou PAGES até 5 MB.

Ofertas semelhantes

Consultor(a) Data Engineer BW - 4HANA

Stefanini Group

Curitiba

Teletrabalho

BRL 20.000 - 80.000

Há 3 dias
Torna-te num dos primeiros candidatos

Mid Data Engineer

ília

Teletrabalho

BRL 20.000 - 80.000

Há 4 dias
Torna-te num dos primeiros candidatos

Senior Data Engineer

Bees Brasil

Teletrabalho

BRL 20.000 - 80.000

Há 8 dias

Python and Kubernetes Software Engineer - Data, Workflows, AI/ML & Analytics

Canonical

Florianópolis

Teletrabalho

USD 30.000 - 60.000

Ontem
Torna-te num dos primeiros candidatos

Data Engineer (Remote)

Blue Orange Digital

São Paulo

Teletrabalho

BRL 20.000 - 80.000

Ontem
Torna-te num dos primeiros candidatos

Senior Data Engineer

Luxevision Consulting LLC

Teletrabalho

USD 18.000 - 36.000

Ontem
Torna-te num dos primeiros candidatos

Python and Kubernetes Software Engineer - Data, Workflows, AI/ML & Analytics

Canonical

São Paulo

Teletrabalho

USD 40.000 - 60.000

Ontem
Torna-te num dos primeiros candidatos

Python and Kubernetes Software Engineer - Data, Workflows, AI/ML & Analytics

Canonical

Buenos Aires

Teletrabalho

USD 40.000 - 60.000

Ontem
Torna-te num dos primeiros candidatos

Data Engineer (Remote)

Blue Orange Digital

Rio de Janeiro

Teletrabalho

BRL 20.000 - 80.000

Hoje
Torna-te num dos primeiros candidatos