Enable job alerts via email!

Senior Data Engineer

Spokeo

Pasadena (CA)

Remote

USD 120,000 - 160,000

Full time

5 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative firm is seeking a Senior Data Engineer to enhance data systems and automate processes. This role involves developing data pipelines and collaborating with data science teams to create impactful data products. You'll work with cutting-edge technologies in a remote-first environment, contributing to meaningful projects that help millions reconnect with friends and family. Join a company recognized for its commitment to work-life balance and employee satisfaction, where your expertise will drive the future of data transparency.

Benefits

Bonus program
Equity plans
401K matching
100% medical/dental/vision coverage
Unlimited employee PTO

Qualifications

  • 7+ years of development experience in data engineering.
  • Proven experience with large datasets exceeding 100M+ records.
  • Hands-on programming experience with Python and Spark.

Responsibilities

  • Build data automation pipelines for ingestion and processing.
  • Collaborate with stakeholders to develop data products.
  • Create unit tests to monitor technical performance.

Skills

Python
AWS
Spark
PySpark
SQL
Data Governance
Data Pipeline Automation

Education

Bachelor’s degree in Computer Science
Bachelor’s degree in Information Systems
Bachelor’s degree in Mathematics

Tools

Airflow
DynamoDB
Elasticsearch

Job description

About Spokeo:

Join our mission to make the world more transparent with data.

Spokeo is a people search engine that helps over 18 million monthly visitors reconnect with friends, reunite with families, and protect against fraud. Additionally, our 12 billion records and over 250 million unique profiles help business professionals locate people and assets, research criminal investigation subjects, and more.

Founded in 2006, we have grown to a remote-first company of nearly 200 dedicated employees with an average tenure of 4.5 years. Find out why we were named a “Best Company” for 2023 by Comparably for Women, Compensation, Happiest Employees, Company Perks & Benefits, and Work-Life Balance, as well as “Best CEO” for co-founder Harrison Tang.

About this Opportunity

As a Senior Data Engineer at Spokeo, you will develop, optimize, and improve our data systems such as ETL data, pipeline, storage, and entity resolution. This involves working with infrastructure built in AWS, including Airflow, PySpark, EMR, S3, DynamoDB, and more. This role will help build and improve data products, automation platform features, analytical software packages, and data pipeline orchestration tools.

What You’ll Do:

  • Build infrastructure and data automation pipelines for the ingestion, processing, and loading of data from various sources. Automate and integrate new components into the data pipeline.
  • Collaborate with stakeholders and data science teams to develop data products including entity resolution and best selection to efficiently execute product vision and strategy in alignment with organizational goals and priorities.
  • Create unit and stress test components to monitor technical performance and ensure identified issues are resolved.
  • Develop data analysis tools to provide data insights and capture key metrics.
  • Research solutions and maintain technical documentation.
  • Follow best practices for data governance, quality, cleansing, and other ETL-related activities.

Who You Are:

  • 7+ years of development experience in data engineering within a production environment (internships and academic settings excluded).
  • Proven experience working with large datasets exceeding 100M+ records or multiple terabytes.
  • 2+ years of development experience in highly scalable, distributed systems and cluster architectures using AWS.
  • 5+ years of hands-on programming experience with Python.
  • 5+ years of professional experience working in big data ecosystems, Spark is required; PySpark is preferable.
  • 3+ years experience with SQL, schema design, and dimensional data modeling.
  • 2+ years of professional experience working with dataflow orchestration tools, such as Airflow.
  • 2+ years experience with non-relational databases (e.g., DynamoDB, Elasticsearch, etc.).
  • Bachelor’s degree in Computer Science, Information Systems, Mathematics, or a related field is required.

Working at Spokeo

Spokeo offers a bonus program, equity plans, and 401K matching for qualified roles. Twice a year, we do discretionary, merit-based salary increases. Additional benefits include 100% medical/dental/vision coverage and unlimited employee PTO.

Spokeo extends written offers to candidates who successfully complete their selection process. Spokeo’s offers include a base salary, participation in a company bonus program, stock options, and comprehensive benefits. A final offer will depend on several factors, including, but not limited to, marketplace competition, job leveling, the candidate’s experience, skills, etc.

Privacy Notice for Candidates: https://www.spokeo.com/recruiting-policy

Spokeo is an equal-opportunity employer. Applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability, or protected veteran status. Spokeo fosters a business culture where ideas and decisions from all people help us grow, innovate, create the best products, and be relevant in a rapidly changing world.

Recruiters or staffing agencies: Spokeo is not obligated to compensate any external recruiter or search firm who presents a candidate or their resume or profile to a Spokeo employee without 1) a current, fully executed agreement on file, and 2) being assigned to the open position (as a search) via our applicant tracking solution.

#LI-Remote

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Data Engineer - DeFi

ZipRecruiter

Washington

Remote

USD 90,000 - 150,000

2 days ago
Be an early applicant

Senior Data Engineer

Tyler Technologies

Washington

Remote

USD 93,000 - 135,000

4 days ago
Be an early applicant

Senior Data Engineer

Capital Bank MD

Rockville

Remote

USD 115,000 - 141,000

6 days ago
Be an early applicant

Sr. Data Engineer

First American

California

Remote

USD 126,000 - 169,000

7 days ago
Be an early applicant

Senior Data Engineer - DeFi

ZipRecruiter

New York

Remote

USD 90,000 - 150,000

2 days ago
Be an early applicant

Senior Data Engineer - DeFi

ZipRecruiter

Cleveland

Remote

USD 90,000 - 150,000

2 days ago
Be an early applicant

Sr. IT Software Engineer - GCP (Sr. Data Engineer) - Remote

Lensa

Lincoln

Remote

USD 94,000 - 160,000

Yesterday
Be an early applicant

Senior Data Engineer, iQueue for Operating Rooms (Western US)

LeanTaaS

Santa Clara

Remote

USD 90,000 - 140,000

2 days ago
Be an early applicant

Senior Data Engineer

Upteam

Remote

USD 100,000 - 150,000

2 days ago
Be an early applicant