Sr Big Data Engineer - Oozie and Pig (GCP)

Rackspace Technology

United States

Remote

USD 90,000 - 150,000

Full time

30 days ago

Job summary

An established industry player is on the lookout for a Senior Big Data Engineer to join their remote team. This role offers the opportunity to design and develop scalable batch processing systems utilizing cutting-edge technologies like Hadoop and GCP. The ideal candidate will possess strong programming skills in Java or Python and have a solid understanding of data structures and algorithms. This innovative firm values independent, self-driven engineers who thrive in complex environments and are committed to delivering high-quality solutions. Join a company that embraces diversity and is dedicated to empowering its customers through technology.

Qualifications

  • 5+ years of experience in customer-facing software or technology.
  • Proficiency in Oozie, Pig, and Java or Python required.

Responsibilities

  • Design and develop scalable batch processing systems using Hadoop and Oozie.
  • Collaborate with teams to ensure data pipeline reliability and code quality.

Skills

Java
Python
Oozie
Pig
Hadoop
SQL
Cloud Services
DevOps
Data Structures
Algorithms

Education

Bachelor's degree in Computer Science, Software Engineering, or a related field

Tools

Google Cloud Platform (GCP)
Terraform
Apache Hadoop Ecosystem
CI/CD Pipelines

Job description

About the Role

We are seeking a Senior Big Data Engineer with deep expertise in distributed systems, batch data processing, and large-scale data pipelines. The ideal candidate has strong hands-on experience with Oozie, Pig, and the wider Apache Hadoop ecosystem, and is proficient in Java (preferred) or Python. This role requires a deep understanding of data structures and algorithms, along with a proven track record of writing production-grade code and building robust data workflows.

This is a fully remote position and requires an independent, self-driven engineer who thrives in complex technical environments and communicates effectively across teams.

Work Location: US-Remote, Canada-Remote

Key Responsibilities:

  • Design and develop scalable batch processing systems using technologies like Hadoop, Oozie, Pig, Hive, MapReduce, and HBase, with hands-on coding in Java or Python.
  • Write clean, efficient, and production-ready code with a strong focus on data structures and algorithmic problem-solving applied to real-world data engineering tasks.
  • Develop, manage, and optimize complex data workflows within the Apache Hadoop ecosystem, with a strong focus on Oozie orchestration and job scheduling.
  • Leverage Google Cloud Platform (GCP) tools such as Dataproc, GCS, and Composer to build scalable and cloud-native big data solutions.
  • Implement DevOps and automation best practices, including CI/CD pipelines, infrastructure as code (IaC), and performance tuning across distributed systems.
  • Collaborate with cross-functional teams to ensure data pipeline reliability, code quality, and operational excellence in a remote-first environment.

Qualifications:

  • Bachelor's degree in Computer Science, Software Engineering, or a related field of study.
  • Experience with managed cloud services and an understanding of cloud-based batch processing systems are critical.
  • Proficiency in Oozie, Airflow, MapReduce, and Java.
  • Strong programming skills in Java (specifically with Spark), Python, Pig, and SQL.
  • Expertise in public cloud services, particularly in GCP.
  • Proficiency in the Apache Hadoop ecosystem, including Oozie, Pig, Hive, and MapReduce.
  • Familiarity with Bigtable and Redis.
  • Experience applying infrastructure and DevOps principles in daily work, using continuous integration and continuous deployment (CI/CD) tools and Infrastructure as Code (IaC) tools such as Terraform to automate and improve development and release processes.
  • Proven experience in engineering batch processing systems at scale.

Must Have (Important):

  • 5+ years of experience in customer-facing software/technology or consulting.
  • 5+ years of experience with “on-premises to cloud” migrations or IT transformations.
  • 5+ years of experience building and operating solutions built on GCP.
  • Proficiency in Oozie and Pig.
  • Proficiency in Java or Python.

About Rackspace Technology

We are the multicloud solutions experts. We combine our expertise with the world’s leading technologies across applications, data and security to deliver end-to-end solutions. We have a proven record of advising customers based on their business challenges, designing solutions that scale, building and managing those solutions, and optimizing returns into the future. Named a best place to work year after year by Fortune, Forbes and Glassdoor, we attract and develop world-class talent. Join us on our mission to embrace technology, empower customers and deliver the future.

More on Rackspace Technology

Though we’re all different, Rackers thrive through our connection to a central goal: to be a valued member of a winning team on an inspiring mission. We bring our whole selves to work every day. And we embrace the notion that unique perspectives fuel innovation and enable us to best serve our customers and communities around the globe. We welcome you to apply today and want you to know that we are committed to offering equal employment opportunity without regard to age, color, disability, gender reassignment or identity or expression, genetic information, marital or civil partner status, pregnancy or maternity status, military or veteran status, nationality, ethnic or national origin, race, religion or belief, sexual orientation, or any legally protected characteristic. If you have a disability or special need that requires accommodation, please let us know.
