Enable job alerts via email!

Principal Engineer - Ozone/HDFS

Cloudera

Bengaluru

On-site

INR 6,75,000 - 9,00,000

Full time

Today
Be an early applicant

Job summary

A leading data solutions provider based in Bengaluru is looking for a Principal Software Engineer to work on Apache Ozone. The role involves designing core features, mentoring engineers, and contributing to the open-source community. Ideal candidates should have extensive experience in backend engineering, particularly with Java and distributed systems. This position offers competitive benefits, including a generous PTO policy and flexible work arrangements.

Benefits

Generous PTO Policy
Flexible WFH Policy
Mental & Physical Wellness programs
Phone and Internet Reimbursement program
Access to Continued Career Development

Qualifications

  • Bachelor's +15 years or Master's +12 years relevant experience required (8+ for PhD).
  • Experience with large-scale distributed systems design and development.

Responsibilities

  • Design and implement core features of Apache Ozone and Apache Ratis.
  • Regularly contribute code and design docs to the Apache community.
  • Support enterprise customers with big data analytics and ML/AI pipelines.

Skills

Backend engineering
Java programming
C++ programming
Distributed systems design
Data structures and algorithms
Communication skills

Education

BS, MS, or PhD in Computer Science
Job description
Overview

At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.

Cloudera is looking for an exceptional and passionate software engineer with a strong distributed systems background to join the Storage Engineering team focused on building Apache Ozone. The Storage team is responsible for primary storage and storage access layers, which are core to the platform. Apache Ozone provides a massively scalable distributed object store with a distributed file system interface. Ozone is designed to scale to tens of billions of files and blocks, and overcome the limitations of Hadoop Distributed File System (HDFS), namely, millions of small files and managing a huge number of datanodes.

Opportunity to join the team that created and wrote most of the HDFS code and make a huge impact on the big data and cloud computing industry.

Responsibilities
  • As a Principal Software Engineer, you will be directly involved in the design and implementation of the core feature set of Apache Ozone and Apache Ratis (open-source RAFT implementation).

  • You will regularly contribute code and design docs to the Apache open-source community.

  • As part of storage engineering, you will support enterprise customers running 100s of petabytes-scale big data analytics and ML/AI pipelines.

  • You will partner with Engineering leaders, product managers, and cross-functional teams as a part of the Cloudera Data platform ecosystem in understanding requirements and turning them into a solid design and implementation, and facilitating integration and adoption.

  • Additionally, in this role, you will be responsible for leading a talented group of engineers working on the feature and mentoring junior engineers.

Qualifications
  • BS, MS, or PhD in Computer Science

  • Bachelor's +15, Master's +12 years of relevant industry experience required (8+ for PhD candidate)

  • Strong backend engineering skill set with expertise in Java, or strong C++ skills, with intermediate Java expertise

  • Passionate about programming. Clean coding habits, attention to detail, and focus on quality

  • Experience with large-scale, distributed systems design and development with a strong understanding of scaling, replication, consistency, and high availability

  • Solid experience with system software design and development with a strong understanding of computer architecture, storage, network, and IO subsystems, and distributed systems

  • Hands-on programmer with strong data structures and algorithms skillset

  • Strong oral and written communication skills

You may also have
  • Strong background in a distributed storage system, including file systems, database storage internals, NoSQL storage, or distributed hash tables

  • Strong background in performance tuning, identifying performance bottlenecks, and implementing performance optimizations

  • Strong understanding of the Apache Big Data ecosystem and over 3+ years of experience in systems software, including file systems

  • Recognized contributions to open source projects

  • Experience using projects such as Hive, Pig, MapReduce, HBase, etc., is a big plus

  • Good Understanding of storage development, RAFT replication framework, or equivalent distributed consensus frameworks

What you can expect from us
  • Generous PTO Policy

  • Support work life balance with Unplugged Days

  • Flexible WFH Policy

  • Mental & Physical Wellness programs

  • Phone and Internet Reimbursement program

  • Access to Continued Career Development

  • Comprehensive Benefits and Competitive Packages

  • Paid Volunteer Time

  • Employee Resource Groups

EEO/VEVRAA

#LI-RA1

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.