Enable job alerts via email!

Staff Software Engineer - Observability

Databricks

Mountain View (CA)

On-site

USD 192,000 - 260,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a pioneering data and AI company that empowers data teams to tackle significant global challenges. As a software engineer in the Runtime Observability team, you will develop innovative solutions that enhance the performance and reliability of products and infrastructure. Your work will involve collaborating across teams to identify key metrics, building robust tooling for metrics collection, and scaling observability solutions for millions of instances. This role offers a unique opportunity to make a meaningful impact in a fast-paced environment that values creativity and technical excellence, all while enjoying comprehensive benefits and a commitment to diversity and inclusion.

Benefits

Comprehensive health coverage
401(k) Plan
Equity awards
Flexible time off
Paid parental leave
Family Planning
Gym reimbursement
Annual personal development fund
Work headphones reimbursement
Employee Assistance Program (EAP)

Qualifications

  • 4+ years of experience in Java, Scala, C++, or similar languages.
  • Experience in large-scale distributed systems and metrics collection.

Responsibilities

  • Collaborate with teams to identify performance metrics.
  • Build tooling for logging and aggregating metrics.

Skills

Java
Scala
C++
Software Development
Distributed Systems
Metrics Collection
Health Monitoring
Observability Tools

Education

Bachelor's degree in Computer Science
Higher degree in related field

Tools

Observability Tools

Job description

P-186

At Databricks, we are inspired by allowing data teams to solve the world’s toughest problems, from security threat detection to cancer drug development. We do this by building and running the world’s best data and AI infrastructure platform, so our customers can focus on the high value challenges that are central to their own missions.

Our engineering teams build technical products that fulfill real, important needs in the world. We always push the boundaries of data and AI technology, while simultaneously operating with the security and scale that is important to making customers successful on our platform.

We develop and operate one of the largest scale software platforms. The fleet consists of millions of virtual machines, generating terabytes of logs and processing exabytes of data per day. At our scale, we observe cloud hardware, network, and operating system faults, and our software must gracefully shield our customers from any of the above.

As a software engineer in the Runtime Observability team, you will develop observability solutions that provide insights into the health and performance of our products and infrastructure.

You will report directly to the Director of Engineering.

The Impact You Will Have:
  • You will collaborate with different teams to identify metrics that allow engineers to observe how well the system and different subcomponents are performing.
  • You will build tooling and infrastructure to allow components to emit, log, and aggregate metrics that can be displayed on dashboards and used for alerting.
  • You will scale the observability solutions to support millions of instances and billions of queries per day.
  • You will develop processes and training for developers and field engineers to debug performance and reliability issues affecting customers.
What We Look For
  • BS (or higher degree) in Computer Science, or a related field
  • 4+ years of production level experience in one of: Java, Scala, C++, or similar language.
  • Experience in software development, in large-scale distributed systems
  • Familiarity with metrics collection, health monitoring, and observability tools
  • Experience building relationships with developers and field engineers to facilitate assessment and mitigation of performance and reliability problems.
  • 6+ years of production level experience in one of: Java, Scala, C++, or similar language.
  • Experience driving large projects involving multiple teams. Provide appropriate guidance on developing large-scale systems that can handle billions of queries per day.
Benefits
  • Comprehensive health coverage including medical, dental, and vision
  • 401(k) Plan
  • Equity awards
  • Flexible time off
  • Paid parental leave
  • Family Planning
  • Gym reimbursement
  • Annual personal development fund
  • Work headphones reimbursement
  • Employee Assistance Program (EAP)
  • Business travel accident insurance
Pay Range Transparency

Databricks is committed to fair and equitable compensation practices. The pay range(s) for this role is listed below and represents base salary range for non-commissionable roles or on-target earnings for commissionable roles. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location. Based on the factors above, Databricks utilizes the full width of the range. The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above. For more information regarding which range your location is in visit our page here.

Local Pay Range: $192,000 — $260,000 USD

About Databricks

Databricks is the data and AI company. More than 10,000 organizations worldwide — including Comcast, Condé Nast, Grammarly, and over 50% of the Fortune 500 — rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark, Delta Lake and MLflow. To learn more, follow Databricks on Twitter, LinkedIn, and Facebook.

Our Commitment to Diversity and Inclusion

At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics.

Compliance

If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Staff Software Engineer - SRE, Backend (Reliability Engineering)

Affirm

Palo Alto

Remote

USD 225,000 - 275,000

8 days ago

Reposted :Staff Software Engineer, Ads Infrastructure

Discord

San Francisco

Remote

USD 248,000 - 279,000

16 days ago

Staff Software Engineer - SRE, Backend (Reliability Engineering)

Affirm

San Jose

Remote

USD 200,000 - 275,000

17 days ago

Staff Software Engineer, Viaduct

Jobs via Dice

Remote

USD 204,000 - 255,000

3 days ago
Be an early applicant

Staff Software Engineer, Mozilla VPN

Mozilla

Remote

USD 138,000 - 217,000

2 days ago
Be an early applicant

Staff Software Engineer, Mozilla VPN New Remote US

Mozilla Corporation

Remote

USD 138,000 - 217,000

2 days ago
Be an early applicant

Staff Software Engineer, Speculative Decoding Mountain View, CA (Remote)

Groq Inc.

Mountain View

Remote

USD 175,000 - 308,000

30+ days ago

Staff Software Engineer

People In AI

Remote

USD 180,000 - 200,000

5 days ago
Be an early applicant

Staff Software Engineer

ZipRecruiter

New London

Remote

USD 175,000 - 240,000

5 days ago
Be an early applicant