Enable job alerts via email!

Tech Lead - Network Operations Center

Databricks Inc.

United States

Remote

USD 114,000 - 220,000

Full time

Yesterday
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company is seeking an experienced Tech Lead to enhance platform observability and proactive monitoring. The role involves developing monitoring solutions, collaborating with engineering teams, and mentoring NOC engineers. This position offers a competitive salary and comprehensive benefits.

Benefits

Comprehensive benefits
Annual performance bonus
Equity options

Qualifications

  • Minimum of 6 years of experience as an SRE or DevOps engineer.
  • Strong knowledge of cloud technologies such as Azure, AWS, and GCP.

Responsibilities

  • Develop tooling and automate processes for platform monitoring.
  • Monitor critical infrastructure and triage alerts to identify incidents.
  • Provide mentorship to other NOC engineers on observability patterns.

Skills

Cloud Technologies
Automation
Scripting
Incident Detection

Education

Bachelor's degree in Computer Science

Tools

Python
ELK
Prometheus
Grafana
PagerDuty
Docker
Kubernetes

Job description

CSQ226R195; This role can be based anywhere in the United States

We are seeking an experienced Tech Lead to shape the future of platform observability and proactive monitoring within our Network Operations Center. The successful candidate will be responsible for developing monitoring solutions, alerting mechanisms, and customer-focused incident detection tools. They will also work closely with cross-functional engineering teams to investigate and resolve incidents, perform root cause analyses, and propose solutions to improve the reliability and stability of the Databricks Platform.

The impact you will have here:

  • Develop tooling and automate processes to enhance platform monitoring, proactive incident detection, and alerting.
  • Monitor critical infrastructure, triage alerts to proactively identify incidents, and collaborate with stakeholders to drive resolution..
  • Contribute to incident post-mortems and propose solutions to improve platform reliability and stability.
  • Participate in war rooms and temporary communication channels during outages.
  • Provide mentorship to other NOC engineers on observability patterns, alert design, and service health metrics.
  • Participate in oncall rotation

What are we looking for?

  • Minimum of 6 years of experience as an SRE or DevOps engineer
  • Strong knowledge of cloud technologies such as Azure, AWS, and GCP
  • Proficiency in automation and scripting (Python)
  • Experience developing monitoring and alerting solutions
  • Experience using tools such as ELK, Prometheus, Grafana, PagerDuty, etc.
  • Experience with containers and orchestration technologies such as Docker and Kubernetes.
  • Bachelor's degree in Computer Science or a related field

Pay Range Transparency

Databricks is committed to fair and equitable compensation practices. The pay range(s) for this role is listed below and represents base salary range for non-commissionable roles or on-target earnings for commissionable roles. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location. Based on the factors above, Databricks utilizes the full width of the range. The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above. For more information regarding which range your location is in visit our page here .

Zone 1 Pay Range

$143,300 — $219,700 USD

Zone 2 Pay Range

$129,000 — $197,700 USD

Zone 3 Pay Range

$121,700 — $186,700 USD

Zone 4 Pay Range

$114,600 — $175,700 USD

About Databricks

Databricks is the data and AI company. More than 10,000 organizations worldwide — including Comcast, Condé Nast, Grammarly, and over 50% of the Fortune 500 — rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark, Delta Lake and MLflow. To learn more, follow Databricks on Twitter ,LinkedIn and Facebook .

Benefits

At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visithttps://www.mybenefitsnow.com/databricks .

Our Commitment to Diversity and Inclusion

At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics.

Compliance

If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.