Tech Lead - Kubernetes & Observability (Supervisor, Platform Operations)

Availity

Pennsylvania

Remote

USD 120,000 - 160,000

Full time

Today

Job summary

A leading healthcare technology firm is seeking a Tech Lead for Kubernetes & Observability to manage infrastructure for U.S. healthcare transactions. The ideal candidate will have over 10 years of IT experience, including leadership in technical teams. This role involves overseeing Kubernetes control planes and observability tools, with a focus on enhancing operational efficiency. Competitive salary, unlimited PTO, and 401k match are offered in a remote work setting.

Benefits

Competitive salary
Unlimited PTO
Wellness reimbursement
Education reimbursement

Qualifications

  • 10+ years of relevant technical and business experience in IT systems delivery.
  • 3+ years of experience in IT systems engineering leadership or team management.
  • Strong leadership skills with the ability to motivate and guide technical teams.

Responsibilities

  • Lead and mentor an infrastructure engineering and operations team.
  • Own and advance the Kubernetes/EKS control plane.
  • Manage observability and logging platforms including Splunk.

Skills

Kubernetes/EKS administration at scale
Observability and monitoring tools
Linux systems administration
Terraform
AWS services (VPC, IAM, EC2)

Education

Bachelor’s degree in Computer Science or related field

Tools

Splunk
Cribl
Prometheus/Grafana
OpenTelemetry
New Relic

Job description

Availity delivers revenue cycle and related business solutions for health care professionals who want to build healthy, thriving organizations. Availity has the powerful tools, actionable insights and expansive network reach that medical businesses need to get an edge in an industry constantly redefined by change.

At Availity, we're not just another Healthcare Technology company; we're pioneers reshaping the future of healthcare! With our headquarters in Jacksonville, FL, and an office in Bangalore, India, along with a remote workforce across the United States, we're a global team united by a powerful mission.

We're on a mission to bring the focus back to what truly matters – patient care. As the leading healthcare engagement platform, we're the heartbeat of an industry that impacts millions. With over 2 million providers connected to health plans, and processing over 13 billion transactions annually, our influence is continually expanding.

Join our energetic, dynamic, and forward-thinking team where your ideas are celebrated, innovation is encouraged, and every contribution counts. We're transforming the healthcare landscape, solving communication challenges, and creating connections that empower the nation's premier healthcare ecosystem.

Role

Tech Lead – Kubernetes & Observability (Supervisor, Platform Operations)

You’ll be the technical lead for a team that manages the infrastructure backbone of U.S. healthcare transactions. Availity processes 13+ billion clinical and financial transactions annually, and our Kubernetes control plane, observability stack, and private-cloud platforms are the engines that keep it running. This is not a “keep the lights on” role. You will:

  • Own and evolve our Kubernetes (EKS/Istio) control plane at enterprise scale.
  • Lead the tooling and support for observability and logging (New Relic, Splunk, Cribl, OpenTelemetry) with reliability as your north star.
  • Oversee our EC2 application deployment pipelines and other mission-critical internal platforms in our AWS private cloud.
  • Guide and mentor engineers while setting the bar for operational excellence.

If you thrive where scale, reliability, and technical leadership intersect, this is your chance to make healthcare infrastructure more resilient, faster, and smarter.

By fostering a culture of ownership, innovation, and continuous improvement, you will empower your team to deliver scalable solutions that enhance operational efficiency and system reliability. Your role will be pivotal in aligning technical execution with business goals, mentoring talent, and advancing the maturity of platform operations through modern DevOps and SRE methodologies.

Sponsorship, in any form, is not available for this position.

Location: Remote, US

Qualifications

Role qualifications:

  • Bachelor’s degree in Computer Science or related field, or equivalent work experience.
  • 10+ years of relevant technical and business experience in IT systems delivery, operations, and support (preferably in healthcare or high-transaction environments).
  • 3+ years of experience in IT systems engineering leadership or team management.
  • Hands-on expertise with:
    • Kubernetes/EKS administration at scale; Terraform, Helm, Istio, and AWS services (VPC, IAM, EC2, EKS).
    • Observability and monitoring tools: Splunk, Cribl, Prometheus/Grafana, OpenTelemetry, New Relic.
    • Linux (RHEL-based) systems administration, including SELinux.
  • Experience bridging infrastructure and development teams, ensuring alignment of roadmaps and goals.
  • Strong leadership skills with the ability to motivate and guide technical teams.
  • Excellent communication skills, with the ability to explain complex technical concepts to both technical and non-technical stakeholders.

Preferred:

  • SaaS experience supporting large-scale, mission-critical systems.
  • Familiarity with EC2 deployment pipelines for packaged software and re-platforming to cloud-native environments.
  • Knowledge of service mesh concepts (Istio, Linkerd, etc.).
  • Background in metrics-driven reliability engineering (SLOs, SLIs, error budgets).
  • Exposure to scripting/programming (JavaScript for Cribl, Python, etc.).

You will set yourself apart with:

  • A record of thriving in high-transaction, high-uptime environments while balancing operational rigor with engineering speed.
  • Experience leading hybrid infrastructure transitions (on-prem to cloud).
  • Comfort debugging a Splunk pipeline or tuning an Istio config, and mentoring engineers on their career paths.
  • A belief in autonomy over micromanagement and a passion for giving the team the trust and tools to succeed.

What you will be doing:

  • Leading and mentoring an infrastructure engineering and operations team focused on Kubernetes, observability, and platform services.
  • Owning and advancing the Kubernetes/EKS control plane, Istio service mesh, and related networking/security features (mTLS, SSL/TLS).
  • Managing observability and logging platforms including:
    • Splunk (EKS + on-prem components, forwarders, deployment server).
    • Cribl operational pipelines (EKS-based).
    • New Relic SaaS integrations and Prometheus data ingestion.
    • OpenTelemetry & KubeLogging/Banzai Operator for distributed tracing and logging pipelines.
    • Prometheus/Grafana migrations from on-prem OCP to AWS for metrics scraping and synthetic monitoring.
  • Overseeing EC2 application deployment pipelines for packaged software platforms hosted in AWS, including replatforming away from EL7 to cloud-native solutions.
  • Supporting legacy/on-prem platforms as they migrate into AWS (Tidal, Aries pipelines, provider Splunk, legacy base images).
  • Driving infrastructure-as-code practices (Terraform, Helm, Ansible) for repeatable deployments and environment consistency.
  • Collaborating with engineering, middleware, and product teams to define clear ownership, reduce friction, and ensure platform services enable—not block—delivery.
  • Ensuring upgrades, patching, and platform updates are proactively planned and executed without business disruption.
  • Setting reliability targets and defining operational metrics (availability, latency, error budgets) in line with SRE methodologies.

Availity culture and benefits

  • Availity is a certified “Great Place to Work” and has received several workplace recognitions alongside its DEI initiatives.
  • Culture is important to us with many ways to engage, including employee resource groups and communities.
  • Availity supports continuous learning with resources in our tech stack and industry.
  • Competitive salary, bonus, generous health benefits, and 401k match from day one.
  • Unlimited PTO for salaried associates plus 9 paid holidays; hourly associates have paid time off starting at 19 days.
  • Wellness reimbursement up to $250/year for gym memberships and related programs.
  • Education reimbursement and Paid Parental Leave for birth and adoptive parents.
  • Community involvement through company partnerships and volunteer initiatives.

Next steps:

After you apply, you will receive text/email messages thanking you for applying and updates as you move through the recruitment process.

Interview process:

  • Manager resume review
  • Recruiter video interview
  • Manager video interview
  • Technical video interview
  • Senior leadership video interview

Video Camera Usage:

Availity is camera-forward for remote collaboration. If you are not able to use your camera for all virtual meetings, you should not apply for this role. Having cameras on helps create a more connected and secure environment.

Disclaimers:

Availity is an equal opportunity employer and makes decisions without regard to race, religion, color, age, sex, sexual orientation, gender identity, genetic information, national origin, or other protected classifications. Availity is a drug-free workplace; candidates must pass a drug test before employment. Federal law requires identity and employment eligibility verification via I-9 and E-Verify where applicable. Learn more about E-Verify at http://www.dhs.gov/e-verify.

Click to view Federal Employment Notices: Family & Medical Leave Act, Equal Employment Opportunity Poster, Pay Transparency, Employee Polygraph Protection Act, IER Right to Work Poster, and other notices.
