Job Search and Career Advice Platform

Enable job alerts via email!

Staff Software Engineer, Site Reliability (SRE)

Optimal Dynamics

Poland

Remote

PLN 656,000 - 803,000

Full time

30+ days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading logistics technology firm is hiring a Staff Software Engineer, Site Reliability to drive reliability across their production platform. The role involves owning incident response, defining SLIs and SLOs, and embedding security into the delivery pipeline. Candidates should have extensive experience in infrastructure and cloud computing, preferably with a background in containerization and strong Python skills. Competitive compensation and benefits package offered, with a focus on a collaborative environment.

Benefits

Competitive compensation
Health/Dental/Vision 100% covered
Unlimited PTO
401(k) with match
Paid Parental Leave
Fitness membership reimbursement

Qualifications

  • Experienced Staff-level Individual contributor leading reliability programs.
  • Technical grounding in infrastructure, cloud, and containerization.
  • Proficient in Python to drive operational improvements.

Responsibilities

  • Own the incident lifecycle across the company.
  • Define and drive SLIs/SLOs for core services.
  • Embed security measures into the delivery pipeline.

Skills

Infrastructure at scale
Cloud computing
Containerization (ECS/Kubernetes)
CI/CD
Python proficiency
Data-driven decision making
Strong communication skills
Influential leadership

Tools

AWS
Terraform
Bazel
GitHub
Job description
Staff Software Engineer, Site Reliability (SRE)

Remote

About Our Company

Built on over four decades of pioneering research at Princeton University, our platform represents the leading edge of innovation in freight and transportation planning. We help customers unlock double-digit revenue gains and drive smarter, data-driven operations at scale. With the recent close of our Series C funding round led by Koch Disruptive Technologies, we’re entering an exciting new phase of growth. Today, Optimal Dynamics is a high-growth company of ~70 employees, backed by top-tier investors including Bessemer Venture Partners, The Westly Group, Activate Capital, and Koch.

We\'re on a mission to redefine the way logistics decisions are made—and we’re just getting started.

About the Role

We’re hiring a Staff Software Engineer, Site Reliability to lead reliability across our production platform. As a Staff‑level Individual contributor, you will drive strategy and hands‑on execution across incident response, SLO/SLI programs, and production readiness, directly owning highly available services in AWS; all while partnering with Platform/Infra to build paved‑road tooling in our monorepo.

This is a full‑time, remote‑friendly role open to candidates across the United States. For those who prefer an in‑office experience, our HQ in New York City offers a collaborative environment.

What You’ll Do

Reliability (≈50%)

  • Own the company‑wide incident lifecycle: standards for detection, escalation, incident command, customer comms, and high‑quality postmortems with action tracking.
  • Define and drive SLIs/SLOs for core services; build guardrails and dashboards that make reliability visible and actionable.
  • Lead production readiness reviews, capacity/performance planning, load testing, disaster recovery exercises, and resilience engineering (failure testing/chaos where appropriate).
  • Level‑up on‑call: right‑sizing rotations, paging hygiene, runbooks, auto‑remediation, and continuous improvement of MTTA/MTTR.

Security (≈30%)

  • Embed security into the delivery pipeline: dependency and image scanning, least‑privilege/IAM baselines, secrets management, and service‑to‑service auth.
  • Partner with Engineering leadership to maintain SOC 2‑aligned controls as code; make audit‑friendly evidence generation part of everyday engineering.
  • Drive secure‑by‑default patterns in the platform (e.g., network posture, data protection, runtime policies) without slowing down developers.
  • Build and evolve paved roads for deploys, config, and runtime operations in our monorepo (Bazel) and CI/CD (AWS CodePipeline/CodeBuild).
  • Partner with product teams to make the “secure, reliable default” the easiest path—templates, tooling, libraries, and automation.
Who You Are
  • Experienced: Staff‑level IC who has led reliability programs at meaningful scale and owned incident response standards.
  • Technically Grounded: Deep, hands‑on experience with infrastructure at scale, cloud, containerization, and more:
  • ECS and/or Kubernetes containerization workloads
  • CICD & IaC (Terraform)
  • Python Proficient: You can read/review service code and land operational improvements.
  • Data Driven: In your approach to SLOs, capacity, performance, and cost efficiency with strong observability chops
  • Influential: Able to shape direction and create simple, durable standards
  • Communicative: Excels in both technical and interpersonal communication, with strong written and verbal skills
Nice To Have (Bonus Points)
  • Aware of FinOps (cost attribution, efficient scaling) and DR/BCP program experience.
  • Familiar with secure SDLC, threat modeling, and compliance automation in a SOC 2 context.
  • Experience collaborating with Data Science/ML teams and batch/streaming workloads.
  • Exposure to monorepo frameworks such as Bazel, Buck, etc.
About our tech stack and development practices

At Optimal Dynamics, our entire infrastructure runs on AWS, leveraging a wide range of services including DynamoDB, Aurora, SSM, and SQS to power our intelligent logistics platform.

  • Backend & AI: Python 3 and Java
  • Data Stack: Trino, Dagster, dbt, DuckDB, and Preset
  • IaC: Terraform and Spacelift
  • Cloud: AWS (ECS/RDS/S3/etc)
  • CI/CD: Bazel, Github, AWS CodePipeline/CodeBuild

We follow modern development best practices with all code stored on GitHub. Every pull request undergoes thorough code reviews, is fully unit tested, and deployed through our CI/CD pipeline for continuous quality assurance.

Pay Range

$180,000 - $220,000 USD

  • Competitive compensation, including Series C level equity
  • Health / Dental / Vision 100% covered for employee and 50% for dependents
  • Life Insurance, with optional supplemental insurance
  • Flexible Spending Account (FSA)
  • Health Spending Account (HSA)
  • 401(k) with match
  • Unlimited PTO (vacation, personal days, sick days, jury duty, military leave, bereavement)
  • 11 Holidays
  • Paid Parental Leave for all employees
  • Short-term and Long-term Disability Insurances, and AD&D Insurance
  • Fitness membership reimbursement

Equal Employment Opportunity Optimal Dynamics is proud to be an equal opportunity employer that celebrates diversity and is committed to creating an inclusive workplace with equal opportunity for all applicants and employees.

Background and Compliance If you are selected for a position, there will be a background screen, which may include checks for criminal records and/or motor vehicle reports and/or drug screening, depending on the position requirements. Job‑related concerns identified during the background screening may disqualify you from the new position or your current role. Background results will be evaluated on a case‑by‑case basis. Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.