Enable job alerts via email!

Software Engineer - Site Reliability

Partly - Digital Parts Infrastructure

Christchurch

On-site

NZD 80,000 - 130,000

Full time

17 days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Start fresh or import an existing resume

Job summary

Join a leading digital parts infrastructure company as a Site Reliability Engineer. You will ensure the reliability and performance of large-scale systems while collaborating with a talented global team. This role offers the opportunity to work on cutting-edge technology in a dynamic environment focused on sustainability and innovation.

Benefits

Flexible time-off policy
Competitive salary + equity
Relocation support
Learning and development opportunities
Team events and celebrations

Qualifications

  • Experience in software development and systems engineering.
  • Familiarity with infrastructure-as-code and cloud platforms.
  • Mentoring experience and strong collaboration skills.

Responsibilities

  • Ensure stability, scalability, and security of cloud infrastructure.
  • Collaborate with developers to plan infrastructure needs.
  • Participate in incident troubleshooting and application support.

Skills

Proficiency in large software systems
Strong fundamentals in computer science
System engineering skills
Hands-on SRE experience
Deep knowledge of cloud platforms
Ownership and proactive problem-solving
Excellent communication skills

Job description

Note: Partly is headquartered in the UK, with a Product and Engineering base in Christchurch, and an early presence in San Francisco.

If you are not based in Christchurch, we will fly you to HQ for 2 weeks for onboarding, as well as 1 week per quarter for our “Season Openers” (we pay for your travel and accommodation). If you are relocating to Christchurch from NZ or overseas, we can also assist with relocation costs.

Our story

Partly's mission is to connect the world's parts by building the first global platform for replacement parts, starting with auto parts. Our vision is to accelerate a sustainable future where waste is eliminated and all replacement parts are universally searchable, accessible, and available to everyone.

Founded by ex-Rocket Lab engineers, we utilize cutting-edge technology to solve challenging problems in a $1.9 trillion industry. We've tripled our team in the last 12 months and expect to double again. We are a global team across Europe and Australasia.

We provide scalable digital infrastructure solutions to some of the world's largest companies and innovative startups, integrating across hundreds of companies globally to manage parts online.

Our investors include Blackbird Ventures, Square Peg, Octopus Ventures, Icehouse, Peter Beck, Akshay Kothari, and Dylan Field.

We are building a world-class team and a culture where people can do their best work. Our values are reflected in every experience.

This role

The Site Reliability Engineer (SRE) role involves software and systems engineering to build and maintain large-scale, distributed systems, ensuring reliability, uptime, and performance. The role requires high autonomy, leadership, and strategic thinking, ideal for those excited to design and support infrastructure connecting the world's parts.

What Will You Do

  • Ensure the stability, scalability, and security of our cloud infrastructure and third-party applications using Infrastructure-as-Code and automation tools like Terraform, GitOps, Python/Bash scripts.
  • Monitor and optimize costs across cloud and on-prem infrastructure, making resource and architecture recommendations.
  • Collaborate with developers, data engineers, and leadership to plan infrastructure needs, provide tooling, guidance, and training, and support software delivery.
  • Ensure software meets high production standards and drive improvements proactively.
  • Participate in incident troubleshooting, assisting developers with debugging applications, networks, databases, and systems.

Learn more about our culture and challenges here: https://shorturl.at/iAFUX

Skills Needed

  • Proficiency in developing and maintaining large software systems, with knowledge of maintainability and robustness.
  • Strong fundamentals in computer science: data structures, concurrency, APIs, testing, design patterns.
  • System engineering skills: profiling, identifying network issues, etc.
  • Hands-on SRE experience: containerization, infrastructure-as-code, GitOps, scalable infrastructure, CI/CD.
  • Deep knowledge of at least one cloud platform and Linux systems.
  • Ownership, leadership, proactive problem-solving, mentoring experience.
  • Excellent communication and collaboration skills, adaptable to a fast-paced environment.
  • Bonus: experience in high-growth startups, security compliance, specific tools (GCP, ArgoCD, Kafka), databases, or Rust programming.

Even if you lack some skills but believe you're a strong candidate, we encourage you to apply. We value diverse backgrounds and potential.

Benefits

  • Flexible time-off policy.
  • Challenging engineering work from day one, with minimal bureaucracy.
  • Dedicated Employee Experience team.
  • Competitive salary + equity.
  • Parental leave and flexible return options.
  • Flexible working hours and locations.
  • Focus days with ergonomic workspace.
  • Relocation support.
  • Modern offices in Christchurch and Auckland with amenities.
  • Team events, monthly lunches, celebrations.
  • Commitment to sustainability and environmental impact.
  • Learning and development opportunities.
  • Quarterly team weeks and annual offsites.

Relocation assistance is available for those moving to Christchurch.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.