Senior Site Reliability Engineer, Cloud Networking

Carta

Waterloo

On-site

CAD 80,000 - 120,000

Full time

30+ days ago

Job summary

An established industry player is seeking a Senior Site Reliability Engineer to enhance their cloud networking capabilities. In this role, you will build and maintain Kubernetes clusters, ensuring the reliability and performance of applications. You will collaborate with software engineers to design scalable solutions and push boundaries to improve systems as the company expands globally. This position offers the opportunity to work with cutting-edge technologies like AWS, Docker, and Terraform, making a significant impact on the infrastructure that powers the organization. If you are passionate about developing efficient and reliable infrastructure, this is the perfect opportunity for you.

Qualifications

Strong experience with Kubernetes and Docker for container orchestration.
Proficient in Python and familiar with infrastructure best practices.

Responsibilities

Build and maintain Kubernetes clusters, ensuring reliability and performance.
Collaborate with engineers to design scalable infrastructure solutions.

Skills

Kubernetes

Docker

Python

gRPC

Terraform

AWS

Networking

CI/CD

Tools

Terraform

AWS

Docker

Kubernetes

The Company You’ll Join

Carta develops purpose-built software that transforms traditional accounting into a powerful growth engine.

Carta’s world-class fund administration platform supports nearly 7,000 funds and SPVs, and represents nearly $130B in assets under management in venture capital and private equity.

Trusted by more than 40,000 companies, Carta also helps private businesses in over 160 countries manage their cap tables, valuations, taxes, equity programs, compensation, and more.

Together, Carta is setting a new standard as the end-to-end platform for private markets. Our best-in-class solution for fund management seamlessly integrates investor and portfolio company insights via a suite of tools designed from the ground up to support the strategic impact of the fund CFO.

For more information about our offices and culture, check out our Carta careers page.

The Problems You’ll Solve

At Carta, our employees set out on a mission to unlock the power of equity ownership for more people in more places. We believe that the problems we solve today unlock the opportunities of tomorrow. As a Senior Site Reliability Engineer, Cloud Networking, you’ll work to:

Build and guide internal usage of Kubernetes/EKS, including maintaining and monitoring EKS clusters, writing Helm charts, and configuring ingress and gateways.
Build and scale our internal platform offerings (compute, storage and networking services) to ensure the reliability and performance of our applications.
Collaborate with application software engineers (as needed) to guide their designs and ensure they scale to meet Carta’s long-term needs.
Act as an agent of change and push boundaries to incrementally improve our systems as we expand globally.

The Team You’ll Work With

You’ll be joining the Infrastructure Engineering team at Carta. The team is responsible for providing secure, reliable, scalable and performant infrastructure to Carta’s customers and developers.

We are Software and Infrastructure Engineers who specialize in cloud computing, networking, systems design and architecture, storage, real-time data telemetry, and the associated automation, tooling and processes. We possess a breadth and depth of knowledge about Carta’s infrastructure and industry-wide best practices that translates into leverage for Carta’s business.

About You

You are excited by the idea of developing scalable, reliable and efficient infrastructure that powers the entire company. We’re looking for strong communicators who enjoy collaborating to solve complex problems. Familiarity with infrastructure best practices on performance, reliability and security and their associated tools is appreciated.

Our stack is Python, Java, Terraform, gRPC, Docker, Kubernetes, Postgres, running on AWS. Come join us!

Containers: Experience with Docker and Kubernetes or other container orchestration services.
API Services: Experience in designing, deploying, and maintaining API services, with a strong understanding of gRPC/Protobuf, Thrift, Avro or GraphQL.
Experience in proxy and service mesh technologies such as Kong, Istio, Envoy, or Linkerd.
Cloud Platforms: Extensive experience with cloud services such as AWS, Google Cloud Platform, or Azure, including services like EC2, S3, RDS, and Lambda.
Infrastructure as Code (IaC): Proficient in using tools such as Terraform, Ansible, or CloudFormation for managing and provisioning cloud infrastructure.
Networking: Experience with networking concepts and tools, including Container Network Interface (CNI) and network policy implementations. Experience with proxies and service mesh is a big plus.
Software Development: Proficiency in Python, with the ability to write efficient, maintainable, and scalable code.
Experience operating CI/CD and its associated best practices is also appreciated though not essential.