Enable job alerts via email!

Site Reliability Engineer

Aetion

Canada

Remote

CAD 80,000 - 120,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a forward-thinking company as a Site Reliability Engineer, where you will play a crucial role in managing cloud-based infrastructure and enhancing operational efficiency. This position offers the opportunity to collaborate with a talented team, mentor colleagues, and work on innovative projects that drive healthcare solutions. You will be instrumental in ensuring system reliability and scalability while fostering a culture of continuous improvement and excellence. If you are passionate about technology and eager to make a significant impact in the healthcare sector, this role is perfect for you.

Benefits

Daily in-office lunch stipend
Sabbatical opportunity after five years
Professional development opportunities
Comprehensive private health coverage
Peer & company recognition programs
Monthly educational lunch & learn

Qualifications

  • 5+ years in Systems Engineering or DevOps with cloud architecture expertise.
  • Experience with IaC tools like Terraform and CI/CD practices.

Responsibilities

  • Monitor and resolve infrastructure issues to ensure system reliability.
  • Streamline automation processes to improve operational efficiency.

Skills

Cloud Infrastructure Design
Service Mindset
CI/CD Pipelines
Communication Skills
Problem Solving
Flexibility
Collaboration

Education

Bachelor's Degree in Computer Science

Tools

AWS
Kubernetes
Docker
Terraform
Ansible
GitHub Actions
Pulumi

Job description

WELCOME to Aetion! We are one of the country’s leading science-driven technology companies using real-world evidence to provide innovative healthcare solutions. Our Aetion Evidence Platform is used to evaluate the safety, effectiveness and value of medications, delivering better outcomes to patients, medical professionals, and clients. We’ve partnered with top biopharma companies and are backed by leading venture capital firms to help increase our medical research and expand our product line. Aetion is headquartered in the US and has expanded throughout Europe with a Technology Hub in Barcelona.

Aetion and Aetion’s leadership are recipients of several prestigious awards:

  • Daily in-office lunch stipend (and a fully stocked kitchen!)
  • Sabbatical opportunity after five years of employment
  • Commitment to professional development opportunities with access to Skillsoft learning experience platform
  • Employee-led initiatives including annual company-wide innovation day & DEI resource groups
  • Comprehensive private health coverage w/ out-of-network reimbursements options.
  • Peer & company recognition programs
  • Monthly educational lunch & learn
Why join Aetion's Tech Team?
  • You’ll collaborate with other engineering leaders on all matters that impact the Engineering team, including resourcing and building technology/product vision.
  • You’ll have the opportunity to coach and mentor colleagues, including code reviews, higher-level software design, and direct management.
  • The team works on a technical stack which includes both cloud and on-premise deployments, big-data ingestion and analytics, distributed systems, and algorithmic complexity.
DESCRIPTION:

As Site Reliability Engineer, you will be a critical member of Aetion’s engineering organization. As Aetion’s products continue to scale, we are writing the next chapter in our ability to innovate and operate with efficiency and maturity. You will be an instrumental part of us continuing down this path.

As a member of the Site Reliability Engineering team, you will own Aetion’s infrastructure which is cloud-based, containerized, and managed through Infrastructure as Code (IaC). You will be supporting day to day operations (such as provisioning infrastructure and providing production support) and helping with engineering projects (maturing our infrastructure and automation). The team has great things in place but also has strong ambitions about what else we want to achieve.

RESPONSIBILITIES:

Your duties will include, but are not limited to:

  • Perform delivery and production support tasks, including monitoring, troubleshooting, and resolving infrastructure and application issues to ensure system reliability and uptime.
  • Continually streamline automation and processes to improve operational maturity and efficiency.
  • Provision, configure, and maintain Aetion’s infrastructure with a focus on simplicity, innovation, automation, reliability, scalability, security, cost-effectiveness, and ease of support.
  • Build and maintain Aetion’s development and deployment pipelines, supporting CI/CD and long-term-stable testing and release cycles.
  • Collaborate with cross-functional teams to provide timely and effective production support, ensuring a seamless experience for end-users and internal stakeholders.
  • Develop automation frameworks to support other development teams and reduce manual intervention in operational tasks.
  • Effectively contribute to complex engineering projects while balancing operational responsibilities.
QUALIFICATIONS:
Required:
Education:
  • Bachelor's Degree in Computer Science, Engineering, or a related field, or equivalent experience.
Experience:
  • Systems Engineering/DevOps/Distributed Systems: Minimum 5+ years of experience in Systems Engineering, DevOps, or developing distributed systems with strong knowledge of cloud architecture, particularly AWS and Kubernetes.
  • Infrastructure as Code (IaC) & CI/CD Tools: 3+ years with tools and languages such as Pulumi, Terraform, Ansible, or GitHub Actions (GHA).
  • Cloud Providers: 5+ years with cloud platforms (AWS required, GCP nice to have).
  • Containerized Workloads: 3+ years with Docker and Kubernetes. Experience with orchestration tools such as Concord and Karpenter is a plus.
  • Security Engineering: Experience with proactive threat prevention, incident response, and implementing compliance programs.
  • Databases: Experience working with SQL databases, big data platforms, and supporting big data pipelines.
  • Linux Systems: In-depth experience solving complex issues on Linux systems and/or within the JVM.
Skills:
  • Empathy for end-users and a strong service mindset when supporting day-to-day operations, ensuring a positive experience for stakeholders.
  • Strong understanding of cloud infrastructure design with a focus on security, reliability, and scalability.
  • Detailed knowledge of configuration, implementation, and maintenance of CI/CD pipelines and tooling (e.g., GitHub Actions or Jenkins).
  • Strong English language skills, both written and verbal, with the ability to communicate effectively across teams (e.g., commercial and science/analytics teams).
  • Ability to prioritize, communicate effectively, design for repeatability and scalability, exude ownership, and dig beneath the hood with technology.
  • Flexibility to improve existing systems and innovate on new capabilities.
  • Collaborative, open-minded, and able to quickly grasp complex concepts to contribute to the team’s overall effectiveness.
Preferred:

Experience with:

  • Debugging, tracing, and profiling Java applications.
  • Provisioning and operating SQL databases and big data platforms (Spark).
  • The healthcare or banking industry, or other fields where information security is a concern.
  • Privacy (HIPAA, GDPR) and security (SOC 2, Hitrust) certifications.
  • Lean and agile ways of working.
About Aetion’s Site Reliability Engineering Team:

Aetion is an Equal Opportunity Employer. Aetion is committed to being an employer of choice, not just a good place to work, but a great and inclusive place to work. To that end, we strive to recruit and maintain a workforce that meaningfully represents the diverse and culturally rich communities that we serve. Qualified applicants will receive consideration for employment without regard to their race, color, religion, national origin, sex, sexual orientation, gender identity, protected veteran status or disabled status or genetic information.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Site Reliability Engineer

EIT Professionals Corp

Remote

CAD 80 000 - 120 000

2 days ago
Be an early applicant

Intermediate Site Reliability Engineer, Foundations

GitLab

Remote

USD 103 000 - 222 000

12 days ago

Site Reliability Engineer

Blink AI

Remote

CAD 70 000 - 110 000

3 days ago
Be an early applicant

Site Reliability Engineer

Insight Global

Remote

CAD 100 000 - 125 000

9 days ago

Staff Infrastructure Site Reliability Engineer

Remoteworldwide

Remote

CAD 90 000 - 150 000

5 days ago
Be an early applicant

Site Reliability Engineer

Dayforce

Remote

CAD 70 000 - 110 000

6 days ago
Be an early applicant

Software Engineer, Site Reliability (Senior or Staff)

BioRender

Remote

CAD 80 000 - 150 000

9 days ago

Senior Site Reliability Engineer - (Remote - Canada)

Jobgether

Remote

CAD 80 000 - 120 000

23 days ago

Site Reliability Engineer

SmartSimple Software

Remote

CAD 60 000 - 100 000

28 days ago