Unity Health Toronto seeks a skilled System Administrator for its Data Science and Advanced Analytics team. The role involves supporting technologies for data scientists and engineers, ensuring systems operate optimally, and adopting innovative technologies in AI/ML. Candidates should have a Bachelor's degree, extensive Linux experience, and problem-solving skills.
Are you ready to be at the forefront of healthcare innovation? Since 2020, the Data Science and Advanced Analytics (DSAA) team at Unity Health Toronto (UHT) has been developing and implementing machine learning solutions – transforming complex healthcare data into actionable insights that drive better decision-making, enhance hospital efficiency, and improve patient care.
The DSAA team is looking for a skilled System Administrator to join their infrastructure team and support technologies primarily used by their data scientists, software engineers, and data engineers. As a System Administrator, you’ll work with a diverse range of tools and platforms – including HashiCorp Nomad, XNAT, event streaming technologies like Kafka, Posit Teams, Prometheus, LXD, Nix, Keycloak, HashiCorp Vault, Postgres databases, data federation tools like Data Virtuality, and specialized waveform platforms like Atrium DB.
On a typical day, you may be involved with the following:
Provide day-to-day maintenance on Linux systems, e.g., authentication services, security services, and network drives;
Perform account administration, including creating, disabling, and expiring user accounts;
Configure bare-metal, VM, and container infrastructure;
Work with data scientists and product developers to understand their needs and requirements for implementing or improving underlying host systems;
Monitor systems to ensure they are performing as required, checking their logs for issues and errors;
Manage and monitor containers and container orchestration systems to ensure they are online and performing as required;
Troubleshoot problems that arise with the Linux systems, including authentication systems, container-based services, web servers, hardware, and networking;
Perform preventative maintenance on servers when required;
Adopt emerging technologies in the AI/ML space to support data science;
Plan and automate deployment of hardware and software infrastructure;
Maintain active involvement in designated activities of new projects going live, e.g., the testing process;
Work with other staff in the department to put together test cases and user acceptance testing, and implement the results in a timely manner;
Use and support version control systems (GitLab) and infrastructure as code (e.g., with Terraform, Ansible);
Respond to downtime incidents on an on-call basis.
Qualifications
Completion of a recognized Bachelor’s degree or a diploma in Computer Science, networking, or a related field required;
Five (5) years’ experience in the field required; Linux certification, preferably GCUX, required;
In-depth knowledge of Debian/Ubuntu environments required;
In-depth knowledge of container and reproducibility solutions (e.g., LXD, Docker/Podman, Kubernetes/Nomad, and Nix) required;
Demonstrated flexibility and ability to adapt to change required;
Demonstrated strong analytical, organizational, conceptual, and decision-making skills with the ability to work within a team environment required;
Familiarity with data science, product development, and deployment teams and technologies (in particular R and Python and their respective communities and ecosystems) is an asset;
High familiarity with monitoring technologies such as Netdata, Prometheus, GlitchTip, and Grafana;
Specializations in or experience with cloud development/deployment, medical imaging, or high-frequency data (waveform) infrastructure is an asset;
High familiarity with version control systems (GitLab) and critical extensions thereof (e.g., MLflow, CI/CD) is an asset;
Demonstrated excellent verbal and written communication skills required;
Ability to handle situations involving unplanned outages required;
Well-developed problem-solving skills required;
Demonstrated commitment to continuous professional learning required;
Experience in an on-prem or hybrid (on-prem/cloud) corporate environment;
Experience with HPC or cloud computing is an asset.
Unity Health Toronto is committed to creating an accessible and inclusive organization. We strive to provide a recruitment process that is barrier-free and in compliance with the Accessibility for Ontarians with Disabilities Act (AODA) and the Ontario Human Rights Code. We understand that you may require an accommodation at any stage of the recruitment process. When you are contacted, please inform the Talent Acquisition Specialist and we will work with you to meet your accommodation needs. We want to emphasize that all accommodation requests are handled with the utmost confidentiality, respecting your privacy and dignity.