Job Search and Career Advice Platform

Enable job alerts via email!

Applied Scientist, AI Risk

Armilla AI

Toronto

On-site

CAD 80,000 - 120,000

Full time

Yesterday
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A cutting-edge startup is seeking an exceptional Applied Scientist focused on AI Risk in Toronto, Ontario. The role involves advancing understanding of AI systems and developing risk assessment frameworks. Candidates should possess an advanced degree, a track record of applied research, and expertise in machine learning. Join a dynamic team to influence how the insurance industry evaluates AI risk, combining technical challenges with impactful work at the intersection of AI and insurance.

Benefits

Professional growth opportunities
Startup culture and agility
Pioneering a new frontier in AI

Qualifications

  • Demonstrated research experience in AI/ML.
  • Experience building AI/ML systems from conception to deployment.
  • Hands-on experience with model evaluation, testing, and validation.

Responsibilities

  • Conduct deep technical research into AI model behavior.
  • Design and develop AI systems that assess safety and reliability.
  • Build production-grade tooling for automated AI risk assessment.

Skills

Machine Learning fundamentals
Python programming
Model evaluation and testing
Intellectual curiosity
Problem-solving

Education

Advanced degree in Computer Science, Machine Learning, Statistics, or related field

Tools

PyTorch
TensorFlow
JAX
NumPy
Pandas
Scikit-learn
Job description

Armilla AI is a cutting-edge startup at the intersection of artificial intelligence and insurance. Based in Toronto, Ontario, Canada, we're building innovative solutions to manage, underwrite, and insure the rapidly evolving risks associated with AI systems. We're a dynamic team passionate about pioneering the future of AI risk management.

The Role

We're seeking an exceptional Applied Scientist who bridges the worlds of deep AI research and practical, production-grade applications. As our Applied Scientist, AI Risk, you'll be instrumental in both advancing our understanding of AI systems and translating that knowledge into robust risk assessment and evaluation frameworks. This isn't just about building models— it's about understanding their failure modes, vulnerabilities, and real-world reliability. You'll develop AI systems that evaluate other AI systems, creating the next generation of automated risk assessment tools. You'll be shaping how the insurance industry evaluates and prices AI risk.

Role Responsibilities
  • Conduct deep technical research into AI model behavior, failure modes, and edge cases, with a focus on practical risk assessment as well as deep research.
  • Design and develop AI systems—including specialized models, agents, and automated evaluation pipelines—that assess the safety, reliability, and risk profiles of other AI systems.
  • Build production-grade tooling and platforms for automated AI risk assessment, model testing, and continuous monitoring.
  • Evaluate diverse AI systems—from traditional ML models to Large Language Models and multimodal systems—identifying insurable risks, vulnerabilities, and potential failure scenarios.
  • Develop novel evaluation methodologies and metrics that capture AI-specific risks such as adversarial vulnerabilities, distribution shift, hallucinations, and behavioral misalignment.
  • Collaborate closely with our underwriting and actuarial teams to translate technical findings into actionable risk insights and pricing signals.
  • Stay at the forefront of AI research and safety literature, rapidly prototyping and validating new risk assessment techniques.
  • Write clean, maintainable, and well-tested code that scales from research prototypes to production systems.
  • Communicate complex technical concepts clearly to diverse stakeholders, from engineers to underwriters to executive leadership.
What We're Looking For
  • Advanced degree (Master's or PhD) in Computer Science, Machine Learning, Statistics, or related field, with demonstrated research experience in AI/ML.
  • Strong track record of applied research—you've published, contributed to open source, or shipped ML products that had real-world impact beyond academic settings.
  • Deep technical expertise in machine learning fundamentals, including both classical ML and modern deep learning approaches.
  • Experience building AI/ML systems from conception to deployment, not just running evaluations on existing models.
  • Hands‑on experience with model evaluation, testing, and validation—you think critically about where models fail, not just where they succeed.
  • Solid software engineering skills with expertise in Python and experience with ML frameworks (PyTorch, TensorFlow, JAX) and scientific computing libraries (NumPy, Pandas, Scikit‑learn).
  • Experience with LLMs and generative AI, including familiarity with their unique risks, evaluation challenges, and safety considerations.
  • Background in AI safety, robustness, interpretability, or adversarial ML is a significant asset.
  • Ability to work both independently on deep technical problems and collaboratively in a fast‑paced startup environment.
  • Intellectual curiosity about the "other side"—not just building AI, but understanding its risks, limitations, and societal implications.
  • Strong problem‑solving abilities and attention to detail, with a pragmatic approach to balancing research rigor with business needs.
What's In It For You
  • Pioneering a New Frontier: You'll be at the forefront of an emerging field, defining how AI systems are evaluated, understood, and insured at scale.
  • Dual‑Sided Impact: Work on both the technical frontier of AI evaluation and the practical challenge of real‑world risk assessment—a rare combination.
  • Meta‑AI Challenge: Tackle the fascinating problem of building AI systems that understand and evaluate other AI systems.
  • Impactful Work: Your research and tooling will directly shape how AI risk is quantified and managed across industries.
  • Startup Agility: Enjoy the fast‑paced, innovative, and collaborative culture of a growing startup where your ideas can quickly become reality.
  • Professional Growth: Unparalleled opportunities to develop expertise at the intersection of AI research, risk management, and insurance alongside deeply experienced AI and industry experts.
  • Technical Freedom: Latitude to pursue novel research directions and evaluation approaches that advance both the field and our business.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.