Enable job alerts via email!

Senior AI Performance Engineer

Crusoe

San Francisco (CA)

Hybrid

USD 183,000 - 210,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a pioneering company at the forefront of AI-first cloud infrastructure. As a Senior AI Performance Engineer, you'll optimize inference engines and enhance scalable AI infrastructure, directly impacting performance and efficiency. This role offers the chance to develop cutting-edge solutions in a hybrid work environment, collaborating with top-tier AI researchers and engineers. If you're passionate about accelerating AI workloads and pushing the boundaries of high-performance computing, this is your opportunity to make a meaningful impact in a sustainable technology landscape.

Benefits

Health insurance options
401(k) with 100% match
Paid Parental Leave
Generous paid time off
Tuition reimbursement
Cell phone reimbursement
Company-paid commuter benefit
Restricted Stock Units
Paid life insurance
Subscription to the Calm app

Qualifications

  • Expertise in CUDA or OpenCL for developing optimization engines.
  • Strong programming skills in Python for AI tasks.

Responsibilities

  • Optimize inference engines for maximum efficiency and scalability.
  • Develop CUDA kernels to enhance deep learning workloads.

Skills

CUDA
Python
Deep Learning Frameworks
Performance Optimization
CPU & GPU Architecture

Job description

Crusoe is building the World’s Favorite AI-first Cloud infrastructure company. We’re pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to power their most advanced AI applications. Crusoe is redefining AI cloud infrastructure, with a mission to align the future of computing with the future of the climate. Our AI platform is recognized as the "gold standard" for reliability and performance. Our data centers are optimized for AI workloads and are powered by clean, renewable energy.

Be part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.

About This Role:
Crusoe Energy is on a mission to align the future of computing with the future of the climate. As a Senior AI Performance Engineer, you will play a pivotal role in optimizing scalable inference engines to enhance performance, efficiency, and speed. Your contributions will directly impact Crusoe’s revenue model, as faster inference translates to greater token throughput and increased efficiency in our AI infrastructure. If you are passionate about accelerating AI workloads, optimizing inference engines, and pushing the boundaries of high-performance computing, this role is for you.

This is a full-time hybrid role based in San Francisco, CA, or Sunnyvale, CA, requiring in-office presence three times a week.

What You’ll Be Working On:

  • Optimize inference engines – Improve inference performance in engines such as VLLM, ensuring maximum efficiency and scalability.

  • Enhance scalable AI infrastructure – Implement optimizations that accelerate AI inference, directly impacting Crusoe’s efficiency and revenue generation.

  • Develop CUDA kernels – Write and deploy CUDA kernels to optimize deep learning workloads, improving computational performance.

  • Conduct performance analysis – Profile and analyze training and inference workloads to identify and resolve bottlenecks.

  • Engage with the AI research community – Track developments in scalable inference, contribute to open-source projects, and publish research to advance the field.

  • Improve onboarding and documentation – Enhance internal documentation and tooling standards to streamline team workflows and training.

  • Collaborate cross-functionally – Work closely with AI researchers, engineers, and infrastructure teams to develop cutting-edge solutions.

What You’ll Bring to the Team:

  • Expertise in CUDA or OpenCL – Demonstrated experience developing CUDA kernels or equivalent technologies.

  • Proficiency in Python – Strong programming skills, particularly in Python, for AI and performance optimization tasks.

  • Experience with deep learning frameworks – Hands-on knowledge of training infrastructure such as PyTorch or TensorFlow.

  • Strong understanding of CPU & GPU architecture – Ability to analyze and optimize performance at the hardware level.

Bonus Points:

  • Zero-to-Hero mindset – Experience taking a project from initial concept to full implementation.

  • Experience with vector instructions – Understanding of SIMD, AVX, or similar vector processing techniques.

  • Graphics shader knowledge – Background in graphics shaders as a proxy for CUDA expertise.

Benefits:

  • Industry competitive pay

  • Restricted Stock Units in a fast-growing, well-funded technology company

  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents

  • Employer contributions to HSA accounts

  • Paid Parental Leave

  • Paid life insurance, short-term and long-term disability

  • Teladoc

  • 401(k) with a 100% match up to 4% of salary

  • Generous paid time off and holiday schedule

  • Cell phone reimbursement

  • Tuition reimbursement

  • Subscription to the Calm app

  • MetLife Legal

  • Company-paid commuter benefit; $100 per pay period

Compensation:
Compensation will be paid in the range of $183,000 - $210,000. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant’s education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior AI Performance Engineer

ZipRecruiter

San Francisco

Hybrid

USD 205,000 - 240,000

10 days ago

Senior Web Performance Engineer - Remote ($150-$250K)

CyberCoders

San Francisco

Remote

USD 150,000 - 250,000

22 days ago

Senior AI Performance Engineer

Crusoe Energy Systems LLC

San Francisco

On-site

USD 183,000 - 210,000

30+ days ago

Senior Performance Engineer

Veeva Systems

Boston

Remote

USD 120,000 - 220,000

Yesterday
Be an early applicant

Senior Performance Engineer

Veeva Systems

Bend

Remote

USD 120,000 - 220,000

Yesterday
Be an early applicant

Senior Performance Engineer

Veeva Systems

Portland

Remote

USD 120,000 - 220,000

Yesterday
Be an early applicant

Senior Performance Engineer

Veeva Systems

Portland

Remote

USD 120,000 - 220,000

Yesterday
Be an early applicant

Senior Interconnection Performance Engineer

174 Power Global

Irvine

Remote

USD 150,000 - 185,000

8 days ago

AI/ML Application Performance Engineer

Cornelis Networks

Chesterbrook

Remote

USD 127,000 - 184,000

11 days ago