Enable job alerts via email!
Boost your interview chances
Create a job specific, tailored resume for higher success rate.
An innovative company is looking for an AI Inference Engineer to enhance their team. This role focuses on the large-scale deployment of machine learning models for real-time inference. You will develop APIs for AI inference, optimize the inference stack, and ensure system reliability. The ideal candidate will have a strong background in Python and C++, along with experience in ML systems and deep learning frameworks. This position offers a competitive salary and comprehensive benefits, making it an exciting opportunity for those passionate about AI and machine learning.
We are seeking an AI Inference Engineer to join our expanding team. Our current technology stack includes Python, C++, TensorRT-LLM, and Kubernetes. This role offers the opportunity to work on large-scale deployment of machine learning models for real-time inference.
The compensation range for this role is $190,000 - $240,000. Additional benefits include equity, comprehensive health insurance, dental, vision, and a 401(k) plan.