Join a forward-thinking company as an AI Inference Engineer, where you'll develop APIs for AI inference and optimize large-scale machine learning models. This role offers the chance to work with cutting-edge technologies like Python, C++, and TensorRT-LLM while contributing to a rapidly growing AI-powered search assistant with millions of users. You'll be part of a dynamic team focused on enhancing system reliability and performance, making impactful contributions to real-time inference solutions. If you're passionate about AI and eager to drive innovation, this opportunity is perfect for you.
Our current stack includes Python, C++, TensorRT-LLM, and Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.
The cash compensation range for this role is $190,000 - $240,000.
Since launching the world's first fully functional conversational answer engine over a year ago, we've experienced tremendous growth. Our AI-powered search assistant has 10 million monthly active users as of early 2024, with over 1 million app installations across iOS and Android. In 2023, we served over 500 million queries globally.
We have raised significant funding, including a $73.6 million Series B in January 2024 led by IVP with participation from NVIDIA, Jeff Bezos' fund, NEA, Databricks, and others. We also completed a $62.7 million Series B1 in April 2024, valuing Perplexity at over $1 billion.
Our investor base includes IVP, NEA, NVIDIA, Jeff Bezos, Databricks, Bessemer Venture Partners, and prominent individuals like Elad Gil, Nat Friedman, Naval Ravikant, and Tobi Lutke.
Final offer amounts depend on experience and expertise and may vary from listed ranges.
Compensation includes base salary and equity.
Benefits include comprehensive health, dental, and vision insurance, and a 401(k) plan.