Enable job alerts via email!

Lead Machine Learning Engineer, Performance and Scalability, Generative AI

Adobe Inc.

California, San Jose (MO, CA)

On-site

USD 162,000 - 302,000

Full time

5 days ago

Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a Lead Engineer to enhance the performance and scalability of their Generative AI systems. This pivotal role involves optimizing high-performance AI pipelines to support millions of users globally. You will collaborate with machine learning researchers and engineers to ensure efficient deployment and monitoring of generative AI models. Join this innovative team to shape the future of creativity and make a significant impact on digital experiences for users worldwide. If you have a passion for cutting-edge technology and a desire to drive change, this opportunity is perfect for you.

Benefits

Health insurance

401(k) matching

Paid time off

Remote work options

Professional development

Wellness programs

Employee discounts

Flexible work hours

Stock options

Commuter benefits

Qualifications

8+ years in building high-performance ML infrastructure.
Strong programming skills in Python and C++.
Experience with cloud environments and ML model deployment.

Responsibilities

Architect and optimize ML pipelines for scalable inference.
Develop high-throughput serving pipelines for AI models.
Collaborate with teams to transition models to production.

Skills

Python

C++

ML infrastructure

GPU orchestration

Cloud-native architectures

Performance optimization

Education

MS in Computer Science

PhD in Computer Science

Tools

AWS

Kubernetes

Ray

ONNX

TensorRT

Our Company

Changing the world through digital experiences is what Adobe's all about. We give everyone-from emerging artists to global brands-everything they need to design and deliver exceptional digital experiences! We're passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen.

We're on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours!

About the Role

Adobe Firefly is seeking a Lead Engineer to focus on Performance and Scalability for our Generative AI systems, powering flagship products like Photoshop, Illustrator, Express, and firefly.adobe.com. In this senior role, you will be responsible foroptimizing high-performance, scalable AI pipelines, supporting millions of users worldwide.

You will work closely with machine learning researchers, infrastructure engineers, and applied scientists to ensure that generative AI models are efficiently deployed, scaled, and monitored, without directly implementing model training, quantization, or tensor parallelism.

Responsibilities

Architect and optimize ML pipelines to support scalable inference and model deployment on cloud-based GPU infrastructure (e.g., AWS P5 instances).

Develop and maintain high-throughput serving pipelines for generative AI models, ensuring low-latency, high-performance execution.

Enable model serving optimizations by designing systems that support tensor parallelism, quantization, distillation, and caching, in collaboration with ML research teams.

Develop automated monitoring and profiling tools to track system efficiency, detect performance regressions, and optimize resource utilization.

Optimize GPU resource allocation and orchestration across cloud-based ML workloads.

Integrate scalable load testing frameworks to validate model inference performance under high-traffic conditions.

Collaborate with infrastructure and applied ML teams to transition models from experimentation to production-ready, cloud-optimized deployments.

Establish standard methodologies for scaling and cloud-native ML architectures, ensuring efficient deployment across multi-region cloud environments.

Qualifications

8+ years of proven track recordin building high-performance ML infrastructure and scalable AI systems.

MS, or PHD in computer science or related field.

Strong programming skills in Python and C++, with expertise in building ML pipelines and model deployment infrastructure.

Experience deploying large-scale ML models in cloud environments, including AWS GPU instances, Kubernetes, Ray, or similar.

Experience with model conversion and optimization frameworks like ONNX and TensorRT, as well as AOT compilation techniques.

Experience with cloud-native architectures, autoscaling strategies, and fault-tolerant machine learning systems.

Proficiency in GPU orchestration, CUDA, and accelerated inference techniques.

Hands-on experience with profiling tools (e.g., Nsight, PyTorch Profiler, perf) for system performance analysis.

Ability to work in a fast-paced, startup-like environment with multi-functional teams.

Why Join Us?

Firefly is Adobe's groundbreaking family of AI models, crafted to transform content creation in our products. Join us to shape the future of creativity and enhance pipelines for millions of users in Photoshop, Illustrator, and Premiere Pro. This is a highly strategic and visible role, where you'll have the chance to create a significant impact on the future of generative AI at Adobe.

#FireflyGenAI

Our compensation reflects the cost of labor across several U.S. geographic markets, and we pay differently based on those defined markets. The U.S. pay range for this positionis $162,000 -- $301,200 annually. Paywithin this range varies by work locationand may also depend on job-related knowledge, skills,and experience. Your recruiter can share more about the specific salary range for the job location during the hiring process.

At Adobe, for sales roles starting salaries are expressed as total target compensation (TTC = base + commission), and short-term incentives are in the form of sales commission plans. Non-sales roles starting salaries are expressed as base salary and short-term incentives are in the form of the Annual Incentive Plan (AIP).

In addition, certain roles may be eligible for long-term incentives in the form of a new hire equity award.

State-Specific Notices:

California:

Fair Chance Ordinances

Adobe will consider qualified applicants with arrest or conviction records for employment in accordance with state and local laws and "fair chance" ordinances.

Colorado:

Application Window Notice

If this role is open to hiring in Colorado (as listed on the job posting), the application window will remain open until at least the date and time stated above in Pacific Time, in compliance with Colorado pay transparency regulations. If this role does not have Colorado listed as a hiring location, no specific application window applies, and the posting may close at any time based on hiring needs.

Massachusetts:

Massachusetts Legal Notice

It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.

Adobe is proud to be anEqual Employment Opportunityemployer. We do not discriminate based on gender, race or color, ethnicity or national origin, age, disability, religion, sexual orientation, gender identity or expression, veteran status, or any other applicable characteristics protected by law.Learn more.

Adobe aims to make Adobe.com accessible to any and all users. If you have a disability or special need that requires accommodation to navigate our website or complete the application process, emailaccommodations@adobe.comor call (408) 536-3015.

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.