Enable job alerts via email!

Machine Learning Engineer, Amazon General Intelligence (AGI)

Lensa

Sunnyvale (CA)

On-site

USD 129,000 - 224,000

Full time

2 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Amazon is seeking a passionate Machine Learning Engineer to develop multi-modal and multi-lingual Large Language Models. You will leverage advanced hardware and vast data resources to create innovative AI solutions aimed at transforming customer experiences. Join our AGI team to drive breakthroughs in Generative AI technology. This opportunity offers significant career growth in a fast-paced environment.

Benefits

Health insurance
401(k) plan
Paid time off
Employee discounts

Qualifications

  • 3+ years of software development experience.
  • 2+ years in architecture of new and existing systems.
  • Experience programming with at least one software language.

Responsibilities

  • Develop and maintain platforms for LLM development and deployment.
  • Collaborate with scientists on data processing and model optimizations.
  • Lead development of novel algorithms and techniques for large-scale AI.

Skills

Machine Learning
Data Processing
Algorithm Development

Education

Master's degree in computer science or equivalent

Job description

Machine Learning Engineer, Amazon General Intelligence (AGI)

Amazon

Machine Learning Engineer, Amazon General Intelligence (AGI)
Sunnyvale, CA

Amazon Development Center U.S., Inc.

Senior Software Development Engineer, Annapurna Labs, Trainium Collectives, Elastic Collectives
Cupertino, CA

Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Amazon.com Services LLC

Sr Software Engineer, Bidding Data & Analytics
Palo Alto, CA

Amazon Data Services, Inc.

Sr Software Development Engineer, OEM Solid State Drives Team
Cupertino, CA

Accellor

Boomi, Integration Architect
Santa Clara, CA

Workato

AI Solutions Architect
Palo Alto, CA

Amazon Web Services, Inc.

Systems Development Eng (AWS Generative AI & ML Servers), AWS Hardware Engineering Accelerators
Cupertino, CA

CoreWeave

Solutions Architect - HPC/AI/ML
Sunnyvale, CA

Arkose Labs

Principal Software Architect
San Mateo, CA

AMAX

Cloud Engineer - GPU Hosting
Fremont, CA

ALTA IT Services

CLOUD ENGINEER - 100% REMOTE FOREVER - W-2 - FED GOV AGENCY
Remote

Amazon

Sr. Machine Learning Engineer, Amazon General Intelligence (AGI)
Sunnyvale, CA

Technical Lead- AI-based Autonomous Driving Systems

Bosch Group

Technical Lead- AI-based Autonomous Driving Systems
Sunnyvale, CA

Nue.io Careers

Solution Architect
San Mateo, CA

Selector Software

Solutions Architect
Santa Clara, CA

Freshworks

Senior Solution Engineer
San Mateo, CA

Etched

Platform Solutions Architect
San Jose, CA

Freshworks

Solutions Architect (Device42)
San Mateo, CA

Qventus

Solutions Architect, Conversational AI & Prompt Engineering
Mountain View, CA

Jobleads-US

Sr. Machine Learning Engineer, Amazon General Intelligence (AGI)
Sunnyvale, CA

The Artificial General Intelligence (AGI) team is looking for a passionate, talented, and inventive Software Development Engineer(SDE)/Machine Learning Engineer(MLE) to play pivotal role in the development of industry-leading multi-modal and multi-lingual Large Language Models (LLM). As our SDE/MLE superstar, you'll have the power to lead the charge in developing mind-blowing algorithms and modeling techniques that will push the boundaries of large model training using cutting-edge hardware like GPUs and AWS Trainium. Your groundbreaking work will directly impact our customers' lives through game-changing products and services powered by your Generative AI breakthroughs!

Get ready to dive into Amazon's vast and diverse data sources and harness the immense power of our large-scale computing resources to turbocharge the development of multi-modal Large Language Models (LLMs) and other awe-inspiring Generative Artificial Intelligence (Gen AI) applications. Your expertise and insights will be invaluable in defining data strategies, enrichment processes, model optimizations, and evaluation methods that will set new standards in the industry!

So, if you're passionate about pushing the limits of AI, thrive in a fast-paced and innovative environment, and are ready to make a lasting impact on the world, this is your chance! Join us on this exhilarating adventure and let's revolutionize AGI together! #MLE

Key job responsibilities

Ability to quickly learn cutting-edge technologies and algorithms in the field of Generative AI to participate in our journey to build the best LLMs.

Responsible for the development and maintenance of key platforms needed for developing, evaluating and deploying LLM for real-world applications.

Work with other team members to investigate design approaches, prototype new technology and evaluate technical feasibility.

Work closely with Applied scientists to process massive data, scale machine learning models while optimizing.

A day in the life

As a SDE/MLE with the AGI team, you will be responsible for leading the development of novel algorithms and modeling techniques to advance the state of the art of large model training using hardware like NVDIA GPUs. Your work will directly impact our customers in the form of products and services that make use of Generative AI innovations. You will leverage Amazon’s heterogeneous data sources and large-scale computing resources to accelerate development with multi-modal Large Language Models (LLMs) and other Generative Artificial Intelligence (Gen AI) applications. As a key player in our team, you'll have a significant influence on our overall strategy, shaping the future direction of AGI at Amazon. You'll be the driving force behind our system architecture and the champion of best practices that will ensure an unparalleled infrastructure of the highest quality. Work in an Agile/Scrum environment to move fast and deliver high quality software.

About the team

Join our AGI team and work at the forefront of AI. Collaborate with top minds pushing boundaries in deep learning, reinforcement learning, and more. Gain valuable experience and accelerate your career growth. This is a unique opportunity to create history and shape the future of artificial intelligence.

Mission of the team: We leverage our hyper-scalable, general-purpose large model training and inference systems to develop and deploy cutting-edge sensory AI foundational models that revolutionize machine perception, interpretation and interaction, with humans and with the physical world.

Basic Qualifications

3+ years of non-internship professional software development experience

2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience

Experience programming with at least one software programming language

Preferred Qualifications

3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience

Master's degree in computer science or equivalent

Experience in techniques like kernel fusion and custom kernels to improve GPU utilization, mixed precision training using lower precision and dynamic loss scaling while leveraging hardware specific mixed precision capabilities and/or demonstrated ability to implement efficient memory management like gradient (activation) checkpointing, gradient accumulation, offloading optimizer states, and smart prefetching.

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits . This position will remain posted until filled. Applicants should apply via our internal or external career site.

We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental operations that enable AI to scale across multiple accelerators & servers. Most of our stack is C/C++ and relatively low level, so solid knowledge of...

AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators and servers that use them. This role is for a software engineer in the Machine Learning Inference Model Enablement team for AWS Neuron at Annapurna Labs. This role is responsible...

Who are we? Amazon Advertising builds and manages systems with high performance and availability. We serve and respond to hundreds of billions of requests annually, and have ambitions to grow that number several orders of magnitude, while maintaining response latencies in the milliseconds and...

You will be a member of the larger HWEngS Storage organization and will lead ideation, design and implementation of software automation projects in the Solid-State Drive (SSD) team. You will work with peers and cross-functional experts to build software services that automate workflows for the...

Job Description At Accellor, we are a trusted digital transformation partner that uses best-of-breed Cloud technology to deliver superior customer engagement and business effectiveness for clients. We’ve created an atmosphere that encourages curiosity, constant learning, and persistence. We...

Job Description About Workato Workato transforms technology complexity into business opportunity. As the leader in enterprise orchestration, Workato helps businesses globally streamline operations by connecting data, processes, applications, and experiences. Its AI-powered platform enables teams to...

Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build the future of the cloud for AI training and inference? Want to do industry leading work delivering continuous price performance improvements in the cloud for AI model training for multi billion variable LLMs? Come...

Job Description CoreWeave is the AI Hyperscaler, delivering a cloud platform of cutting edge services powering the next wave of AI. Our technology provides enterprises and leading AI labs with the most performant, efficient and resilient solutions for accelerated computing. Since 2017, CoreWeave...

Job Description The mission of Arkose Labs is to create an online environment where all consumers are protected from online spam and abuse. Recognized by G2 as the 2025 Leader in Bot Detection and Mitigation, with the highest score in customer satisfaction and largest market presence four quarters...

Job Description *Salary range: $120,000-$170,000 annually* AMAX is seeking a skilled Cloud Engineer with expertise in GPU workloads to join our team. In this role, you will be responsible for designing, deploying, and managing cloud infrastructure specifically tailored for GPU hosting. You will...

Job Description Job Title: Sr. Cloud Maintenance Engineer Location: 100% Remote Type: Contract Compensation: $65-70/HR Security Clearance: Must be able to obtain a Public Trust clearance Description: ALTA IT Services is seeking a highly skilled Sr. Cloud Maintenance Engineer to support the daily...

Sr. Machine Learning Engineer, Amazon General Intelligence (AGI) Job ID: 2782263 | Amazon.com Services LLC - A57 Our Machine Learning training infrastructure (ML Infra) team is responsible for designing, implementing, and optimizing large-scale computing infrastructure that powers our cutting-edge...

Company Description “Invented for Life” drives us at Bosch and our vision of future mobility. Autonomous vehicles will change the way we move and at Bosch we are working on making this future a reality. We are now growing our team to solve some of the hardest automated driving problems, and are...

Job Description About the Role We are seeking an experienced Solution Architect specializing in Configure, Price, Quote (CPQ) solutions with integrated billing, rating usage management, and revenue recognition capabilities. In this role, you will serve as the technical lead throughout the entire...

Job Description Salary: About Us Selector is building an operational intelligence platform for digital infrastructure. By adopting an AI/ML-based analytics approach, the platform provides actionable multi-dimensional insights to network, cloud, and application operators. It enables operations teams...

Job Description Company Description Organizations everywhere struggle under the crushing costs and complexities of “solutions” that promise to simplify their lives. To create a better experience for their customers and employees. To help them grow. Software is a choice that can make or break a...

Job Description Platform Solutions Architect About Etched Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can...

Job Description Company Description Organizations everywhere struggle under the crushing costs and complexities of “solutions” that promise to simplify their lives. To create a better experience for their customers and employees. To help them grow. Software is a choice that can make or break a...

Job Description Qventus is leading the transformation of healthcare operations. We enable hospitals to focus on what matters most: patient care. Our innovative solutions harness the power of machine learning, generative AI, and behavioral science to deliver exceptional outcomes and empower care...

Sr. Machine Learning Engineer, Amazon General Intelligence (AGI) Job ID: 2782263 | Amazon.com Services LLC - A57 Our Machine Learning training infrastructure (ML Infra) team is responsible for designing, implementing, and optimizing large-scale computing infrastructure that powers our cutting-edge...

Machine Learning Engineer, Amazon General Intelligence (AGI)

Description The Artificial General Intelligence (AGI) team is looking for a passionate, talented, and inventive Software Development Engineer(SDE)/Machine Learning Engineer(MLE) to play pivotal role in the development of industry-leading multi-modal and multi-lingual Large Language Models (LLM). As...

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Financial Analyst

NTT Global Data Centers

Santa Clara null

Remote

Remote

USD 107’000 - 156’000

Full time

4 days ago
Be an early applicant

Principal Product Manager Remote - US

Twilio

Oakland null

Remote

Remote

USD 177’000 - 221’000

Full time

5 days ago
Be an early applicant

Software Engineer, Front-End Systems

Stealth AI Startup

San Francisco null

Remote

Remote

USD 150’000 - 250’000

Full time

14 days ago

Product Engineer, Intelligence Interfaces

Stealth AI Startup

San Francisco null

Remote

Remote

USD 150’000 - 250’000

Full time

8 days ago

Product Engineer, Expert Workflows

Stealth AI Startup

Hayward null

Remote

Remote

USD 150’000 - 250’000

Full time

9 days ago

Product Engineer, Expert Workflows

Stealth AI Startup

Fremont null

Remote

Remote

USD 150’000 - 250’000

Full time

9 days ago

Product Engineer, Expert Workflows

Stealth AI Startup

San Francisco null

Remote

Remote

USD 150’000 - 250’000

Full time

14 days ago

Account Director, Federal DoD

Analyticsengineering

Washington null

Remote

Remote

USD 150’000 - 200’000

Full time

2 days ago
Be an early applicant

Account Director, Federal Civilian

Analyticsengineering

Washington null

Remote

Remote

USD 150’000 - 200’000

Full time

2 days ago
Be an early applicant