Enable job alerts via email!

Software Development Engineer, EC2 Instance Networking

Amazon Web Services (AWS)

Sunnyvale (CA)

On-site

USD 129,000 - 224,000

Full time

3 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a pioneering team at Amazon Web Services as a Software Development Engineer, focused on developing cutting-edge networking solutions for AI training clusters. Utilize your C/C++ skills and knowledge of RDMA technologies to tackle challenging problems in distributed computing while benefiting from a culture of mentorship and career growth initiatives.

Benefits

Flexible working culture
Mentorship programs
Diversity and inclusion initiatives

Qualifications

  • 3+ years of software development experience.
  • Experience with RDMA technologies and Linux kernel development.
  • Strong problem-solving skills in complex distributed environments.

Responsibilities

  • Design and develop high-performance networking software solutions.
  • Optimize collective communication patterns for distributed workloads.
  • Collaborate on architecture decisions for next-gen AI infrastructure.

Skills

C/C++
RDMA technologies
Linux networking
distributed systems

Education

Bachelor's degree in Computer Science

Tools

SmartNIC programming

Job description

Software Development Engineer, EC2 Instance Networking
Software Development Engineer, EC2 Instance Networking

1 week ago Be among the first 25 applicants

Get AI-powered advice on this job and more exclusive features.

Description

Join our team building the scale-out networking backbone that powers the world's largest AI training clusters. We're developing high-performance RDMA and RoCE solutions that enable distributed training of trillion-parameter models across thousands of compute nodes on AWS infrastructure.

Description

Join our team building the scale-out networking backbone that powers the world's largest AI training clusters. We're developing high-performance RDMA and RoCE solutions that enable distributed training of trillion-parameter models across thousands of compute nodes on AWS infrastructure.

Our team is responsible for creating the networking software that connects massive AI accelerator clusters, focusing on SmartNIC integration, collective communication optimization, and ultra-high-bandwidth inter-rack connectivity. You'll be working at the intersection of cloud infrastructure and state-of-the-art AI hardware to solve some of the most challenging networking problems in distributed computing.

Key job responsibilities

  • Design and develop high-performance networking software solutions utilizing RDMA and RoCE technologies for large-scale AI clusters
  • Integrate SmartNIC acceleration hardware with EC2 control plane systems and APIs
  • Implement and optimize collective communication patterns for distributed AI training workloads
  • Develop comprehensive performance monitoring, metrics collection, and benchmarking tools for high-bandwidth cluster interconnects
  • Create automated testing frameworks and stress testing tools for multi-rack distributed systems
  • Debug complex system-level issues across hardware acceleration, kernel networking, and distributed applications
  • Collaborate on architecture decisions for next-generation scale-out AI infrastructure
  • Participate in design reviews, code reviews, and technical documentation

About The Team

Utility Computing (UC)

AWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Internet of Things (Iot), Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for customers who require specialized security solutions for their cloud services.

Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.

Diverse Experiences

AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.

About AWS

Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.

Inclusive Team Culture

Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (diversity) conferences, inspire us to never stop embracing our uniqueness.

Work/Life Balance

We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.

Mentorship & Career Growth

We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

Basic Qualifications

  • 3+ years of non-internship professional software development experience
  • 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
  • Strong programming skills in C/C++ with focus on high-performance systems
  • Experience with RDMA technologies and RoCE implementations
  • Familiarity with collective communication libraries (NCCL, RCCL, OneCCL, MPI)
  • Experience with Linux networking, kernel development, and distributed systems
  • Understanding of high-performance computing clusters and parallel programming

Preferred Qualifications

  • 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
  • Bachelor's degree in computer science or equivalent
  • Experience with SmartNIC programming and network acceleration hardware APIs
  • Knowledge of large-scale AI training infrastructure and multi-rack cluster networking
  • Experience with performance optimization, benchmarking, and system-level debugging
  • Understanding of AI accelerator architectures and scale-out communication patterns
  • Experience with cloud infrastructure integration and virtualization technologies
  • Bachelor's degree in Computer Science, Computer Engineering, or related field
  • Strong problem-solving skills and experience with complex distributed systems
  • Proficiency in design and analysis of algorithms and data structures
  • Linux operating system knowledge
  • In-depth knowledge of TCP/IP
  • Kernel or embedded development, particularly Linux kernel
  • Strong knowledge of Computer Science fundamentals in data structures, algorithm design, problem solving, and complexity analysis
  • Knowledge of, at least, one modern programming language such as C, C++, rust, Python or Perl
  • Experience developing complex software systems that have been successfully delivered to customers
  • Knowledge of professional software engineering practices & best practices for the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
  • Ability to take a project from scoping requirements through actual launch of the project
  • Experience in communicating with users, other technical teams, and management to collect requirements, describe software product features, and technical designs
  • Experiencing mentoring junior software development engineers and driving engineering excellence

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site.


Company - Amazon Development Center U.S., Inc.

Job ID: A3004162

Seniority level
  • Seniority level
    Mid-Senior level
Employment type
  • Employment type
    Full-time
Job function
  • Job function
    Information Technology, Consulting, and Engineering
  • Industries
    IT Services and IT Consulting

Referrals increase your chances of interviewing at Amazon Web Services (AWS) by 2x

Sign in to set job alerts for “Software Engineer” roles.
Software Engineer, AI Intern (Summer 2025)

Mountain View, CA $125,400.00-$188,100.00 2 weeks ago

Software Engineer, AI Platform - New Grad
New Grads 2025 - Software Engineer, Algorithm

San Jose, CA $120,000.00-$165,000.00 9 months ago

Menlo Park, CA $56.25-$173,000.00 3 weeks ago

New Grads 2025 - General Software Engineer

San Jose, CA $120,000.00-$165,000.00 4 months ago

Software Engineer 4 - TV & Web Player Platform
Software Engineer I (Intern) United States

San Jose, CA $44,000.00-$130,000.00 1 day ago

Full Stack Software Engineer - Post-training

Palo Alto, CA $180,000.00-$440,000.00 2 weeks ago

San Jose, CA $133,900.00-$242,000.00 1 week ago

Software Engineer - Intern (Summer 2025)

San Jose, CA $3,000.00-$4,000.00 8 months ago

Full Stack Software Engineer (L4), Product Localization Engineering

San Jose, CA $113,400.00-$206,300.00 2 weeks ago

Sunnyvale, CA $167,000.00-$185,500.00 2 hours ago

Santa Clara, CA $150,000.00-$175,000.00 7 months ago

Frontend Software Engineer - University Graduate 2025

San Mateo, CA $120,000.00-$280,000.00 2 weeks ago

Software Engineer(s) - New Grad (Fall 2025 Graduation)
eCommerce Full Stack Developer (React / Shopify) - On Site
Software Engineer, Google Distributed Cloud

Sunnyvale, CA $141,000.00-$202,000.00 2 weeks ago

Sunnyvale, CA $117,000.00-$234,000.00 5 days ago

New College Grad Software Engineer, Software Engineering Development (Apps)

San Jose, CA $92,735.00-$131,300.00 1 week ago

(General Hire) Software Engineer Graduate (Advertisement Team) - 2025 Start (BS/MS)

San Jose, CA $113,500.00-$250,000.00 5 hours ago

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Sr. Software Engineer, EC2 Instance Networking

Amazon Web Services (AWS)

Sunnyvale

On-site

USD 151,000 - 262,000

3 days ago
Be an early applicant

Software Development Engineer, EC2 Instance Networking

Amazon

Sunnyvale

On-site

USD 129,000 - 224,000

19 days ago

Sr. Software Engineer, EC2 Instance Networking

Amazon

Sunnyvale

On-site

USD 120,000 - 180,000

30+ days ago