Enable job alerts via email!
Boost your interview chances
Create a job specific, tailored resume for higher success rate.
An innovative firm is seeking a skilled engineer to enhance its AI training and inference infrastructure. This role involves building and evolving network systems that connect training accelerators, ensuring they meet performance and reliability standards. As part of a multi-disciplinary team, you'll tackle challenges related to large-scale training systems, working closely with performance engineers to implement robust solutions. If you have a passion for cutting-edge technology and a background in networking and system software, this opportunity offers a chance to contribute to groundbreaking advancements in AI and immersive experiences.
Meta's AI Training and Inference Infrastructure is growing exponentially to support ever increasing uses cases of AI. We need to build and evolve our network infrastructure that connects myriads of training accelerators like GPUs together. In addition, we need to ensure that the network is running smoothly and meets stringent performance, availability and reliability requirements of RDMA workloads that expects a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across the stack: network fabric and host networking, communication libraries and scheduling infrastructure.
Responsibilities:
Minimum Qualifications:
Preferred Qualifications:
About Meta:
Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics.