
Activez les alertes d’offres d’emploi par e-mail !
Générez un CV personnalisé en quelques minutes
Décrochez un entretien et gagnez plus. En savoir plus
A cutting-edge AI company in Île-de-France seeks a Distributed Systems Software Engineer to architect and scale their accelerator systems. Responsibilities include developing high-performance libraries and implementing resource-sharing protocols. Candidates should have a Master's or PhD in Computer Science and deep knowledge in distributed environments. This position offers competitive compensation, stock options, relocation benefits, and a multicultural working environment.
Arago is an AI and computer hardware company whose mission is to drive the course of history forward. We do so by accelerating breakthroughs at the intersection of AI and semiconductors.
Founded in 2024 by AI researchers and physicists with deep expertise in photonics, electronics, software, mathematics, and machine learning, Arago brings together a lean team of engineers and scientists from the world’s top companies and research labs.
Composed of nine nationalities and operating from hubs in France, North America, and Israel, we believe in great science and fast achievements. Our work is guided by these core principles:
Do great things: we deliver work we’re proud to sign our name to.
High velocity: speed matters. We move quickly, one step at a time.
One unit: we’re all in this together, with relationships grounded in trust, respect, and camaraderie.
Arago is backed by executives from Apple, Arm, Nvidia, Microsoft, and Hugging Face, as well as prominent US and European deeptech venture firms and exited founders.
As a Distributed Systems Software Engineer, you will be the architect behind scaling Arago’s accelerator from a single device to massive multi-device and multi-node clusters. Your work focuses on creating the distribution libraries, programming model extensions, and observability tooling that make clustered accelerators usable, debuggable, and correct at scale. You will not only design these systems but also validate them through hands‑on prototype deployments, ensuring our hardware powers the next generation of distributed AI workloads.
Master’s or PhD degree in Computer Science, Distributed Systems, or a related field.
Strong understanding of OS internals relevant to distributed environments (networking stacks, I/O, IPC).
Solid knowledge of memory safety and concurrency control in complex, long-running systems.
Practical experience with virtualization frameworks and isolation in distributed deployments.
Proven understanding of resource‑sharing protocols, coordination primitives, and distributed memory models.
Familiarity with parallel programming models (e.g., MPI, NCCL, or similar) for distributed software.
Ability to model and emulate distributed system behavior to analyze performance and failure modes.
Language: English at a proficient level. French is a plus.
Develop high-performance distribution libraries enabling seamless multi-device and multi-node accelerator use.
Architect and implement distribution features within the programming model to support coordination and synchronization.
Implement robust protocols for resource sharing and memory unification across heterogeneous devices and nodes.
Drive the prototype deployment and validation of clustered accelerator systems.
Build specialized tooling for distributed systems, including cross-node tracing and global debugging.
Integrate virtualization and hardware-level isolation mechanisms into distributed deployments.
Model and emulate distributed accelerator behavior to ensure architectural correctness and peak performance.
Debug critical issues spanning the host, device, runtime, and network layers.
Competitive cash compensation, with final package based on location, experience, and the pay of team members in similar positions.
Meaningful stock option plan offered at the earliest stage of the company (included in the majority of full time offers).
Relocation bonus and coverage of moving expenses for relocation within 20 minutes of the company’s location.
Healthcare coverage (including family-friendly options), pension contributions, professional development support, and 25 days of PTO, in addition to public holidays.
Ownership of a key technical domain, with significant vertical and/or horizontal growth opportunities, based on performance and individual drive.
Reimbursement of 50% of the public transport subscription fee.
A high-paced, multicultural (with 10 nationalities), and engineering-led environment.
Screening Call : Get to know you beyond your CV.
Technical meeting : Deep dive into your past projects and technical achievements.
CEO Interview : Assess the fit with the team’s culture and long-term vision.
Reference Calls: Mandatory calls with your former managers to validate strengths, weaknesses, and work style.
Technical Assessment: Take-home technical assignment crafted to the role you’re applying for.
Final Interview : Discuss your technical assignment and address any remaining questions with team members.