Overview
London NEW Help deliver one of the best foundational models in the world at Microsoft AI. At Microsoft AI, we are on a mission to train the world's most capable AI frontier models, pushing the boundaries of scale, performance and product deployment. The Pre-Training team at Microsoft AI tackles some of the most challenging problems in deep learning at scale. As a team, we will deliver one of the best foundation models in the world, forming the foundation of many initiatives across Microsoft AI.
Responsibilities
- Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations.
- Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack.
- Collaborate closely with teams on infrastructure, data, post-training, and multimodality.
- Embody our culture and values.
What we are looking for
- Have proven expertise in areas of interest, evidenced by an exceptional publication track record and significant technical leadership in high-impact projects.
- Exhibit strong analytical skills, attention to detail, and a commitment to data-driven decision-making.
- Have experience and/or in-depth understandings about large-scale distributed systems.
- Demonstrate an ability to work collaboratively in a fast-paced, innovative environment, Bachelor’s Degree in Computer Science or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python.
- Proven expertise in the area of pretraining.
Additional / Preferred Qualifications
- Demonstrated experience in large-scale AI.
- Passionate about conversational AI and its deployment.
- Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers.
- Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI.
- Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team.