Enable job alerts via email!

Member of Technical Staff, AI Pretraining

Microsoft

Camden Town

On-site

GBP 70,000 - 90,000

Full time

Today
Be an early applicant

Job summary

A leading technology company in Camden Town seeks a talented AI Researcher to develop foundational AI models and collaborate on cutting-edge projects. Candidates should have strong analytical skills, experience in distributed systems, and a Bachelor's degree in Computer Science. This role involves engaging with cross-functional teams and driving large-scale training initiatives. Competitive salary and professional growth opportunities await the right candidate.

Qualifications

  • Proven expertise with a strong publication track record in relevant fields.
  • Significant technical leadership in high-impact projects.
  • Experience with large-scale distributed systems.
  • Ability to collaborate in a fast-paced, innovative environment.
  • Proven expertise in the area of pretraining.

Responsibilities

  • Develop algorithms and model architectures for large-scale training.
  • Drive implementations and oversee training runs on distributed systems.
  • Collaborate with teams on infrastructure and data.

Skills

Data-driven decision-making
Analytical skills
Collaboration
Technical leadership
C/C++/C#/Java/JavaScript/Python coding

Education

Bachelor’s Degree in Computer Science or related technical discipline
Job description
Overview

London NEW Help deliver one of the best foundational models in the world at Microsoft AI. At Microsoft AI, we are on a mission to train the world's most capable AI frontier models, pushing the boundaries of scale, performance and product deployment. The Pre-Training team at Microsoft AI tackles some of the most challenging problems in deep learning at scale. As a team, we will deliver one of the best foundation models in the world, forming the foundation of many initiatives across Microsoft AI.


Responsibilities


  • Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations.

  • Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack.

  • Collaborate closely with teams on infrastructure, data, post-training, and multimodality.

  • Embody our culture and values.


What we are looking for


  • Have proven expertise in areas of interest, evidenced by an exceptional publication track record and significant technical leadership in high-impact projects.

  • Exhibit strong analytical skills, attention to detail, and a commitment to data-driven decision-making.

  • Have experience and/or in-depth understandings about large-scale distributed systems.

  • Demonstrate an ability to work collaboratively in a fast-paced, innovative environment, Bachelor’s Degree in Computer Science or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python.

  • Proven expertise in the area of pretraining.


Additional / Preferred Qualifications


  • Demonstrated experience in large-scale AI.

  • Passionate about conversational AI and its deployment.

  • Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers.

  • Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI.

  • Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.