Enable job alerts via email!

Software Engineer - Large Language Models

Fastino Labs

Remote

GBP 60,000 - 90,000

Full time

Yesterday

Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A forward-thinking AI company is seeking talented individuals to join their team focusing on developing next-generation language models. The role involves optimizing multimodal models, experimenting with novel architectures, and applying advanced techniques to align outputs with human quality standards. Ideal candidates have a strong background in AI product development with experience in deep learning frameworks. Enjoy the flexibility of remote work along with occasional travel to Silicon Valley for team collaboration.

Qualifications

Proficiency in building and shipping AI products quickly.
Experience with deep learning and computer vision methodologies.
Strong technical skills in deep learning frameworks.

Responsibilities

Experiment with novel language model architectures.
Optimize multimodal models for quality and performance.
Implement data processing pipelines for training.
Apply reinforcement learning techniques to align model outputs.
Collaborate with engineering to ship model updates.

Skills

AI product development

Independent research

PyTorch

JAX

TensorFlow

Education

Advanced degree in Computer Science or related field

Full-time | Remote with trips to Silicon Valley office | Reports to Founders

Introduction

Join us at Fastino as we build the next generation of LLMs. Our team, boasting alumni from Google Research, Apple, Stanford, and Cambridge is on a mission to develop specialized, efficient AI.
Fastino's GLiNER family of open source models has been downloaded more than 5 million times and is used by companies such as NVIDIA, Meta, and Airbnb
Fastino has raised $25M (as featured in TechCrunch) through our seed round and is backed by leading investors including Microsoft, Khosla Ventures, Insight Partners, Github CEO Thomas Dohmke, Docker CEO Scott Johnston, and others.

What You’ll Work On

Experiment with novel language model architectures, helping drive and execute Fastino's research roadmap
Optimize Fastino’s multimodal models to improve response quality, instruction adherence, and overall performance metrics
Architect data processing pipelines, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories
Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards
Build robust and real-world motivated evaluations
Partner with Fastino engineering team to ship model updates directly to customers
Establish best practices for code health and documentation on the team, to facilitate collaboration and reliable development

What We’re Looking For

Required - Great velocity for building and shipping agents / AI products.
Optional - Advanced degree (Master's or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies
Optional - Demonstrated ability to do independent research in Academic or Industry settings
Optional - Substantial industry experience in large-scale deep learning model training, with demonstrated expertise in at least one of Large Language Models, Vision-Language Models, Diffusion Models, or comparable generative AI architectures
Optional - Comprehensive technical proficiency and practical experience with leading deep learning frameworks, including advanced competency in one of PyTorch, JAX, TensorFlow, or equivalent platforms for model development and optimization

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Top locations

Top companies

Top positions