Job Search and Career Advice Platform

Enable job alerts via email!

Software Engineer - Large Language Models

Fastino Labs

Liverpool

Hybrid

GBP 60,000 - 100,000

Full time

Yesterday
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

An innovative AI startup in Liverpool is seeking talented individuals to join their team focusing on developing the next generation of language models. The role involves experimenting with novel architectures, optimizing models for performance, and collaborating closely with engineering teams to deliver updates to clients. Ideal candidates would have experience in AI product development and advanced degrees in relevant disciplines. The position offers a dynamic work environment and the potential for remote work with occasional travel to the company's Silicon Valley office.

Qualifications

  • Great velocity for building and shipping agents/AI products.
  • Advanced degree or substantial industry experience in AI.
  • Demonstrated expertise in training large-scale deep learning models.

Responsibilities

  • Experiment with novel language model architectures.
  • Optimize multimodal models for improved performance.
  • Architect data processing pipelines for quality training data.
  • Implement reinforcement learning techniques for model alignment.
  • Build robust evaluations to assess model quality.
  • Collaborate with the engineering team for model updates.
  • Establish best practices for code health and documentation.

Skills

Building AI products
Independent research
Large Language Models expertise
Deep learning frameworks

Education

Master's or PhD in Computer Science or related field

Tools

PyTorch
JAX
TensorFlow
Job description

Full-time | Remote with trips to Silicon Valley office | Reports to Founders

Introduction
  • Join us at Fastino as we build the next generation of LLMs. Our team, boasting alumni from Google Research, Apple, Stanford, and Cambridge is on a mission to develop specialized, efficient AI.
  • Fastino's GLiNER family of open source models has been downloaded more than 5 million times and is used by companies such as NVIDIA, Meta, and Airbnb
  • Fastino has raised $25M (as featured in TechCrunch) through our seed round and is backed by leading investors including Microsoft, Khosla Ventures, Insight Partners, Github CEO Thomas Dohmke, Docker CEO Scott Johnston, and others.
What You’ll Work On
  • Experiment with novel language model architectures, helping drive and execute Fastino's research roadmap
  • Optimize Fastino’s multimodal models to improve response quality, instruction adherence, and overall performance metrics
  • Architect data processing pipelines, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories
  • Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards
  • Build robust and real-world motivated evaluations
  • Partner with Fastino engineering team to ship model updates directly to customers
  • Establish best practices for code health and documentation on the team, to facilitate collaboration and reliable development
What We’re Looking For
  • Required - Great velocity for building and shipping agents / AI products.
  • Optional - Advanced degree (Master's or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies
  • Optional - Demonstrated ability to do independent research in Academic or Industry settings
  • Optional - Substantial industry experience in large-scale deep learning model training, with demonstrated expertise in at least one of Large Language Models, Vision-Language Models, Diffusion Models, or comparable generative AI architectures
  • Optional - Comprehensive technical proficiency and practical experience with leading deep learning frameworks, including advanced competency in one of PyTorch, JAX, TensorFlow, or equivalent platforms for model development and optimization
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.