Senior AI Inference Engineer (100% Remote - United Kingdom) | London, UK

Tether Operations Limited

London

Remote

GBP 150,000 - 200,000

Full time

2 days ago

Job summary

A leading fintech company is seeking a Senior AI Inference Engineer to develop and deploy innovative AI technologies. The role involves collaborating with researchers and integrating AI features into products. Ideal candidates have strong Python and C/C++ programming skills, experience with AI deployment platforms, and a degree in Computer Science or a related field. The role is fully remote and based in the UK.

Qualifications

  • Strong programming skills in Python and C/C++.
  • Experience with AI deployment platforms (required).
  • Knowledge of NLP and machine learning frameworks.

Responsibilities

  • Deploy machine learning models to edge devices.
  • Collaborate with researchers to transition models to production.
  • Integrate AI features into existing products.

Skills

Programming in Python
C/C++ programming skills
Knowledge in NLP
Transformers and fine-tuning
Computer vision
TensorFlow
PyTorch
JAX
CUDA

Education

Degree in Computer Science or related field

Tools

Deployment platforms such as Llama.cpp
ONNX
TVM
MLC LLM
IREE (MLIR)

Job description

Join Tether and Shape the Future of Digital Finance

At Tether, we're pioneering a global financial revolution with innovative solutions that enable seamless, secure, and instant digital transactions using blockchain technology. Our products include the trusted stablecoin USDT, energy solutions for Bitcoin mining, AI and peer-to-peer technology advancements, digital learning initiatives, and more.

Our team is remote, diverse, and driven by a passion for fintech innovation. If you excel in English communication and aspire to work on cutting-edge platforms, Tether offers a unique opportunity to make a significant impact.

About the role

We aim to make advanced AI technologies accessible, building the next generation of AI models for both large-scale and edge device applications. Join us to develop AI solutions that surpass current industry standards, fostering broad accessibility and technological advancement.

Responsibilities

  1. Deploy machine learning models to edge devices using frameworks such as Llama.cpp, ONNX, TVM, MLC LLM, and IREE (MLIR).
  2. Collaborate with researchers to code, train, and transition models from research to production.
  3. Integrate AI features into existing products, leveraging the latest machine learning advancements.

Qualifications

  • Strong programming skills in Python and C/C++.
  • Experience with deployment platforms such as Llama.cpp, ONNX, TVM, MLC LLM, and IREE (MLIR).
  • Knowledge of NLP, transformers, fine-tuning, computer vision, TensorFlow, PyTorch, JAX, and CUDA.
  • Experience with LLMs, fine-tuning, RAG, and transformers is preferred.
  • Ability to quickly learn new technologies.
  • Degree in Computer Science, AI, Machine Learning, or related field, with a proven track record in AI R&D.

Join us in building innovative AI models that lead the industry and expand technological accessibility.
