
Senior AI Engineer (ggml/llama.cpp Specialist) | London, UK

Tether Operations Limited

London

On-site

GBP 70,000 - 100,000

Full time

2 days ago


Job summary

Tether Operations Limited is seeking a Senior AI Engineer to develop advanced AI models for a range of devices. The role focuses on deploying models with inference frameworks such as llama.cpp and ggml, working with research teams to move models into production, and contributing to fintech innovation. Candidates should have strong programming skills, a degree in Computer Science or AI, and experience with GPU deployment and deep learning.

Qualifications

Proficiency in Python, C, and C++ programming.
Experience with the llama.cpp and ggml inference engines, especially for GPU deployment.
Understanding of deep learning, transformers, and LLMs.

Responsibilities

Deploy machine learning models on edge devices using llama.cpp, ggml, and ONNX.
Collaborate with research teams to code, train, and transition models from research to production.
Integrate AI features into existing products.

Skills

Python

C++

Deep Learning

Transformers

LLMs

Integration of AI features

Learning new technologies

Education

Degree in Computer Science or AI

Tools

llama.cpp

GGML

ONNX

Senior AI Engineer (ggml/llama.cpp Specialist)

Tether Operations Limited - London, United Kingdom

Join Tether and Shape the Future of Digital Finance

At Tether, we're pioneering a global financial revolution with cutting-edge solutions that enable seamless, secure, and instant digital transactions across blockchains. Our products include the trusted stablecoin USDT, energy-efficient Bitcoin mining solutions, innovative AI and data sharing platforms, and educational initiatives to democratize digital learning.

Why Join Us?

Our remote, global team is passionate about fintech innovation. If you have excellent English communication skills and want to make a mark in the industry, Tether offers a collaborative environment to grow and innovate.

About the job:

This role involves building versatile AI models for deployment on a range of devices, from edge devices like smartphones to large-scale systems, using GGML as the core inference backend. Join us in developing AI solutions that surpass current industry standards.

Responsibilities:

Deploy machine learning models on edge devices using llama.cpp, ggml, and ONNX.
Collaborate with research teams to code, train, and transition models from research to production.
Integrate AI features that draw on the latest machine learning advancements into existing products.

Minimum Requirements:

Proficiency in Python, C, and C++ programming.
Experience with the llama.cpp and ggml inference engines, especially for GPU deployment.
Understanding of deep learning, transformers, and LLMs.
Ability to quickly learn new technologies.
Degree in Computer Science, AI, or related field, with a strong AI R&D background.

Join us in building advanced AI models that lead the industry forward.
