Job Search and Career Advice Platform

Enable job alerts via email!

Edge AI Inference Engineer - On-Device ML & C++

Tether Operations Limited

Remote

PLN 120,000 - 180,000

Full time

2 days ago
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A technology company in Warsaw is seeking a C++ Engineer to work on deploying machine learning models to edge devices and enhancing inference engines. The role involves collaborating with researchers to transition models from research to production and integrating AI features into existing products. Candidates should have strong programming skills in C++, experience with inference engines like Llama.cpp, and a degree in a related field.

Qualifications

  • Excellent programming skills in C++, experience in Javascript is a bonus.
  • Strong experience with Llama.cpp and ggml inference engines.
  • Good understanding of deep learning concepts and model architectures.
  • Experience with Watch's and LLMs.
  • Demonstrated ability to assimilate new technologies rapidly.

Responsibilities

  • Deploy machine learning models to edge devices using Llama.cpp, ggml, ONNX.
  • Collaborate with researchers for coding, training, and transitioning models.
  • Integrate AI features into existing products with the latest advancements.

Skills

C++ programming
Javascript
Deep learning concepts
Model architectures
Inference engines (Llama.cpp, ggml)

Education

Degree in Computer Science, AI, Machine Learning, or related field
Job description
A technology company in Warsaw is seeking a C++ Engineer to work on deploying machine learning models to edge devices and enhancing inference engines. The role involves collaborating with researchers to transition models from research to production and integrating AI features into existing products. Candidates should have strong programming skills in C++, experience with inference engines like Llama.cpp, and a degree in a related field.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.