
A leading technology firm in Ciudad de México seeks an experienced C++ engineer to enhance AI inference engines for edge devices. Responsibilities include deploying machine learning models and collaborating on production environments. An excellent understanding of deep learning and strong C++ skills are essential. Ideal candidates will hold a related degree and have experience with llama.cpp and similar technologies. This is an opportunity to contribute to AI research and development.
You’ll work on the C++ layer that powers local AI, porting and enhancing inference engines such as llama.cpp, ONNX, and similar runtimes so they run efficiently on edge devices. Your focus is on the runtime: making models load faster, run leaner, and perform well across different hardware. You’ll ensure that the inference layer is stable, optimized, and ready for integration with the rest of the stack.
This role is for engineers who want to work close to the metal, enabling private and fast on-device AI without relying on cloud infrastructure.