
A leading AI solutions firm in Johannesburg is seeking an AI / LLM Ops Engineer to design and maintain LLM pipelines. This role involves deploying agent frameworks and ensuring model performance. The ideal candidate has hands-on experience with LLMs, is proficient in Python, and understands cloud infrastructure.
Do you want to help shape the future of intelligent systems, focusing on LLMs and agents?
Do you enjoy turning cutting‑edge research into production‑grade AI solutions?
Then join Elixirr Digital as our next AI / LLM Ops Engineer!
As an AI / LLM Ops Engineer, you will drive innovation and help shape the future of intelligent systems for clients across industries such as banking, insurance, healthcare, telecommunications, and consumer goods.
We’re looking for proactive self‑starters who constantly read research, apply it, and use AI to improve processes, fully embracing the technology.
At Elixirr Digital, we’ve been early adopters of generative AI, working across open‑source and foundation models.
From building agents that automate internal workflows to deploying LLM‑powered tools that delight users, we push boundaries, apply emerging tech, and deliver real impact.
If you’re excited to work at the forefront of generative AI, helping enterprises transform from the inside out, we want to hear from you.
Let’s build the future of AI together!
Design, build, and maintain LLM / agent pipelines: from prototyping to fine‑tuning, deployment, monitoring, and scaling.
Design and deploy agentic systems and orchestration frameworks leveraging LangChain, AutoGen, or other agent‑oriented architectures to enable scalable, reliable automation.
Implement and operationalise RAG pipelines, including document ingestion, embedding generation, context chunking, and vector database integration, for optimised retrieval and knowledge management.
Monitor model performance (latency, accuracy, drift), ensuring reliability and robustness.
Ensure security and infrastructure scalability (cloud / GPU / TPU, etc.).
Innovate: explore research, new frameworks, and agentic capabilities; prototype internal tools; help improve processes and tooling.
Hands‑on with LLMs: fine‑tuning, deployment, building agentic solutions.
Practical experience with agent frameworks / tools (e.g., LangChain, AutoGen, Crew, Swarm).
Solid understanding of cloud infrastructure, containerisation (Docker, Kubernetes), orchestration, and CI / CD pipelines.
Proven track record of optimising model performance: managing latency, monitoring, drift detection, and logging (Ragas, DeepEval, LangSmith).
Comfortable scripting / programming in Python (plus automation tooling).
Experience with multi‑modal agents or models (text, image, audio).
Prior work with vector databases (Pinecone, Weaviate, etc.), retrieval or knowledge store architectures.
We could be a perfect fit if you are:
Passionate about technology. You anticipate, recognise, and resolve technical problems using a variety of specialised tools for application development and support.
Independent. You are a self‑motivated and ambitious individual, capable of managing multiple responsibilities effectively.
Problem‑solver. You think creatively and find solutions to complex challenges.
Creative and outside‑the‑box thinker. You look beyond blog posts and whitepapers, competitions, and even state‑of‑the‑art benchmarks to solve real‑world problems.
Communicator. Strong verbal and written communication skills are essential to ensure effective collaboration and timely delivery of results within the team.
Proficient in English. We work across continents in a global environment, so fluent English, both written and spoken, is a must.
Global Impact: Work with top‑tier clients across industries, solving real‑world challenges with transformative AI solutions.
Cutting‑Edge Innovation: Develop and deploy AI tools and platforms that redefine what’s possible in enterprise AI.
Collaborative Culture: Join a team of innovators passionate about advancing AI while fostering a collaborative, inclusive environment.
Diverse Projects: Tackle a wide range of AI applications, from NLP to reinforcement learning, for use cases as varied as field support for telecom workers and optimising customer workflows in banks.
From working with cutting‑edge technologies to solving complex challenges for global clients, we make sure your work matters.
And while you’re building great things, we’re here to support you.
Performance bonus, Employee Stock Options Grant, Employee Share Purchase Plan (ESPP), competitive compensation
Health & Wellbeing: health benefits plan, flexible working hours, pension plan
Projects & Tools: modern equipment, big clients and interesting projects, cutting‑edge technologies
Learning & Growth: growth and development opportunities, internal LMS & knowledge hubs
We don’t just offer a job; we create space for you to grow, thrive, and be recognised.
Intrigued?
Apply now!