Enable job alerts via email!

Principal AI Engineer (LLM Agents & Orchestration)

Vyro

Malaysia

On-site

MYR 80,000 - 100,000

Full time

Today

Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A cutting-edge technology company based in Malaysia is seeking a Principal AI Engineer to lead architectural development for AI-driven 'Super Agents'. You will design workflows, build integration layers, and optimize inference pipelines for advanced LLMs. The ideal candidate has deep expertise in Python and modern AI stacks. Additional responsibilities include architecting memory systems and establishing testing frameworks. Join a passionate team committed to redefining creativity through innovative AI products.

Benefits

Competitive compensation

Culture of learning and growth

Opportunity to work on innovative AI products

Qualifications

Deep proficiency in Python (or TypeScript) and the modern AI stack.
Strong grasp of model internals and experience with Function Calling.
Hands-on experience with vector databases and embedding strategies.

Responsibilities

Design and implement stateful agentic workflows.
Build robust integration layers for LLM interactions.
Optimize inference pipelines for speed and reliability.
Architect advanced Retrieval-Augmented Generation systems.
Establish a testing framework for model outputs.

Skills

Deep proficiency in Python

Deep proficiency in TypeScript

Experience with LangChain

Experience with LlamaIndex

Experience with DSPy

Hands-on experience with vector databases

Experience in event-driven architectures

Principal AI Engineer (LLM Agents & Orchestration)

Role Title: Principal AI Engineer (LLM Agents & Orchestration) Team: Chatly Engineering Focus: Building Autonomous "Super Agents"

Who We Are

Vyro is redefining the future of digital creativity. We build cutting‑edge content creation tools powered by Artificial Intelligence and Machine Learning, helping millions of creators, designers, and storytellers bring their imagination to life — effortlessly.

With a global user base of over 5 million active creators every month, Vyro’s 20+ AI‑powered apps are transforming how people design, edit, and express themselves across images, videos, and beyond.

From intuitive AI photo editors to next‑gen video creation platforms, our products are designed to make creativity accessible, fast, and limitless.

At Vyro, we’re a team of innovators, builders, and dreamers — known as Vyronauts — driven by passion, purpose, and the belief that technology should inspire creativity, not complicate it.

If you’re excited about shaping the next wave of AI‑powered creativity, Vyro is the place to be.

About the Role

We are looking for a deep expert in Large Language Models (LLMs) to lead the architectural development of our new "Super Agent" within Chatly. You will move beyond simple chat interfaces to build autonomous agents capable of complex reasoning, tool usage, and seamless integration with external workflows.

Key Responsibilities

Agent Architecture: Design and implement stateful agentic workflows (using frameworks like LangGraph or custom Python/TypeScript solutions) that can plan, execute, and self‑correct.
Integration Ecosystem: Build the "hands" of the agent. Develop robust integration layers that allow the LLM to interact with our internal APIs, databases, and third‑party tools reliably.
Latency & Reliability: Optimize inference pipelines for speed (streaming, token optimization) and reliability (handling hallucinations, structured output validation).
Memory Systems: Architect advanced RAG (Retrieval‑Augmented Generation) systems to give the Super Agent persistent memory and context awareness across sessions.
Evaluation & Observability: Establish a rigorous testing framework for non‑deterministic model outputs to ensure the agent behaves as expected in production.

Technical Requirements

Core Stack: Deep proficiency in Python (or TypeScript) and the modern AI stack (LangChain, LlamaIndex, DSPy).
Model Internals: Strong grasp of how to leverage specific model strengths (e.g., GPT‑4o for reasoning, Haiku/Flash for speed) and experience with Function Calling/Tool Use.
Vector Search: Hands‑on experience with vector databases (Pinecone, Milvus, Weaviate) and embedding strategies.
System Design: Experience designing event‑driven architectures where agents respond to triggers, not just user prompts.

Why Join Us?

Opportunity to work on innovative AI products like Chatly and Imagine, shaping the future of creativity and user interaction.
Be part of a passionate, fast‑moving team that values innovation and data‑driven decisions.
Competitive compensation and benefits.
A culture where learning, growth, and experimenting with new ideas are deeply encouraged.

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Top locations

Top companies

Top positions