
Enable job alerts via email!
Generate a tailored resume in minutes
Land an interview and earn more. Learn more
A cutting-edge technology company based in Malaysia is seeking a Principal AI Engineer to lead architectural development for AI-driven 'Super Agents'. You will design workflows, build integration layers, and optimize inference pipelines for advanced LLMs. The ideal candidate has deep expertise in Python and modern AI stacks. Additional responsibilities include architecting memory systems and establishing testing frameworks. Join a passionate team committed to redefining creativity through innovative AI products.
Role Title: Principal AI Engineer (LLM Agents & Orchestration) Team: Chatly Engineering Focus: Building Autonomous "Super Agents"
Vyro is redefining the future of digital creativity. We build cutting‑edge content creation tools powered by Artificial Intelligence and Machine Learning, helping millions of creators, designers, and storytellers bring their imagination to life — effortlessly.
With a global user base of over 5 million active creators every month, Vyro’s 20+ AI‑powered apps are transforming how people design, edit, and express themselves across images, videos, and beyond.
From intuitive AI photo editors to next‑gen video creation platforms, our products are designed to make creativity accessible, fast, and limitless.
At Vyro, we’re a team of innovators, builders, and dreamers — known as Vyronauts — driven by passion, purpose, and the belief that technology should inspire creativity, not complicate it.
We are looking for a deep expert in Large Language Models (LLMs) to lead the architectural development of our new "Super Agent" within Chatly. You will move beyond simple chat interfaces to build autonomous agents capable of complex reasoning, tool usage, and seamless integration with external workflows.
Agent Architecture: Design and implement stateful agentic workflows (using frameworks like LangGraph or custom Python/TypeScript solutions) that can plan, execute, and self‑correct.
Integration Ecosystem: Build the "hands" of the agent. Develop robust integration layers that allow the LLM to interact with our internal APIs, databases, and third‑party tools reliably.
Latency & Reliability: Optimize inference pipelines for speed (streaming, token optimization) and reliability (handling hallucinations, structured output validation).
Memory Systems: Architect advanced RAG (Retrieval‑Augmented Generation) systems to give the Super Agent persistent memory and context awareness across sessions.
Evaluation & Observability: Establish a rigorous testing framework for non‑deterministic model outputs to ensure the agent behaves as expected in production.
Core Stack: Deep proficiency in Python (or TypeScript) and the modern AI stack (LangChain, LlamaIndex, DSPy).
Model Internals: Strong grasp of how to leverage specific model strengths (e.g., GPT‑4o for reasoning, Haiku/Flash for speed) and experience with Function Calling/Tool Use.
Vector Search: Hands‑on experience with vector databases (Pinecone, Milvus, Weaviate) and embedding strategies.
System Design: Experience designing event‑driven architectures where agents respond to triggers, not just user prompts.
Why Join Us?
Opportunity to work on innovative AI products like Chatly and Imagine, shaping the future of creativity and user interaction.
Be part of a passionate, fast‑moving team that values innovation and data‑driven decisions.
Competitive compensation and benefits.
A culture where learning, growth, and experimenting with new ideas are deeply encouraged.