Job Search and Career Advice Platform

Enable job alerts via email!

AI Operations Engineer

BukuWarung

Daerah Khusus Ibukota Jakarta

On-site

IDR 503.186.000 - 838.645.000

Full time

Yesterday
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A tech company in Indonesia is seeking an AI Engineer to bridge AI product development and infrastructure management. The role involves designing, deploying, and maintaining AI systems to enhance internal tools and production workloads. Candidates should have over 5 years of experience, strong skills in Python and ML engineering, and proficiency in deploying AI solutions. The company values innovation and ethics, ensuring compliance with data security frameworks.

Qualifications

  • 5+ years of hands-on experience in backend or ML engineering roles.
  • Experience deploying and monitoring ML/LLM workloads in production.
  • Strong programming skills in Python, familiar with microservice design.

Responsibilities

  • Design, train, and deploy machine learning or LLM-based models.
  • Build modular APIs and microservices for data processing.
  • Collaborate with product teams to prototype AI-first user experiences.
  • Design and maintain scalable ML infrastructure.
  • Implement observability for AI models to track performance.
  • Automate workflows for model retraining and deployment.

Skills

Python programming
Machine Learning Engineering
AI Product Development
Containerization (Docker, Kubernetes)
Cloud platforms (GCP, AWS, Azure)
Observability tools (Prometheus, Grafana)
MLOps best practices
Automation skills

Education

Bachelor’s or Master’s degree in Computer Science, Engineering, or related field

Tools

FastAPI
Flask
Terraform
Airflow
PyTorch
TensorFlow
Hugging Face
LangChain
Job description

BukuWarung’s vision is to empower 60mn MSMEs in Indonesia to become financially aware and enable them to manage and grow their business using our technology platform from bookkeeping and digital payments to AI-driven merchant operations.

As part of our next growth phase, we are expanding our AI Platform and Operations function to build scalable, intelligent systems that accelerate product development, automate operations, and make infrastructure self-optimizing.

We are looking for an AI Engineer (AI Operations Engineer) who can bridge the gap between AI product development and infrastructure management designing, deploying, and maintaining AI systems that power both internal tools and production workloads.

Key Responsibilities
1) AI Product Development

Design, train, and deploy machine learning or LLM-based models that solve core operational and product problems (e.g., anomaly detection, classification, forecasting, and conversational AI).

Build modular APIs and microservices for inference, data processing, and automation.

Collaborate with product teams to prototype, test, and iterate on AI-first user experiences.

Convert experimental notebooks into production-grade pipelines and scalable services.

2) AI Infrastructure & Reliability

Design and maintain scalable ML infrastructure across training, deployment, and monitoring workflows.

Build CI/CD pipelines for model delivery, manage containerized inference systems, and ensure production reliability.

Implement observability for AI models tracking drift, latency, performance, and cost.

Collaborate with DevOps and platform engineering to optimize compute utilization, GPU scheduling, and cost management.

3) Automation & AIOps

Automate workflows for model retraining, deployment, and validation.

Build systems for intelligent alerting, anomaly detection, and auto-remediation of AI services.

Integrate AI pipelines into existing DevOps and monitoring tools for proactive issue management.

Develop robust data ingestion and processing pipelines (structured/unstructured).

Manage feature stores, vector databases, and embeddings pipelines for retrieval-augmented generation (RAG) systems.

Build internal developer tools and utilities for faster experimentation and monitoring.

Partner closely with AI researchers, backend engineers, and product managers to translate business needs into reliable AI systems.

Contribute to MLOps best practices, documentation, and standardization.

Ensure compliance with BukuWarung’s data security, audit, and ethical AI frameworks.

Qualifications
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field

5+ years of hands-on experience in backend or ML engineering roles

  • Strong programming skills in Python (FastAPI, Flask) and familiarity with microservice design
  • Experience deploying and monitoring ML/LLM workloads in production (batch and real-time)
  • Proficiency with:
    • ML/AI frameworks (PyTorch, TensorFlow, Hugging Face, LangChain)
    • Infrastructure tools (Docker, Kubernetes, Terraform, Airflow)
    • Cloud platforms (GCP, AWS, or Azure)
    • Observability stack (Prometheus, Grafana, ELK, OpenTelemetry)
  • Experience managing GPU-based workloads and cost optimization

    • Excellent problem-solving, debugging, and automation skills
    • Familiarity with vector databases (Pinecone, Weaviate, FAISS) and RAG pipeline architecture
    Preferred Experience

    Built and deployed AI-powered automation systems or developer tools

    Experience with LLM fine-tuning, embedding generation, or prompt engineering

    Exposure to distributed systems and scalable API design

    Understanding of data governance, security, and compliance in AI workflows

    Previous experience in fintech, SaaS, or infrastructure-heavy products

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.