AI RAG Engineer

Hitachi eBworx

Selangor

On-site

MYR 150,000 - 200,000

Full time

17 days ago

Job summary

A leading consulting firm, Hitachi eBworx, is looking for an experienced AI RAG Engineer to revolutionize search and retrieval using AI. You will work on cutting-edge technologies, fine-tune language models, and enhance AI-driven systems, playing a crucial role in transforming data into actionable insights.

Benefits

Career Growth Programs
Competitive Salary
Bonuses
Access to Latest Tools & Technologies

Qualifications

  • Experience with fine-tuning LLMs and knowledge of Transformer models.
  • Proficiency in Python and frameworks like PyTorch and TensorFlow.
  • Hands-on experience with vector databases and ETL pipelines.

Responsibilities

  • Develop and optimize RAG pipelines for AI-driven search.
  • Implement retrieval models using vector databases.
  • Integrate structured and unstructured data with AI systems.

Skills

NLP
Reinforcement Learning
Machine Learning
Semantic Search
SQL
NoSQL
Python

Tools

PyTorch
TensorFlow
JAX
Docker
Kubernetes

Job description

We are seeking an experienced AI RAG Engineer to design and optimize Retrieval-Augmented Generation (RAG) pipelines, leveraging cutting-edge AI and machine learning techniques. In this role, you will work on integrating large language models (LLMs) with structured and unstructured data sources, fine-tuning AI models, and improving knowledge retrieval for enhanced accuracy and efficiency.

This is an exciting opportunity to be at the forefront of AI-driven search and retrieval, working with some of the latest advancements in NLP, reinforcement learning, and scalable AI infrastructure.

What Awaits You:

Latest Tools & Tech – Work with cutting-edge technologies to stay ahead.
Career Growth – Access training programs for upskilling or reskilling to build your portfolio.
Great Pay & Perks – Competitive salary and bonuses to reward your expertise and contributions.

What You'll Do:

  • Develop and optimize RAG pipelines to enhance AI-driven search and retrieval systems.
  • Implement retrieval models using vector databases such as FAISS, Pinecone, Weaviate, and Milvus.
  • Fine-tune LLMs using supervised learning and reinforcement learning techniques like RLHF (Reinforcement Learning from Human Feedback).
  • Design and train embedding models to improve document retrieval and knowledge extraction.
  • Integrate structured and unstructured knowledge bases with AI systems to enhance response relevance.
  • Improve response accuracy and contextual coherence in AI-generated outputs.
  • Develop and optimize retrieval and ranking algorithms for real-time applications.
  • Leverage open-source and proprietary AI models (e.g., Llama, Mistral, GPT, Claude) to enhance retrieval efficiency.
  • Optimize AI model deployment and improve training pipelines for production environments.

What We Need From You:

  • Experience with LLM fine-tuning and training (e.g., Hugging Face, OpenAI API, LangChain).
  • Strong understanding of Transformer models and NLP architectures.
  • Knowledge of vector embeddings, semantic search, and retrieval models (BM25, DPR, ColBERT, Hybrid Search).
  • Expertise in reinforcement learning and self-supervised learning.
  • Hands-on experience with RLHF (Reinforcement Learning from Human Feedback).

Programming & Frameworks

  • Proficiency in Python and experience with PyTorch, TensorFlow, and JAX.
  • Experience with LLM orchestration frameworks (LangChain, LlamaIndex).
  • Strong skills in SQL and NoSQL databases for knowledge storage.
  • Experience with API integrations for AI systems (OpenAI, Anthropic, Hugging Face).
  • Familiarity with distributed computing and ML model scaling.
  • Experience with vector databases (FAISS, Pinecone, Milvus, Weaviate).
  • Strong understanding of ETL pipelines for processing large-scale datasets.
  • Experience deploying AI models in cloud environments (AWS, Azure, GCP).
  • Knowledge of containerization (Docker, Kubernetes) and MLOps practices.

Your application will include the following questions:

  • What's your expected monthly basic salary?
  • Which of the following types of qualifications do you have?
  • Have you worked in a role which requires experience with machine learning?

Computer Software & Networking

101-1,000 employees

Hitachi eBworx is a leading international consulting and technology solutions firm delivering innovative and high-performance solutions to banks in the region. Hitachi eBworx is a subsidiary of Hitachi, Ltd., a Fortune Global 500 company. We are now embarking on a three-year growth plan targeting solutions, services, and geographical expansion. To achieve this plan, we are looking to expand our management and delivery capacity and grow our talent pool.