Enable job alerts via email!

Semantic Backend Engineer: Pipelines & Embeddings

Infuse

Gauteng

Remote

ZAR 500 000 - 700 000

Full time

Today
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A technology company in Gauteng is seeking an applied ML engineer to lead the semantic ingestion pipeline, transforming raw PDFs into searchable assets. Responsibilities include managing the ETL pipeline, implementing content freshness logic, and collaborating on UX integration. Ideal candidates should have experience with ML pipelines and a passion for turning unstructured data into structured resources.

Qualifications

  • 3+ years of experience in building ML pipelines.
  • Experience with semantic search and large-scale tagging.
  • Ability to handle unstructured data.

Responsibilities

  • Own the ETL pipeline from raw PDFs to structured resources.
  • Finalize the summarization and classification flow.
  • Implement logic for freshness in content indexing.
  • Collaborate closely with the Tech Lead on UX integration.

Skills

Python
PyTorch
sentence-transformers
OpenAI APIs
FastAPI
Milvus
pgvector
PyPDF / Tika
Airflow
Docker
Job description
A technology company in Gauteng is seeking an applied ML engineer to lead the semantic ingestion pipeline, transforming raw PDFs into searchable assets. Responsibilities include managing the ETL pipeline, implementing content freshness logic, and collaborating on UX integration. Ideal candidates should have experience with ML pipelines and a passion for turning unstructured data into structured resources.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.