Enable job alerts via email!

NLP Engineer Machine Learning Engineer|5+ years| Remote

PradeepIT Consulting Services

Singapore

Remote

SGD 80,000 - 110,000

Full time

2 days ago
Be an early applicant

Job summary

A consulting technology firm in Singapore is seeking an experienced NLP/ML Engineer to lead the development of intelligent document understanding systems. The role involves building scalable systems for document parsing and knowledge graph construction, focusing on both English and German texts. The ideal candidate has strong experience in NLP, proficiency in Python, and hands-on experience with layout-aware models.

Qualifications

  • Strong experience in NLP and document understanding.
  • Proficiency in Python and relevant libraries.
  • Hands-on experience with layout-aware models.

Responsibilities

  • Design and implement document hierarchy and section segmentation pipelines.
  • Build multilingual entity recognition systems for English and German texts.
  • Maintain knowledge graphs representing semantic relationships.

Skills

NLP
Information extraction
Multilingual proficiency
Python programming

Tools

Python
NLTK
transformers
spaCy
Neo4j

Job description

NLP Engineer / Machine Learning Engineer | 5+ years | Remote

About the job

We are seeking a hands-on NLP/ML engineer to lead the development of an intelligent document understanding pipeline. The goal is to extract structured data from complex, unstructured RFQ documents (40100+ pages, in German and English). You will be responsible for building scalable systems that combine document parsing, layout analysis, entity extraction, and knowledge graph construction, ultimately feeding downstream applications such as analytics and LLM models.

Key Responsibilities

  1. Design and implement document hierarchy and section segmentation pipelines using layout-aware models (e.g., DocLayout-YOLO, LayoutLM, Donut).
  2. Build multilingual entity recognition and relation extraction systems for English and German texts.
  3. Develop custom tokenization, parsing, and information extraction logic using tools like NLTK, transformers, and spaCy.
  4. Construct and maintain knowledge graphs representing semantic relationships using graph data structures and databases such as Neo4j.
  5. Integrate outputs into structured, LLM-friendly formats (e.g., JSON, Markdown) for downstream extraction tasks.
  6. Collaborate with product and domain experts to define information schemas, ontologies, and validation methods.

Qualifications

  1. Strong experience in NLP, document understanding, and information extraction from unstructured/multilingual documents.
  2. Proficiency in Python and experience with libraries such as transformers, spaCy, and NLTK.
  3. Hands-on experience with layout-aware models like DocLayout-YOLO, LayoutLM, Donut, or similar.
  4. Familiarity with knowledge graphs and graph databases such as Neo4j, RDF.

Additional Details

Recruiter: PradeepIT Consulting Services

Sign-in to see how your skills match this job, find your perfect job, get noticed by top recruiters, and get hired fast.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.