Attiva gli avvisi di lavoro via e-mail!

Semantic Search Engineer, Emilia-Romagna

Axiom Software Solutions

Emilia-Romagna

In loco

EUR 40.000 - 65.000

Tempo pieno

Oggi
Candidati tra i primi

Descrizione del lavoro

A leading software company in Emilia-Romagna is seeking a Semantic Search Engineer to develop solutions for data acquisition and enrichment in search engines. The ideal candidate will have strong experience with Apache Solr, AWS, and data pipelines. Responsibilities include crawling data from various sources, enriching it with linked data, and managing indexing workflows. This role offers a dynamic opportunity to work with advanced technologies and cloud infrastructure.

Competenze

  • Experience with crawling data using Apache ManifoldCF.
  • Familiarity with Microsoft Graph API for accessing Microsoft 365 data.
  • Ability to enrich data with RDF triples and linked data using Apache Marmotta.
  • Proficient in managing data workflows with Pipeship.
  • Experience in indexing data in Apache Solr or AWS OpenSearch.
  • Knowledge of hosting applications on AWS, including EC2/EKS and S3.

Mansioni

  • Crawl SharePoint, file systems, or databases for data acquisition.
  • Enrich data with RDF triples and linked data.
  • Manage ingestion, enrichment, and indexing workflows.
  • Push enriched data to search engines.
  • Monitor cloud infrastructure on AWS.

Conoscenze

Apache Solr
Pipeship
AWS
Data Specialist
Machine Learning
Microsoft Graph

Strumenti

Apache ManifoldCF
Apache Marmotta
Descrizione del lavoro
Semantic Search Engineer

We are looking for Semantic Search Developers for crawling and feeding data into search engines. Should have worked with Apache Solr, Pipeship, AWS, Data Specilist, ML and MS Graph

  1. Data Acquisition

    Use Apache ManifoldCF to crawl SharePoint, file systems, or databases.

    Use Microsoft Graph API for structured Microsoft 365 data.

  2. Semantic Enrichment

    Use Apache Marmotta to enrich data with RDF triples and linked data.

  3. Pipeline Orchestration

    Use Pipeship to manage ingestion, enrichment, and indexing workflows.

  4. Indexing Search

    Push enriched data into Apache Solr or AWS OpenSearch.

    Use custom analyzers and faceting for semantic search.

  5. Cloud Infrastructure

    Host components on AWS EC2 / EKS, store data in S3, and monitor with CloudWatch.

Ottieni la revisione del curriculum gratis e riservata.
oppure trascina qui un file PDF, DOC, DOCX, ODT o PAGES di non oltre 5 MB.