Attiva gli avvisi di lavoro via e-mail!

Semantic Search Engineer, Emilia-Romagna

Axiom Software Solutions

Emilia-Romagna

In loco

EUR 40.000 - 65.000

Tempo pieno

Oggi

Candidati tra i primi

Descrizione del lavoro

A leading software company in Emilia-Romagna is seeking a Semantic Search Engineer to develop solutions for data acquisition and enrichment in search engines. The ideal candidate will have strong experience with Apache Solr, AWS, and data pipelines. Responsibilities include crawling data from various sources, enriching it with linked data, and managing indexing workflows. This role offers a dynamic opportunity to work with advanced technologies and cloud infrastructure.

Competenze

Experience with crawling data using Apache ManifoldCF.
Familiarity with Microsoft Graph API for accessing Microsoft 365 data.
Ability to enrich data with RDF triples and linked data using Apache Marmotta.
Proficient in managing data workflows with Pipeship.
Experience in indexing data in Apache Solr or AWS OpenSearch.
Knowledge of hosting applications on AWS, including EC2/EKS and S3.

Mansioni

Crawl SharePoint, file systems, or databases for data acquisition.
Enrich data with RDF triples and linked data.
Manage ingestion, enrichment, and indexing workflows.
Push enriched data to search engines.
Monitor cloud infrastructure on AWS.

Conoscenze

Apache Solr

Pipeship

AWS

Data Specialist

Machine Learning

Microsoft Graph

Strumenti

Apache ManifoldCF

Apache Marmotta

Semantic Search Engineer

We are looking for Semantic Search Developers for crawling and feeding data into search engines. Should have worked with Apache Solr, Pipeship, AWS, Data Specilist, ML and MS Graph

Data Acquisition
Use Apache ManifoldCF to crawl SharePoint, file systems, or databases.

Use Microsoft Graph API for structured Microsoft 365 data.
Semantic Enrichment
Use Apache Marmotta to enrich data with RDF triples and linked data.
Pipeline Orchestration
Use Pipeship to manage ingestion, enrichment, and indexing workflows.
Indexing Search
Push enriched data into Apache Solr or AWS OpenSearch.

Use custom analyzers and faceting for semantic search.
Cloud Infrastructure
Host components on AWS EC2 / EKS, store data in S3, and monitor with CloudWatch.

Ottieni la revisione del curriculum gratis e riservata.

oppure trascina qui un file PDF, DOC, DOCX, ODT o PAGES di non oltre 5 MB.