
Attiva gli avvisi di lavoro via e-mail!
A leading software company in Emilia-Romagna is seeking a Semantic Search Engineer to develop solutions for data acquisition and enrichment in search engines. The ideal candidate will have strong experience with Apache Solr, AWS, and data pipelines. Responsibilities include crawling data from various sources, enriching it with linked data, and managing indexing workflows. This role offers a dynamic opportunity to work with advanced technologies and cloud infrastructure.
We are looking for Semantic Search Developers for crawling and feeding data into search engines. Should have worked with Apache Solr, Pipeship, AWS, Data Specilist, ML and MS Graph
Use Apache ManifoldCF to crawl SharePoint, file systems, or databases.
Use Microsoft Graph API for structured Microsoft 365 data.
Use Apache Marmotta to enrich data with RDF triples and linked data.
Use Pipeship to manage ingestion, enrichment, and indexing workflows.
Push enriched data into Apache Solr or AWS OpenSearch.
Use custom analyzers and faceting for semantic search.
Host components on AWS EC2 / EKS, store data in S3, and monitor with CloudWatch.