Enable job alerts via email!

Senior Web Data Acquisition Engineer (Resilient Crawling)

ZEN

Rzeszów

On-site

PLN 120,000 - 180,000

Full time

Today
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading financial technology company in Rzeszów is seeking a Senior Web Data Acquisition Engineer to design and manage web data pipelines. You will utilize scraping techniques and APIs while ensuring data quality and compliance. Ideal candidates have experience with high-throughput systems and a passion for data engineering in a fast-paced, supportive environment.

Benefits

Learning opportunities with production AI
Talented supportive peers
High autonomy and ownership

Qualifications

  • Experience with high-throughput web data pipelines.
  • Ability to balance rendering vs fetching decisions.
  • Knowledge of web scraping ethics and legal guidelines.

Responsibilities

  • Design, build, and run web data pipelines.
  • Create robust extractors and parsers.
  • Collaborate with AI/Platform teams.

Skills

Scraping knowledge (JS & non-JS)
Proxy management
Python or Java/Kotlin
REST APIs
Data normalization
Monitoring & alerting
Job description
Senior Web Data Acquisition Engineer (Resilient Crawling)

TL;DR Checklist:

[ ] Scrape JS & non-JS (headless, VMs)

[ ] Emulate human like behavior (proxies, rate, CAPTCHA, fingerprints)

[ ] Parsers resilient to layout changes

[ ] Monitor & alert on blocks/errors; retry/backoff

[ ] PythonorJava/Kotlin

[ ] REST APIs creation and consumption

All checked? In that case, send us your CV !
And if you want more details, read on.

About ZEN.COM

ZEN.COM is a smart financial app designed for your everyday life – at home and on the go. We make payments, online shopping, and personal finance management fast, secure, and effortless. With ZEN, you can enjoy cashback on purchases, full control over your spending, and peace of mind thanks to purchase protection. A built-in multi-currency account lets you spend abroad or shop internationally with great exchange rates and no hidden fees.

The Opportunity

We operate with a startup mindset: small, senior, fast-moving. The team is energetic, collaborative, and pragmatic. We useAI at multiple layers(change detection, parser generation, anomaly detection, agentic ops)—you’ll both use and shape these production‑ready applications and learn a lot along the way.

What You’ll Do

Design, build, and runhigh‑throughput web data pipelines across diverse, modern web stacks (rendered and non‑rendered).

Make smartrender vs. fetchdecisions to balance accuracy, latency, and cost.

Createrobust extractors/parsers(HTML/DOM/XPath/JSON) with auto‑healing patterns and clear schemas.

Engineer resilience againstdynamic access controls(rate limits, traffic shaping, session/identity rotation) while following legal/ethical guidelines.

Ownobservability: success rate, freshness, latency, cost per successful unit.

Collaborate with AI/Platform to integrateLLM‑assisted maintenance, anomaly detection, and triage bots.

What You Bring

Practical experience handlingrate limits/challengesand building systems that degrade gracefully.

Solid grounding indata quality: normalization, validation, deduplication, and schema/versioning.

Why Join Us

Impact:Your work becomes the backbone of pricing & availability intelligence used across products and markets.

Learning:Hands‑on withproduction AI(LLMs, embeddings, agents) and modern data engineering.

Team:Talented, supportive peers; high autonomy and ownership; clear problem statements.

Pace & Pragmatism:We ship, measure, and improve.

Nice to Have

Multi‑region traffic management and cost optimization.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.