Ativa os alertas de emprego por e-mail!

Platform Monitoring Specialist

Phiture

Lisboa

Híbrido

EUR 50 000 - 70 000

Tempo integral

Hoje
Torna-te num dos primeiros candidatos

Resumo da oferta

A leading tech firm in Lisbon is seeking a Platform Monitoring Specialist to design, scale, and maintain its monitoring stack. Candidates should have proven experience with observability tools, especially Datadog, and a strong background in AWS services and CI/CD pipelines. The position offers a competitive salary, flexible work hours, and a culture of innovation and collaboration.

Serviços

Competitive salary
Flexible benefits package
Portuguese Health Insurance
Unlimited days off

Qualificações

  • Proven experience in observability or monitoring.
  • Strong focus on incident response frameworks.
  • 4+ years in a platform, SRE, or observability role.

Responsabilidades

  • Own and evolve monitoring, alerting, and observability infrastructure.
  • Collaborate with engineering teams to define metrics and logs.
  • Build and maintain dashboards and alerts using Datadog.
  • Act as first responder during platform incidents.

Conhecimentos

Platform, DevOps, or Site Reliability Engineer experience
Expertise with Datadog or equivalent
Experience with AWS services
Linux system administration
Infrastructure-as-Code knowledge
CI/CD pipelines and automation
Strong troubleshooting skills
Fluent in English

Ferramentas

Datadog
Terraform
Prometheus
Grafana
New Relic
GitHub Actions
Descrição da oferta de emprego
Overview

At Bloq.it, we’ve created the world’s leading smart locker solution. Solving online deliveries by enabling everyone to participate easily, reducing delivery costs and making them more sustainable. We’re quickly expanding, and after growing at 1000% for three years in a row, we’re now the fastest-growing Smart Locker company in the world and one of the fastest growing scale-ups in Europe.

We are in search of a Platform Monitoring Specialist to join our innovative team as our new #bloqstar. In this role, you\u2019ll play a crucial role in designing, scaling, and maintaining our monitoring stack, ensuring deep visibility into our hybrid cloud/edge systems and helping teams anticipate and resolve issues before they impact our customers.

What You’ll Be Doing
  • Own and evolve our monitoring, alerting, and observability infrastructure, ensuring coverage across all environments (Cloud, Lockers, CI/CD pipelines);
  • Collaborate with engineering teams to define metrics, logs, and tracing strategies that reflect business-critical SLIs/SLOs;
  • Build and maintain dashboards and alerts using Datadog, driving insights for engineering, QA, and operations teams;
  • Act as first responder and escalation point during platform incidents, coordinating diagnostics and driving post-mortems;
  • Develop automated health checks, alert tuning processes, and data integrity checks for critical services;
  • Support continuous improvement of monitoring playbooks, runbooks, and documentation.
What You’ll Bring To The Table
  • Proven experience as a Platform, DevOps, or Site Reliability Engineer with a specialized focus in observability or monitoring;
  • Solid expertise with Datadog (or equivalent platforms like Prometheus, Grafana, New Relic);
  • Strong experience with AWS services, Linux system administration, and Infrastructure-as-Code (e.g., Terraform, CDK);
  • Proficiency with CI/CD pipelines and automation (GitHub Actions preferred);
  • Experience working with logging, tracing, and metric systems, and designing high-signal alerting rules;
  • Strong troubleshooting and problem-solving skills in production environments;
  • Fluent in English, both written and spoken.
It Would Be Great If You Would Also Have
  • 4+ years in a platform, SRE, or observability role in a production-grade environment;
  • Familiarity with Atlas MongoDB, MQTT brokers, and distributed edge devices;
  • Experience defining SLIs/SLOs/SLAs and implementing reliability guardrails;
  • Knowledge of incident response frameworks and root cause analysis methodologies.
Why join us?
  • The opportunity to join our Software team and play a pivotal role in building and improving our infrastructure, while contributing to innovative solutions that redefine Bloq.it\'s revolution in the smart locker industry;
  • A dynamic and fast-paced work environment with a culture of innovation, collaboration, and continuous learning;
  • Competitive salary and flexible benefits package, tailored to your experience and skills;
  • Eligibility for performance-based bonus, tied to your results and designed to reward your impact;
  • Work how you work best - we offer a remote-friendly policy and flexible hours so you can stay productive and keep life balanced;
  • Portuguese Health Insurance;
  • Unlimited days off (subject to manager approval).

Ready to join the revolution?

At Bloq.it, we provide end-to-end solutions for Smart Lockers, and our software ecosystem, Bloq.it OS, is the leading tech solution available in the market. We\u2019ve had the pleasure of working with some of the biggest names in e-commerce, logistics, and retail in Europe, such as Vinted and DHL. We have recently become the fastest-growing Smart Locker company in the world. Before Bloq.it the industry had stagnated and lacked innovation and good products. We strive to have the same impact in this industry as Tesla has had on the car industry. We believe that Smart Lockers are a big part of the future for everyone, and we want to play our part in making sure that Smart Lockers become as mainstream as the mobile phone.

Obtém a tua avaliação gratuita e confidencial do currículo.
ou arrasta um ficheiro em formato PDF, DOC, DOCX, ODT ou PAGES até 5 MB.