Job Search and Career Advice Platform

Aktiviere Job-Benachrichtigungen per E-Mail!

SRE / Platform Engineer (Remote)

Antler

Remote

EUR 60.000 - 80.000

Vollzeit

Vor 2 Tagen
Sei unter den ersten Bewerbenden

Erstelle in nur wenigen Minuten einen maßgeschneiderten Lebenslauf

Überzeuge Recruiter und verdiene mehr Geld. Mehr erfahren

Zusammenfassung

A dynamic technology startup is seeking a Site Reliability Engineer to ensure the reliability and performance of its core AI systems. You will design and maintain automation tools, collaborate with cross-functional teams, and refine incident response practices. Ideal candidates have over 5 years of experience in Infrastructure Engineering, expertise with Infrastructure as Code and major cloud platforms, and strong programming skills. The role offers remote work with a competitive equity package and the chance to impact a growing, innovative startup.

Leistungen

Equity compensation
Exciting workload with ownership
Remote working environment

Qualifikationen

  • 5+ years of experience in Site Reliability Engineering or Infrastructure Engineering.
  • Deep expertise with Infrastructure as Code tools like Terraform or Pulumi.
  • Strong experience with observability platforms and incident response tooling.
  • Proficiency with major cloud platforms like GCP, AWS, or Azure.
  • Strong programming and scripting skills for automation and tooling.

Aufgaben

  • Own the reliability, scalability, and performance of Peec AI's core systems.
  • Design, build, and maintain tooling and monitoring for services.
  • Partner with product and engineering teams on new feature reliability.
  • Develop and refine incident response practices.
  • Identify and address bottlenecks and operational inefficiencies.

Kenntnisse

Site Reliability Engineering
Infrastructure Engineering
Infrastructure as Code
Observability platforms
Cloud platforms
Programming skills (TypeScript, Python)
CI/CD
Kubernetes

Tools

Terraform
Datadog
PagerDuty
Jobbeschreibung
Location

Worldwide (±3 hours CET)

Employment Type

Full time

Location Type

Remote

Department

Engineering

What you’ll do
  • Own the reliability, scalability, and performance of Peec AI’s core systems and infrastructure

  • Design, build, and maintain the tooling, automation, and monitoring that keep our services fast, secure, and highly available

  • Partner closely with product and engineering teams to ensure new features are reliable, observable, and easy to operate from day one

  • Develop and refine incident response practices, ensuring issues are triaged quickly and resolved with minimal user impact

  • Proactively identify and address bottlenecks, single points of failure, and operational inefficiencies across the stack

  • Champion operational excellence and a culture of reliability, driving best practices across the engineering organization

What we’re looking for
  • 5+ years of experience in Site Reliability Engineering, Infrastructure Engineering, or similar roles supporting production systems at scale

  • Deep expertise with Infrastructure as Code tools (Terraform, Pulumi, CloudFormation, etc.)

  • Strong experience with observability platforms (e.g., Datadog, Sentry, Prometheus, Grafana) and incident response tooling (PagerDuty, Incident.io, or similar)"

  • Proven proficiency with major cloud platforms (GCP, AWS, or Azure) and modern distributed systems

  • Strong programming and scripting skills (e.g., TypeScript and Python) for automation and tooling

  • A track record of diagnosing complex system problems and implementing robust, long-term solutions

  • Solid understanding of CI/CD, Kubernetes, containerization, networking, databases, and cloud security principles

  • Excellent problem-solving skills, attention to detail, and a strong commitment to operational excellence

Bonus Points
  • Experience supporting AI/ML workloads or data-intensive systems

  • Prior SRE experience in a high-growth startup or globally distributed infrastructure environment

  • Familiarity with zero-downtime migrations, multi-region architectures, or compliance frameworks

What we offer
  • Exciting and challenging work with real impact and ownership at one of Europe’s fastest-growing Series A startups

  • Aggressive equity compensation package

  • Remote working (applicants must be located within ±3 hours of the Berlin (CET) time)

Hol dir deinen kostenlosen, vertraulichen Lebenslauf-Check.
eine PDF-, DOC-, DOCX-, ODT- oder PAGES-Datei bis zu 5 MB per Drag & Drop ablegen.