Site Reliability Engineer (SRE) – Blockchain Infrastructure
We are looking for a Site Reliability Engineer (SRE) to join our infrastructure team. Our mission is to ensure reliability, performance, and scalability of blockchain APIs and services that power users worldwide.
Your role
- Support and evolve our monitoring and alerting systems for blockchain APIs.
- Troubleshoot incidents, perform root cause analysis, and improve system reliability.
- Automate deployment and operational workflows for blockchain nodes and services.
- Collaborate with developers and infrastructure engineers to ensure smooth delivery of new features.
- Help optimize system performance and resource usage across bare metal and cloud environments.
What we’re looking for
- Experience with monitoring/observability stacks (Prometheus, Grafana, Loki, VictoriaMetrics, or similar).
- Experience with automated testing in infrastructure and/or services.
- Basic programming skills in Go and/or Python.
- Familiarity with containers and orchestration (Docker, Kubernetes is a plus).
- Hands‑on experience with CI/CD pipelines (GitHub Actions, ArgoCD, etc.).
- Experience with high‑availability systems and troubleshooting performance issues.
- Interest in blockchain technologies and willingness to dive into new protocols.
Nice to have
- Previous experience running blockchain nodes.
- Linux systems administration skills.
- Familiarity with infrastructure‑as‑code (Terraform, Ansible, or similar).
- Knowledge of networking (load balancing, DNS, firewalls, BGP).