Site Reliability Engineer (SRE) - Platform Infrastructure team (100% Remote - USA)
Join to apply for the Site Reliability Engineer (SRE) - Platform Infrastructure team (100% Remote - USA) role at Hopper.
About The Job
Hopper is seeking a Senior Site Reliability Engineer to join our Platform Infrastructure team. This team builds and maintains the cloud foundation powering products used by millions of travelers worldwide.
Our mission is to empower engineers across Hopper to ship fast, stay resilient, and scale effortlessly. If you are passionate about automation, scalability, and improving developer experience, this could be the perfect role for you.
You will help evolve a large-scale, multi-region infrastructure in Google Cloud, supporting hundreds of engineers and multiple product teams. Your contributions will include building automated, self-service platform tools that are secure, reliable, cost-efficient, and user-friendly.
This Role Might Be a Great Fit If You
- Thrive on automating repetitive tasks and creating platform-level solutions.
- Enjoy enabling product teams with intuitive tools for infrastructure and deployment.
- Prefer practical, reliable solutions over over-engineering.
- Care about operational excellence, including scalability, high availability, performance, and cost optimization.
- View developer experience as a product and seek continuous improvement.
What Your Day-to-Day Will Look Like
- Enhance platform tooling to support growth across Hopper.
- Design simple, consistent, and scalable infrastructure workflows.
- Drive automation to reduce manual work and improve reliability.
- Scale infrastructure offerings to meet team needs while maintaining a cohesive platform.
- Participate in incident response as part of a distributed, sustainable on-call rotation.
- Support engineering teams through troubleshooting, answering infrastructure questions, and code reviews.
- Collaborate with a small, high-impact team focused on operational excellence, performance, and developer experience.
Ideal Candidate Profile
- Experience in SRE, DevOps, Software Engineering, or Systems Engineering, with a focus on reliable, scalable infrastructure.
- Strong troubleshooting skills in distributed systems and cloud environments.
- Solid system design skills with an emphasis on simplicity and performance.
- Effective communication skills for cross-team collaboration.
Cloud & Infrastructure Skills
- Hands-on experience with major cloud providers, preferably Google Cloud Platform.
- Proficiency with Infrastructure as Code, ideally Terraform.
- Experience with containers, Kubernetes, Helm, or Kustomize.
- Knowledge of Service Mesh technologies like Istio.
Networking & Security Knowledge
- Understanding of networking fundamentals: DNS, TLS, certificates, ingress controllers.
- Best practices in cloud security, IAM, RBAC, network segmentation.
- Familiarity with authentication and authorization protocols.
Observability & Tooling
- Experience with logs, metrics, tracing, and APM tools (preferably Datadog).
- Knowledge of CI/CD pipelines and deployment automation.
- Familiarity with SQL and NoSQL databases.
Scripting & Automation
- Ability to script in Bash, Python, or similar languages for automation tasks.
Perks and Benefits
- Competitive salary, pre-IPO equity, and funding.
- Unlimited PTO and travel stipend.
- Remote work options, co-working space access, and work-from-home stipend.
- Generous parental leave policies.
- Entrepreneurial culture with open communication.
- Impactful small teams.
- Comprehensive health coverage and insurance benefits.
About Hopper
Hopper aims to be the leading global travel platform, leveraging data and machine learning to offer innovative travel and fintech solutions. Serving hundreds of millions of travelers, Hopper continues to grow through its mobile app, website, and B2B solutions like HTS, partnering with brands like Capital One, Air Canada, and Uber.