Site Reliability Engineer - Networking
We are seeking competent candidate joining our Infrastructure Team for the mission building and operating MAS regulated marketplace and clearing house. This role is ideal for someone with a strong foundation in AWS services, infrastructure as code, and cloud security, who is passionate about building scalable, secure, and compliant cloud environments.
In this role, you will work alongside experienced engineers in a collaborative and demanding environment, contributing to the development and operation of mission-critical platforms that support real-time trading and clearing services. You will also be instrumental in ensuring system reliability, scalability, and performance across our technology stack.
Key Responsibilities
Cloud Infrastructure Engineering
- Design, implement, and manage scalable AWS infrastructure usingTerraform.
- Architect secure and efficient network topologies usingVPCs, subnets, route tables, and security groups.
- ManageAWS Control Towerfor multi-account governance and compliance.
- Deploy and manageKubernetes (K8s)clusters for container orchestration.
- Integrate and maintainElastic Stackfor observability and monitoring.
- Configure and manageCloudflarefor DNS, WAF, and edge security.
Infrastructure & Security Operations
- Own day-to-dayinfrastructure operations, including monitoring, patching, and performance tuning.
- Implement and maintainCloud Security Posture Management (CSPM)tools to ensure continuous compliance and risk visibility.
- Identify, prioritize, and remediatevulnerabilitiesacross cloud workloads and infrastructure.
- Collaborate with security teams to enforce best practices in IAM, encryption, and data protection.
- Participate in incident response and root cause analysis for infrastructure-related issues.
Automation & Collaboration
- Automate infrastructure provisioning and configuration using CI/CD pipelines.
- Work closely with application development teams to support application delivery and platform reliability.
- Document infrastructure designs, operational procedures, and security controls.
Required Qualifications
- 3+ years of hands-on experience with AWS cloud services.
- Proficiency in Terraform and infrastructure as code principles.
- Experience with Cloudflare, AWS Control Tower, and Kubernetes.
- Strong understanding of AWS observability.
- Proven experience in network design and security group/subnet architecture.
- Familiarity with CSPM tools.
- Experience with vulnerability management and remediation workflows.
- Strong scripting skills (e.g., Bash, Python) and CI/CD tooling.
- Experience with Ansible and Packer for automation and image creation.
- Excellent troubleshooting and communication skills.
Preferred Qualifications
- Experience with GitOps tools (e.g., ArgoCD, Flux).
- Knowledge of compliance frameworks (e.g., CIS, NIST, ISO 27001).
- Familiarity with container security and runtime protection tools.
- Hands-on experience withPagerDutyor similar incident response platforms.