A Message from Our CEO:
We are seeking a highly skilled Cloud Engineer & Infrastructure Security professional to design, build, and secure our hybrid infrastructure (cloud + on-prem).
The ideal candidate will have deep experience with Kubernetes, Terraform, Helm, and a strong background in infrastructure security, DevSecOps, and on-prem deployments.
This role is critical for architecting scalable, secure, and observable infrastructure supporting mission‑critical applications and LLM (Large Language Model) workloads.
Your Responsibilities:
- Infrastructure & Cloud Management
Deploy and manage Kubernetes clusters (cloud & on‑prem) using Terraform and Helm.
- Design Secure Network Topology (VPCs, VPNs, firewalls).
- Infrastructure Security & Zero Trust
Implement zero trust models, IAM, and least‑privilege access. Enforce security policies, micro‑segmentation, and secrets management.
- DevSecOps & CI/CD Security
Integrate security scanning, SBOM, and policy‑as‑code into pipelines. Automate compliance and security checks during build and deploy.
- LLM & Hybrid Deployments
Build and maintain infrastructure for LLM workloads (vLLM, KServe). Support hybrid cloud and on‑prem deployments ensuring consistency and security.
- Monitoring & Observability
Implement monitoring, logging, and alerting using Grafana, Azure Monitor, Prometheus. Maintain dashboards, SLIs/SLOs, and performance metrics.
- Linux & Automation
Harden Linux systems, automate routine tasks, and support incident response. Develop scripts and tools to streamline operations.
- Collaboration & Strategy
Partner with engineering, security, and operations teams. Mentor teams on cloud best practices and emerging technologies.
What we look for:
- Strong experience with Kubernetes, including cluster provisioning, scaling, and security.
- Proficient in Terraform and Helm for infrastructure‑as‑code and deployment automation.
- Expertise in infrastructure security, zero trust models, and IAM best practices.
- Hands‑on experience with DevSecOps: security scanning, SBOM generation, secrets management, and policy‑as‑code.
- Solid understanding of cloud networking: VPC design, VPN, and firewall configuration.
- Experience with hybrid or on‑prem deployments alongside cloud environments.
- Skilled in Linux administration, scripting, and automation for operational efficiency.
- Familiarity with monitoring and observability tools (Azure Monitor, Grafana, Prometheus).
- Experience building and managing infrastructure for LLM or AI workloads (vLLM, KServe).
Nice - to - have:
- Cloud and security certifications (e.g., CKA/CKAD, Terraform Associate, CISSP).
- Experience with GitOps workflows (Argo CD, Flux) and CI/CD security pipelines.
- Knowledge of policy frameworks (OPA, Gatekeeper, Kyverno) and workload identity systems (SPIFFE/SPIRE).
- Familiarity with GPU/accelerator‑based infrastructure for ML/LLM workloads.
- Background in SRE practices, including SLO/SLI design and incident response.
- Contributions to open‑source cloud, DevSecOps, or LLM infrastructure projects.
What we offer:
- Competitive salary and performance‑based bonuses.
- Fully remote, flexible work environment.
- Modern laptop and hardware provided by us.
- Specialized training in AI, automation, and digital productivity tools.
- Global exposure—collaborate with top‑tier founders and fast‑growing startups.
- Continuous learning and career growth opportunities in an international environment.