Overview
Cast AI is the leading Application Performance Automation (APA) platform, enabling customers to cut cloud costs, improve performance, and boost productivity – automatically. Built originally for Kubernetes, Cast AI delivers real-time, autonomous optimization across any cloud environment. The platform analyzes workloads, rightsizes resources, and rebalances clusters without manual intervention, ensuring applications run faster, more reliably, and more efficiently. Cast AI is headquartered in Miami, Florida with employees in more than 32 countries and supports customers across major cloud, hybrid, and on-premises environments. Over 2,100 companies rely on Cast AI.
What’s next? Supported by a $108M Series C, we’re doubling down on making APA the standard for DevOps and MLOps. We are hiring across multiple teams!
Responsibilities
As a Senior Software Engineer, you will have the opportunity to work on different key features of our product. We are currently hiring Senior Software Engineers for the following teams:
Reporting
- Builds a scalable reporting system that ingests millions of rows per second into our time-series databases, providing insights into cost savings, workload efficiencies, and Cast AI automation impact.
Pricing
- Drives the synchronization of public and customer cloud resources, availability, and dynamic pricing across all major cloud providers. Enables autoscaling by leveraging discounts, commitments, and cross-cluster tracking to maximize savings. Provides a reliable source of truth for node pricing, resources, components, discounts, and commitments.
Autoscaler
- Automates Kubernetes node autoscaling to optimize clusters, balance workloads, remove underutilized nodes, and dynamically allocate capacity in real-time, thereby reducing cluster costs.
Workload Optimization (WOOP)
- Automates workload resource management by dynamically adjusting resource allocations, helping developers reduce costs and improve application reliability.
AI Enabler
- Helps customers deploying and managing LLMs in their Kubernetes cluster and optimizes workloads by providing cost visibility and intelligent routing for LLM requests to cost-effective compute resources.
- An intelligent agentic system that detects application performance issues and proactively resolves them. By integrating with observability stacks and Kubernetes, APA automates optimization, scaling, security, and recovery.
Sec Posture
- Builds a Kubernetes Security product that surfaces threats by ingesting data from vulnerability advisories, image scans, configurations, and runtime behavior.
Wire
- The De-facto Team builds and maintains essential services such as authorization, notifications, audit logs, and feature flags to enable secure, scalable use of the Cast AI platform. Focus areas include SSO, granular permissions, and billing for enterprise customers and internal teams.
Tools we use daily
Languages: GoLang (primary), Python (secondary for some cases)
Cloud & Orchestration: Kubernetes, AWS, GCP, Azure
Databases & Storage: PostgreSQL, Cloud Object Storage
Messaging & APIs: GCP Pub / Sub, gRPC for internal communication, REST for public APIs
Observability: Prometheus, Grafana, Loki, Tempo
CI / CD & GitOps: GitLab CI with ArgoCD
Qualifications
- Strong software engineering skills with experience in distributed systems and backend development (ideally GoLang, but not a hard requirement if you’re willing to transition)
- Strong debugging, optimization, and performance-tuning skills
- Deep understanding of cloud platforms with hands-on experience in AWS, Google Cloud Platform (GCP), Microsoft Azure, and Kubernetes
- CI / CD and DevOps practices experience
- Strong English skills, both verbal and written
- Ability to work independently and collaboratively within a team
- Startup mindset: adaptable, proactive, and comfortable with ambiguity
- Proactive, problem-solving mindset with a 22yes we can2 attitude
What’s in it for you?
- Competitive salary (€6,500 - €9,000 gross, depending on experience)
- Flexible, remote-first global environment
- Equity options
- Private health insurance
- Fast-paced workflow with most feature projects completed in 1 to 4 weeks
- Spend 10% of work time on personal projects or self-improvement
- Learning budget for professional and personal development, including access to international conferences and courses
- Annual hackathon to spark ideas and strengthen team bonds
- Team-building budget and company events
- Equipment budget to ensure you have what you need
- Extra days off for work-life balance
End-of-description note: J-18808-Ljbffr