Job Title: Confluent Kafka Administrator
Location: Remote / Onsite
Experience: 7+ Years
Employment Type: Full-Time
Role Overview
We are looking for an experienced Confluent Kafka Administrator to manage and support enterprise-grade Kafka environments. The ideal candidate will be responsible for installing, configuring, monitoring, and troubleshooting Kafka clusters to ensure stability, performance, and security.
Key Responsibilities
Kafka Cluster Management
- Install, configure, and maintain Confluent Kafka clusters (on-premises or cloud).
- Manage Kafka brokers, ZooKeeper ensembles, topics, partitions, replication, and retention policies.
- Perform version upgrades, patches, expansions, and cluster migrations.
Operations & Monitoring
- Monitor cluster performance, throughput, consumer lag, and system health.
- Automate operational tasks using scripting (Python/Bash).
- Implement monitoring solutions using Prometheus, Grafana, Datadog, or similar tools.
Troubleshooting & Optimization
- Diagnose and resolve production issues such as consumer lag, broker failures, and replication delays.
- Tune performance for brokers, topics, partitions, and producers/consumers.
Security & Compliance
- Implement and manage security features, including RBAC, SSL/TLS, ACLs, and Kerberos.
- Ensure compliance with internal governance and data protection standards.
Documentation & Support
- Maintain documentation for configurations, processes, and SOPs.
- Provide on-call support for high-priority incidents.
Required Skills & Qualifications
- 7+ years of IT experience, including at least 3 years dedicated to Kafka administration.
- Strong hands-on experience with the Confluent Kafka platform.
- Expertise in troubleshooting Kafka issues and optimizing performance.
- Experience with Linux-based systems (RHEL/Ubuntu).
- Proficiency in scripting (Python, Bash).
- Experience with monitoring and logging tools (ELK, Grafana, Prometheus).
- Knowledge of networking, security, and distributed systems fundamentals.
Preferred Qualifications
- Experience with managed Kafka or Kafka-compatible cloud services (Amazon MSK, Azure Event Hubs, Confluent Cloud).
- Knowledge of CI/CD and automation frameworks (Ansible, Terraform).
- Experience with container platforms (Docker/Kubernetes).