About the group:
Cognizant’s Cloud, Infrastructure, and Security Services Practice (CIS) embraces digital transformation by driving core modernization holistically across layers. We help customers transform infrastructure and workplace to meet the constantly evolving needs of the digital era. Our broad approach delivers key results for our customers through cloud-driven modernization and workplace and operational transformation, enabling them to run their business in a secure environment.
Role Title
Container Platform Engineer
Location
Toronto, ON
Job Description
Mandatory technical skills include:
Strong to expert knowledge of supporting container platforms, including GKE workloads. The candidate may have worked on AWS, Azure, or GCP, but must have solid Kubernetes experience.
GKE Specifics:
- 5+ years of experience with container technologies such as Kubernetes, Google Kubernetes Engine (GKE), AKS, Docker, Podman.
- Familiarity with Cloud PaaS services and Anthos Service Mesh.
- Experience developing CI/CD pipelines using technologies such as GitHub Actions, Jenkins.
- Strong understanding of network security principles, encryption protocols and identity management concepts.
- Strong understanding of Kubernetes resource types (e.g., cluster roles, services, deployments).
- Experience developing Helm Charts.
- Experience implementing Kubernetes technologies such as network policies, service mesh, certificate manager, ingress controllers.
- Experience developing compliance policies/scripts using tools such as Google Org Policy, Aquasec, Wiz.
- Experience supporting Cloud services such as GKE, BigQuery, Cloud SQL (SQL/PostgreSQL), Redis, Cassandra, Bigtable, Cloud Filestore, Persistent Storage, Apigee, Kafka, Dataflow, GCS.
- Knowledge of monitoring tools such as Dynatrace, Datadog, etc.
- Experience supporting an Azure Public Cloud environment is not required but would be valuable.
- Thorough problem determination skills to troubleshoot and resolve business application issues.
- Knowledge of OS technologies (Red Hat Linux, Windows).
- Understanding of DevOps and Agile practices.
- Working knowledge of Local Area Networks (LAN) and Wide Area Networks (WAN).
- Comfortable with working in a rapidly changing, technically complex environment.
- Knowledge of scripting languages and tools such as Python, JavaScript, PowerShell, Bash.
- Comfortable with the Agile methodology.
- Responsible for support and processes for GCP containers, PaaS, IaaS, and related services from DEV through PROD, ensuring the quality, performance, and availability of Public Cloud services (GCP).
- The successful candidate must have a demonstrated ability to learn new technologies and processes, resolve incidents, and solve problems by collaborating with others.
- The candidate will be responsible for providing operational support for platforms and infrastructure hosted on TD's GCP Public Cloud. The role requires familiarity with ITIL processes (change, incident, and problem management) and availability for off-hours escalated support.
- Manage non-standard or complex incidents, from P1 and P2 (major incidents) through P3 and P4, as well as service requests.
- Drive root cause analysis on recurring incidents to help prevent issues in the future.
- Ensure customer service satisfaction and enable continuous improvements.
- Oversee vendor service delivery and escalations.
- Provide operational consultancy for future-state technologies.
- Stay updated with emerging security threats and industry best practices related to container security and cloud-native technologies.
- Participate in incident response activities, security incident investigations, and post-mortem analysis to improve incident handling processes.
- Monitor containerized environments to optimize performance and utilization.
- Critical thinker with strong research and analytics skills.
- Professional certifications such as Certified Kubernetes Administrator (CKA), Certified Kubernetes Security Specialist (CKS), HashiCorp Certified: Terraform Associate.