Enable job alerts via email!

AI Infrastructure Architecture

Ekfrazo Technologies Private Limited

Johannesburg

On-site

ZAR 600,000 - 1,000,000

Full time

5 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Ekfrazo Technologies is seeking an AI Infrastructure Architect to drive their enterprise-wide digital transformation. In this essential role, you will develop and implement robust AI infrastructure solutions that support AI model training, deployment, and compliance. Candidates should possess strong expertise in cloud computing and be adept at optimizing AI workloads. Join a leadership team dedicated to innovation and operational excellence in a friendly working environment.

Benefits

Work alongside & learn from best in class talent
Join a well-known brand within Telecommunications

Qualifications

  • 8+ years in IT Infrastructure, Cloud Computing, or AI Systems Architecture.
  • Expertise in managing AI workloads in cloud and hybrid environments.
  • Hands-on experience with AI infrastructure design and cloud AI solutions.

Responsibilities

  • Develop scalable AI infrastructure strategy aligned with digital transformation goals.
  • Design and implement automated MLOps pipelines for efficient AI operations.
  • Enhance AI data storage and processing capabilities for real-time analytics.

Skills

AI Infrastructure & Cloud Architecture
AI Model Deployment & MLOps
Kubernetes & AI Workload Orchestration
Cloud Platforms (AWS, Azure, GCP)
Infrastructure as Code (Terraform, Ansible)
AI Security & Compliance
Performance Tuning for AI Systems

Education

Bachelor’s degree in Computer Science, IT Infrastructure Engineering, or related field
Certifications in cloud computing (AWS, Azure, GCP), MLOps, or DevOps

Job description

Your trusted digital partner for tailored IT solutions with expertise in web & app dev, e-comm, digital marketing, branding, & web security. We enable your overall success on every online platform & a complete digital transformation. At Ekfrazo, we acknowledge the minutest of details that go into every work that we do. To accomplish this, we take all the responsibilities of delivering a perfected work on time which includes Software & Product Development, Web & Mobile Apps, Branding, Digital Marketing, IT Consulting, Recruitment & Staffing. Our areas of expertise include delivering the best possible work based on the ideas and the requirements that you have with a blend of our creativity and skills, be it in any domain.

The Role

You will be responsible for :

Key Activities & Responsibilities

  • Develop a scalable AI infrastructure strategy aligned with enterprise-wide digital transformation goals, ensuring a high-performance and secure foundation for AI workloads
  • Architect AI infrastructure solutions across cloud and on-prem environments, optimizing flexibility, performance, and security to meet evolving business needs
  • Implement and oversee infrastructure for AI model training, inferencing, and execution frameworks that enhance processing efficiency and overall model performance
  • Design and implement AI infrastructure solutions that scale seamlessly to accommodate enterprise-wide AI adoption, support business growth, ensure high availability, and incorporate future advancements in AI and cloud computing
  • Establish AI infrastructure security SOPs, access control mechanisms, and compliance frameworks to mitigate risks and ensure data protection
  • Design and implement automated MLOps pipelines that streamline AI model deployment, monitoring, retraining, and governance for efficient AI operations
  • Deploy Infrastructure as Code (IaC) solutions using Terraform, Ansible, or equivalent tools to automate AI infrastructure provisioning and scaling
  • Optimize high-performance computing environments, ensuring efficient utilization of GPU, CPU, and storage resources for AI and ML workloads
  • Manage AI workload orchestration using Kubernetes and containerization technologies to enhance scalability and performance across distributed AI environments
  • Enhance AI data storage and processing capabilities by optimizing pipelines, retrieval mechanisms, and data engineering strategies for real-time analytics
  • Work closely with IT and Software COE teams to integrate AI infrastructure seamlessly with enterprise systems and applications
  • Develop AI infrastructure cost optimization strategies, balancing performance, scalability, and budget constraints to maximize ROI
  • Deploy real-time AI infrastructure monitoring tools to continuously track system health, identify performance bottlenecks, detect potential anomalies, and implement proactive optimization measures to enhance system reliability and efficiency
  • Implement AI governance and model version control policies, ensuring regulatory compliance, model integrity, security, ethical AI practices, and proactive risk mitigation
  • Stay ahead of AI infrastructure innovations, emerging cloud technologies, and industry best practices to enhance enterprise AI capabilities

Ideal Profile

Skills and Experience

Education :

  • Bachelor’s degree in Computer Science, IT Infrastructure Engineering, or a related field
  • Certifications in cloud computing (AWS, Azure, GCP), MLOps, or DevOps (preferred)

Experience :

  • 8+ years of experience in IT Infrastructure, Cloud Computing, or AI Systems Architecture
  • Hands on experience in AI infrastructure design, cloud-based AI solutions, or MLOps
  • Expertise in managing AI workloads across cloud and hybrid environments.
  • Proven track record in scaling AI infrastructure for large enterprises
  • Strong experience in Kubernetes, containerization, and orchestration tools
  • Experience in optimizing AI workloads for performance and cost efficiency

Skills :

  • Expertise in AI Infrastructure & Cloud Architecture
  • Strong Understanding of AI Model Deployment & MLOps
  • Advanced Proficiency in Kubernetes & AI Workload Orchestration
  • Hands-on Experience with Cloud Platforms (AWS, Azure, GCP)
  • Proficiency in Infrastructure as Code (Terraform, Ansible)
  • AI Security & Compliance Knowledge
  • AI Infrastructure Cost Optimization Strategies
  • Performance Tuning for AI Systems & Workloads

What's on Offer?

  • Work alongside & learn from best in class talent
  • Join a well known brand within Telecommunications
Create a job alert for this search
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.