Description:
We are looking for a Cloud Infrastructure Manager to lead and scale the engineering capability behind our cloud platforms. This role forms a key part of the Technology Solutions leadership team, sitting alongside our Cloud Architect and reporting to the Head of Technology Solutions.
Responsibilities:
Lead, mentor, and grow a high-performing team of Cloud Engineers, SREs, and DevSecOps experts who thrive on innovation and impact. Drive the execution of our Cloud Infrastructure roadmap, aligning with Mukurus strategic platform and business goals. Take ownership of Mukurus AWS-based cloud environments defined as infrastructure-as-code with Terraform and containerised with Kubernetes ensuring performance, cost-efficiency, and resilience at every stage of the SDLC. Champion DevOps culture across engineering, fostering collaboration, shared ownership, and continuous delivery practices. Ensure uptime and recovery goals are met, and oversee compliance with RTOs, RPOs, patching, monitoring, and alerting standards. Partner closely with our Cloud Architect to deliver well-architected, observable, and cost-optimised infrastructure solutions. Collaborate across the business from Platform Engineering and Product, to Software Engineering, Security, and Governance to enable cross-functional success. Implement and maintain controls aligned to compliance frameworks like PCI-DSS and ISO27001. Build and manage tooling across CI/CD, observability, documentation, and cloud cost monitoring. Drive innovation by enabling teams with the right tools, autonomy, and environment to experiment, iterate, and deliver value. Manage team capacity, budgets, and ongoing capability growth in line with emerging technologies and business needs.Requirements:
Grade 12 or equivalent. Tertiary qualification in a relevant field (desirable). Cloud Infrastructure Design & Operations (AWS). Infrastructure-as-Code (IaC) with Terraform. Containerisation with Kubernetes. Linux system administration and scripting. CI/CD pipeline design and management. Incident and outage management.t Cloud cost optimisation strategies. Cloud security best practices and governance. Compliance frameworks (PCI-DSS, ISO27001). Cloud monitoring, alerting, and observability tools. Infrastructure monitoring and logging solutions. Cloud environment performance tuning. Resilience and disaster recovery planning. Collaboration with cross-functional teams (Platform, Security, Product, Software Engineering). DevOps culture and practices (shared ownership, continuous delivery). Automation of infrastructure deployment and management. Budget and capacity planning Agile practices and mindset.
29 May 2025;
from:
gumtree.co.za