ROLE & RESPONSIBILITIES:
Own and improve operational reliability and availability of critical applications across their lifecycle.
Design, implement and maintain automation for infrastructure provisioning and deployments (IaC).
Build and optimize CI/CD pipelines to enable fast, reliable and repeatable releases.
Develop monitoring, alerting and observability solutions to detect and prevent incidents.
Lead incident response for escalated production issues and drive root cause analysis and remediation.
Implement and enforce operational best practices, runbooks and playbooks for the team.
Collaborate closely with development teams to improve observability, testability and deployability of
applications.
Drive performance tuning, capacity planning and availability engineering activities.
Plan and execute upgrades, migrations and infrastructure improvements with minimal downtime.
Ensure security, compliance and certificate management practices are applied to platforms and services.
Mentor and coach junior DevOps and operations team members to uplift team capability.
Contribute to business case development, risk identification and operational documentation
QUALIFICATIONS/EXPERIENCE:
IT degree or equivalent qualification and solid background in systems engineering, DevOps or cloud operations.
Minimum 5+ years’ experience in DevOps, cloud infrastructure or reliability engineering roles.
Relevant cloud certification(s) and proven experience implementing automation and monitoring at enterprise
scale (AWS/Azure preferred)
Submit your CV to: recruitment@imizizi.co.za and Subject line Role title