Description:
A telecommunication company based in Midrand is seeking a Junior Infrastructure Monitoring & Support Analyst to join their operations team.
This is a first-line support role focused on monitoring cloud and on-prem infrastructure, proactively identifying system issues, and responding to alerts and user-reported incidents. The successful candidate will be responsible for maintaining operational visibility across platforms such as AWS, GCP, and VMware, and services including Internally Developed Apps, RADIUS, mobile applications, and general service endpoints. This role will serve as the first point of contact for operational support and may grow into the broader support rotation.
Key Responsibilities
- Monitor health and availability of infrastructure across: AWS, GCP, and VMware.
- Monitor key services for uptime, latency, and anomalies: Feasibility API, General APIs, Mobile App, RADIUS.
- Triage and log incoming incidents from alerting systems or support channels.
- Escalate service-affecting issues to the relevant teams with appropriate urgency.
- Maintain incident timelines, and escalation artifacts.
- Execute defined system tests and health checks (pre-release, post-deploy).
- Contribute to runbooks and playbooks for common alert patterns.
Requirements:
A telecommunication company based in Midrand is seeking a Junior Infrastructure Monitoring & Support Analyst to join their operations team.
This is a first-line support role focused on monitoring cloud and on-prem infrastructure, proactively identifying system issues, and responding to alerts and user-reported incidents. The successful candidate will be responsible for maintaining operational visibility across platforms such as AWS, GCP, and VMware, and services including Internally Developed Apps, RADIUS, mobile applications, and general service endpoints. This role will serve as the first point of contact for operational support and may grow into the broader support rotation.
Key Responsibilities
- Monitor health and availability of infrastructure across: AWS, GCP, and VMware.
- Monitor key services for uptime, latency, and anomalies: Feasibility API, General APIs, Mobile App, RADIUS.
- Triage and log incoming incidents from alerting systems or support channels.
- Escalate service-affecting issues to the relevant teams with appropriate urgency.
- Maintain incident timelines, and escalation artifacts.
- Execute defined system tests and health checks (pre-release, post-deploy).
- Contribute to runbooks and playbooks for common alert patterns.
A telecommunication company based in Midrand is seeking a Junior Infrastructure Monitoring & Support Analyst to join their operations team.
This is a first-line support role focused on monitoring cloud and on-prem infrastructure, proactively identifying system issues, and responding to alerts and user-reported incidents. The successful candidate will be responsible for maintaining operational visibility across platforms such as AWS, GCP, and VMware, and services including Internally Developed Apps, RADIUS, mobile applications, and general service endpoints. This role will serve as the first point of contact for operational support and may grow into the broader support rotation.
Key Responsibilities
- Monitor health and availability of infrastructure across: AWS, GCP, and VMware.
- Monitor key services for uptime, latency, and anomalies: Feasibility API, General APIs, Mobile App, RADIUS.
- Triage and log incoming incidents from alerting systems or support channels.
- Escalate service-affecting issues to the relevant teams with appropriate urgency.
- Maintain incident timelines, and escalation artifacts.
- Execute defined system tests and health checks (pre-release, post-deploy).
- Contribute to runbooks and playbooks for common alert patterns.
- Monitor health and availability of infrastructure across: AWS, GCP, and VMware.
- Monitor key services for uptime, latency, and anomalies: Feasibility API, General APIs, Mobile App, RADIUS.
- Triage and log incoming incidents from alerting systems or support channels.
- Escalate service-affecting issues to the relevant teams with appropriate urgency.
- Maintain incident timelines, and escalation artifacts.
- Execute defined system tests and health checks (pre-release, post-deploy).
- Contribute to runbooks and playbooks for common alert patterns.
Skills & Experience
Essential:
- Understanding of cloud infrastructure concepts (especially AWS or GCP).
- Familiarity with monitoring dashboards and logging tools.
- Excellent communication and problem-solving skills.
- Ability to stay calm under pressure and respond rapidly to incidents.
- Basic understanding of networking, API health, and authentication flows (e.g., RADIUS).
- Must have achieved 80 % or higher in Matric Mathematics.
Desirable:
- Exposure to operational support or NOC environment.
- AWS Cloud Practitioner / GCP Associate Cloud Engineer certification.
- Familiarity with Prometheus/Grafana or similar alerting systems.
- Experience in structured system testing (QA/UAT).
Skills & Experience
Essential:
- Understanding of cloud infrastructure concepts (especially AWS or GCP).
- Familiarity with monitoring dashboards and logging tools.
- Excellent communication and problem-solving skills.
- Ability to stay calm under pressure and respond rapidly to incidents.
- Basic understanding of networking, API health, and authentication flows (e.g., RADIUS).
- Must have achieved 80 % or higher in Matric Mathematics.
Desirable:
- Exposure to operational support or NOC environment.
- AWS Cloud Practitioner / GCP Associate Cloud Engineer certification.
- Familiarity with Prometheus/Grafana or similar alerting systems.
- Experience in structured system testing (QA/UAT).
- Understanding of cloud infrastructure concepts (especially AWS or GCP).
- Familiarity with monitoring dashboards and logging tools.
- Excellent communication and problem-solving skills.
- Ability to stay calm under pressure and respond rapidly to incidents.
- Basic understanding of networking, API health, and authentication flows (e.g., RADIUS).
- Must have achieved 80 % or higher in Matric Mathematics.
- Exposure to operational support or NOC environment.
- AWS Cloud Practitioner / GCP Associate Cloud Engineer certification.
- Familiarity with Prometheus/Grafana or similar alerting systems.
- Experience in structured system testing (QA/UAT).
Skills & Experience
Essential:
- Understanding of cloud infrastructure concepts (especially AWS or GCP).
- Familiarity with monitoring dashboards and logging tools.
- Excellent communication and problem-solving skills.
- Ability to stay calm under pressure and respond rapidly to incidents.
- Basic understanding of networking, API health, and authentication flows (e.g., RADIUS).
- Must have achieved 80 % or higher in Matric Mathematics.
Desirable:
- Exposure to operational support or NOC environment.
- AWS Cloud Practitioner / GCP Associate Cloud Engineer certification.
- Familiarity with Prometheus/Grafana or similar alerting systems.
- Experience in structured system testing (QA/UAT).