... large-scale fault-tolerant systems. Lead "Design for Run" initiatives, ensuring ... to develop new metrics and monitoring solutions. Work with cutting-edge ... infrastructure as code. Proficiency in monitoring and alerting tools like Grafana ...
a month ago
... Optimize Operations: Lead security hardening, infrastructure automation, and monitoring improvements on AWS ...
28 days ago