Related skills
sre azure aws gcp postmortemπ Description
- Build and lead a global incident response team
- Own incident response as commander for high-severity outages
- Design on-call models with follow-the-sun coverage
- Set standards for incident communications and handoffs
- Drive postmortems, CRCA, and customer-rooted actions
- Advance AI-assisted tooling and runbooks for faster responses
π― Requirements
- 10+ years in SRE, incident mgmt, or reliability engineering
- 3+ years leading incident/reliability teams
- Proven incident commander for major outages
- Cloud infra experience across AWS, GCP, or Azure
- Distributed systems failure knowledge; Kafka/event streaming preferred
- Postmortems, corrective actions, and RCA tracking
π Benefits
- Belonging and inclusive culture
- Remote-first, global, time-zone distributed team
- Opportunity to shape incident response practices
- Collaborative, high-impact engineering environment
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!