Related skills
cloud aws leadership monitoring technical documentationπ Description
- Lead production monitoring in a 24x7x365 environment.
- Manage reliability engineers and drive service reliability improvements.
- Proactively identify issues; coordinate incident resolution.
- Improve monitoring and health checks across applications.
- Collaborate with IT and business teams for continuous improvement.
- Communicate status and incident updates to leadership.
π― Requirements
- Production monitoring & 24x7x365 ops experience.
- Incident management, root cause analysis for cloud apps.
- Hands-on AWS and cloud monitoring tools.
- Build/implement monitoring; automate processes; alerts.
- System health monitoring and production issue troubleshooting.
- Strong leadership across IT, business, infra teams.
- Clear communication for updates and incident reports.
- Document technical knowledge for production support.
π Benefits
- Comprehensive health coverage
- Flexible PTO
- Federal holidays off
- Tuition reimbursement
- Professional development support
- Wellness stipends
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!