Related skills
datadog pagerduty grafana new relic jira service management๐ Description
- Incident Commander for major incidents; coordinate cross-functional teams and SLA targets.
- Own incident communications; draft updates to leadership, CS, partners; manage status page.
- Lead blameless PIRs; identify root causes; assign actions with owners and deadlines.
- Analyze incident trends in non-incident periods; report findings to product/engineering.
- Enforce incident management framework: severity model, priority matrix, SLAs, and gates.
- Oversee and mentor the on-shift Operations Engineer; runbooks and knowledge transfer.
๐ฏ Requirements
- 6+ years in incident mgmt, SRE, NOC, or technical operations.
- Proven incident mgmt across multi-team response and exec communications.
- Excellent English communication; draft exec updates under pressure.
- ITIL foundation; understand lifecycle of incident, problem, and change.
- Observability depth: read logs and metrics in Datadog, Grafana, or New Relic.
- Hands-on tooling: Datadog, PagerDuty/OpsGenie, JIRA Service Management, Slack.
๐ Benefits
- Medical, dental, vision coverage, PTO, and career roadmap.
- Training and education opportunities for professional growth.
- Inclusive, equal-opportunity workplace; Fair Chance Act awareness.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Operations Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!