Related skills
azure ansible terraform aws grafana๐ Description
- Define multi-year production engineering strategy across AWS, Azure, GCP, and bare-metal.
- Lead, mentor, and grow a high-performing SRE org; hire Directors and Principal SREs.
- Drive AI-first and automation-first initiatives to reduce toil and enable self-healing.
- Establish enterprise-wide observability standards; define SLIs/SLOs and budgets.
- Serve as executive owner for Service Health Reviews; align with Product, Security, and Engineering.
๐ฏ Requirements
- 18+ years of relevant experience; 10+ years leading large-scale SRE/engineering.
- Deep mastery of distributed architectures, Linux, and multi-cloud (AWS/Azure/GCP).
- Proven cross-department operational strategy; improve availability and MTTM.
- Executive leadership and communication; influence product roadmaps globally.
- Strong production ownership: defining SLOs/SLIs; reliability and incident programs.
- Experience with observability standards: Prometheus, Grafana, OpenTelemetry.
๐ Benefits
- Various health plans
- Time off for vacation and sick leave
- Parental leave options
- Retirement options
- Education reimbursement
- In-office perks and more
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!