Senior Director, Site Reliability Engineering

Added: 3 hours ago
Type: Full time

Related skills

azure ansible terraform aws grafana

📋 Description

  • Define multi-year production engineering strategy across AWS, Azure, GCP, and bare-metal.
  • Lead, mentor, and grow a high-performing SRE org; hire Directors and Principal SREs.
  • Drive AI-first and automation-first initiatives to reduce toil and enable self-healing.
  • Establish enterprise-wide observability standards; define SLIs/SLOs and error budgets.
  • Serve as executive owner for Service Health Reviews; align with Product, Security, and Engineering.

🎯 Requirements

  • 18+ years of relevant experience; 10+ years leading large-scale SRE/engineering.
  • Deep mastery of distributed architectures, Linux, and multi-cloud (AWS/Azure/GCP).
  • Proven record of cross-department operational strategy that improves availability and mean time to mitigate (MTTM).
  • Executive leadership and communication; influence product roadmaps globally.
  • Strong production ownership: defining SLOs/SLIs; reliability and incident programs.
  • Experience with observability standards: Prometheus, Grafana, OpenTelemetry.

๐ŸŽ Benefits

  • Various health plans
  • Time off for vacation and sick leave
  • Parental leave options
  • Retirement options
  • Education reimbursement
  • In-office perks and more