Lead Site Reliability Engineer

Added
20 days ago
Type
Full time

Related skills

cloud terraform aws grafana prometheus

📋 Description

  • Define SRE strategy, architecture, and roadmap aligned to goals.
  • Lead the design and deployment of containerized workloads and infrastructure as code (IaC) in regulated cloud environments.
  • Establish observability, monitoring, and alerting at scale.
  • Drive incident management, on-call rotations, and root cause analysis.
  • Partner with security/compliance to meet regulatory requirements.
  • Champion automation and operational excellence.

🎯 Requirements

  • BS in Computer Science, Cybersecurity, Software Engineering, or equivalent; 5+ years in SRE, DevOps, or cloud infrastructure
  • Kubernetes expertise
  • Terraform/IaC experience
  • AWS cloud experience
  • Monitoring/alerting and performance optimization experience
  • Troubleshooting and incident management for distributed systems

🎁 Benefits

  • Remote-first, flexible work environment
  • Work on secure, mission-critical platforms
  • Open-source, collaborative culture