Related skills
gitops terraform aws grafana prometheusπ Description
- Senior SRE to keep Akuity platform reliable at enterprise scale.
- Own reliability across the platform with engineering, infra, and product.
- Build systems and culture that scale with confidence.
- High-ownership role; you won't just respond to incidents.
- Collaborate with teams to embed reliability into features.
π― Requirements
- 5+ years in SRE/platform engineering or SaaS ops.
- Deep Kubernetes expertise across scheduler, networking, storage.
- Strong AWS fundamentals: EC2/EKS, VPC, Route53, IAM, S3.
- Experience defining and operating against SLOs; error budgets.
- Proficiency with observability tools: Prometheus, Grafana, OpenTelemetry.
- Scripting/automation: Go, Python, Bash; automate what you touch.
π Benefits
- Fully remote: work from anywhere in US time zones.
- Home office stipend and equipment budget.
- Equity participation in a well-funded company.
- Competitive compensation, commensurate with experience.
- Flexible time off and a culture that respects it.
- Work directly with the engineers who built Argo CD and Kargo.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!