Related skills
datadog docker terraform github actions aws๐ Description
- Own the technical direction of Remote's SRE/Platform domain and roadmap.
- Define reliability strategy: SLOs/SLIs, error budgets, observability, incidents.
- Lead cross-team infra initiatives from discovery to delivery.
- Identify and drive AI enablement across the engineering org to reduce toil.
- Drive AI-powered automation for platform ops: alerting, triage, and self-healing runbooks.
๐ฏ Requirements
- 8+ years of experience in SRE/DevOps/Platform Engineering.
- Deep Kubernetes expertise: operating, designing, and scaling production clusters.
- Cloud infra at scale on AWS.
- Terraform infrastructure as code practice.
- SLOs/SLIs, error budgets, and alerting strategies.
- Observability with Datadog, Grafana, and Prometheus.
๐ Benefits
- work from anywhere
- flexible paid time off
- flexible working hours
- 16 weeks paid parental leave
- stock options
- learning budget
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!