Related skills
Docker, Terraform, Grafana, Prometheus, Kubernetes
Description
- Design and operate scalable platform infrastructure
- Facilitate blameless post-incident reviews and root-cause analysis
- Run chaos engineering and disaster recovery exercises
- Deploy and evolve SRE practices with SLOs/SLIs and error budgets
- Reduce toil through automation, IaC, and tooling
- Improve observability with dashboards and monitoring
Requirements
- 8+ years in DevOps/SRE roles applying SRE principles
- Expertise: Kubernetes, Docker, Istio/Envoy/Linkerd
- IaC and CI/CD experience (Terraform/Ansible/CloudFormation; GitLab)
- Cloud platforms: AWS, GCP, Azure
- Incident management and blameless postmortems
- BS in Computer Science or equivalent
Benefits
- Global mental health and financial wellness resources
- Medical, dental, vision, life and disability coverage
- Retirement options (401(k)/pension)
- Paid time off and flexible work arrangements