Staff Platform Site Reliability Specialist (Observability & Kubernetes)
Related skills
terraform grafana kubernetes gitlab ci/cd grafana loki๐ Description
- Own, operate, and evolve Everbridge's observability platform (EKS).
- Ensure reliability, scalability, and performance across the stack.
- Design instrumentation, dashboards, alerts, and SLOs.
- Manage Grafana ecosystem: Loki, Mimir, Tempo, and Alerting.
- Maintain Kubernetes clusters and EKS lifecycle; drive upgrades.
- Implement IaC and automation with Terraform, Packer, and GitLab CI/CD.
๐ฏ Requirements
- 6+ years in SRE / Platform Engineering.
- Strong Grafana ecosystem experience.
- Kubernetes and Amazon EKS expertise.
- Terraform proficiency.
- Experience with HashiCorp Packer and GitLab CI/CD.
- Ability to design scalable, highly available systems.
๐ Benefits
- Healthcare, dental, and mental health benefits.
- Disability income, life, and AD&D insurance.
- Retirement savings plan with employer match.
- Paid time off.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!