Staff Platform Site Reliability Specialist (Observability & Kubernetes)

Added
11 days ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

terraform grafana kubernetes gitlab ci/cd grafana loki

๐Ÿ“‹ Description

  • Own, operate, and evolve Everbridge's observability platform (EKS).
  • Ensure reliability, scalability, and performance across the stack.
  • Design instrumentation, dashboards, alerts, and SLOs.
  • Manage Grafana ecosystem: Loki, Mimir, Tempo, and Alerting.
  • Maintain Kubernetes clusters and EKS lifecycle; drive upgrades.
  • Implement IaC and automation with Terraform, Packer, and GitLab CI/CD.

๐ŸŽฏ Requirements

  • 6+ years in SRE / Platform Engineering.
  • Strong Grafana ecosystem experience.
  • Kubernetes and Amazon EKS expertise.
  • Terraform proficiency.
  • Experience with HashiCorp Packer and GitLab CI/CD.
  • Ability to design scalable, highly available systems.

๐ŸŽ Benefits

  • Healthcare, dental, and mental health benefits.
  • Disability income, life, and AD&D insurance.
  • Retirement savings plan with employer match.
  • Paid time off.
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’