Site Reliability Specialist (Observability & Kubernetes)

Added
less than a minute ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

terraform grafana kubernetes eks loki

๐Ÿ“‹ Description

  • Own, operate, and evolve Everbridgeโ€™s observability platform
  • Build a highly available, scalable observability stack
  • Standardize instrumentation, dashboards, alerts, and SLOs
  • Support incident response and root cause analysis
  • Maintain Grafana stack and telemetry (Loki, Mimir, Tempo)
  • Manage EKS/Kubernetes infrastructure

๐ŸŽฏ Requirements

  • 6+ years in SRE / platform engineering
  • Strong Grafana ecosystem experience
  • Kubernetes and Amazon EKS expertise
  • Terraform proficiency
  • OpenTelemetry experience
  • Large-scale observability systems experience

๐ŸŽ Benefits

  • Healthcare and dental benefits
  • Parental planning and family benefits
  • Mental health support
  • Disability income benefits and life/AD&D insurance
  • 401(k) plan with company match
  • Paid time off and fitness reimbursements
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’