Site Reliability Engineer (SRE) II

Added
12 days ago
Type
Full time
Salary
Salary not provided

Related skills

gitops opentelemetry flux gke argocd

๐Ÿ“‹ Description

  • Maintain and support production systems in the Echo ecosystem.
  • Ensure high availability, performance, and reliability of platform services.
  • Define, monitor, and improve SLOs, SLIs, and error budgets.
  • Proactively identify system risks and implement reliability improvements.
  • Participate in incident response, troubleshooting, and post-incident reviews.

๐ŸŽฏ Requirements

  • 4-7 years in SRE, DevOps, or Platform Engineering.
  • Strong hands-on experience with GKE production workloads.
  • GitOps experience (ArgoCD or Flux).
  • Kubernetes networking and cloud networking fundamentals.
  • OpenTelemetry (OTEL) observability experience.
  • Experience defining and operating SLOs/SLIs.

๐ŸŽ Benefits

  • Health: medical, dental, and vision
  • Time away: vacation and holidays
  • Development: generous tuition reimbursement and internal professional development resources
  • Equal opportunity employer
  • #LI-Remote
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’