Senior Site Reliability Engineer

Added
10 days ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

terraform aws redis mysql python

๐Ÿ“‹ Description

  • Build and operate foundational, security-critical services with high availability.
  • Automate infrastructure to reduce operational toil.
  • Design and evolve systems using SRE best practices.
  • Define SLIs, SLOs, and error budgets to guide engineering decisions.
  • Improve observability, alerts, and incident response to reduce MTTD/MTTR.
  • Participate in on-call rotations with sustainable operations and automatic remediations.

๐ŸŽฏ Requirements

  • Write production-grade Python/Go code to automate ops.
  • Experience operating Kubernetes in production.
  • On-call readiness and diagnosing production issues.
  • Observability systems and actionable alerts.
  • Apply SRE concepts: SLIs, SLOs, and error budgets.
  • Infrastructure as code experience (Terraform, Kubernetes manifests).

๐ŸŽ Benefits

  • Annual bonus plan and equity.
  • Sign-on payments.
  • Health, welfare, and wellbeing benefits.
  • Relocation assistance.
  • Hybrid work model.
  • Generous overall benefits.

๐Ÿšš Relocation support

Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’