Senior Site Reliability Engineer

Added
18 hours ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

terraform aws grafana kubernetes pulumi

πŸ“‹ Description

  • Operate and evolve our EKS-based Kubernetes platform for reliability.
  • Design CI/CD for websites and Thunderbird releases; enable OIDC auth in GitHub Actions.
  • Write infrastructure as code on AWS with Pulumi, Terraform, or OpenTofu.
  • Evolve observability stack (VictoriaMetrics, VictoriaLogs, Grafana, Vector) and instrument services.
  • Security-focused infra: least-privilege IAM, secrets management, network segmentation.
  • Diagnose and debug production incidents; perform root-cause analysis and improvements.

🎯 Requirements

  • 7+ years in infrastructure/platform engineering or SRE with hands-on Kubernetes and cluster management.
  • Hands-on IaC on AWS using Terraform, OpenTofu, or Pulumi.
  • Security-focused infra: identity, least privilege, secrets hygiene, and network controls.
  • Ownership mindset; drive work to completion and raise risks early.
  • Excellent async written comms; comfortable with distributed teams.
  • Collaborate with software engineers and non-engineering stakeholders to boost reliability.

🎁 Benefits

  • 24 days PTO per year (prorated)
  • Your birthday
  • Year-end company shutdown
  • Quarterly wellbeing stipend for personal / family activities
  • Health, dental, and vision insurance
  • 401(k) contributions
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’