Staff Site Reliability Engineer

Added
23 hours ago
Type
Full time
Salary
Salary not provided

Related skills

azure terraform aws postgresql kubernetes

πŸ“‹ Description

  • Monitor production reliability: availability, capacity, throughput.
  • Collaborate with engineering to embed reliability in the roadmap.
  • Support prioritization and resolution of critical bugs from support or sales.
  • Automate deployments to improve reliability and scalability.
  • Ensure scalable artifact deployment across environments via automation.
  • Proactively monitor vulnerabilities and coordinate with security to fix them.

🎯 Requirements

  • 7+ years of SaaS platform experience at scale
  • Expertise in managed Kubernetes (EKS, AKS, GKE)
  • Knowledge of AWS, Azure, GCP; Terraform, Ansible, Buildkite, Pulumi, ArgoCD
  • Python, Go, and Bash scripting; Java a bonus
  • Linux OS, internals, and administration
  • Cloud networking (NAT gateways, VPNs, Private Service Connect) and PostgreSQL

🎁 Benefits

  • 100% employer-paid medical insurance
  • Generous PTO and holidays
  • RSU stock grants
  • Professional development opportunities
  • Monthly cell phone stipend
  • Mental health support resources
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’