Senior Staff Site Reliability Engineer

Added
9 days ago
Type
Full time
Salary
Salary not provided

Related skills

azure ansible terraform aws postgresql

๐Ÿ“‹ Description

  • Responsible for ongoing reliability and robustness of production infrastructure by monitoring availability, capacity, and throughput.
  • Evolve systems by adding reliability into product roadmap.
  • Coordinate reprioritization or fix critical bugs for support or sales requirements as needed.
  • Make recommendations to production infrastructure by interfacing with engineering to ensure 100% availability.
  • Ensure scalable artifacts deployment to all environments by automation scripts.
  • Constantly monitor infrastructure vulnerabilities and remedy them by working with the security team.

๐ŸŽฏ Requirements

  • 5+ years of experience working with SaaS products at scale.
  • Working knowledge of managed Kubernetes (EKS, AKS and GKE).
  • Knowledge of Cloud Platforms and related tooling: AWS, Azure, GCP, Terraform, Ansible, Buildkite, Pulumi and ArgoCD.
  • Experience in Python/Shell scripting. Bonus if you have Go, Java, etc.
  • Experience with Linux operating systems internals and administration.
  • Experience with cloud networking like VPNs, Privatelinks, and Private Service Connect (GCP).
  • Experience with PostgreSQL.

๐ŸŽ Benefits

  • 100% employer-paid medical insurance
  • Generous paid time-off policy (PTO), plus paid sick time, inclusive parental leave policy, holidays, and volunteer days off
  • RSU stock grants
  • Professional development and training opportunities
  • Company virtual happy hours, free food, and team-building activities
  • Monthly cell phone stipend
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’