Senior Site Reliability Engineer, Environment Automation

Added
6 hours ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

ansible terraform helm aws prometheus

πŸ“‹ Description

  • Build and scale multi-tenant infra with Terraform, Ansible, Kubernetes.
  • Debug production issues across Kubernetes, cloud services, and apps.
  • Automate operations at scale with IaC, upgrades, and pipelines.
  • Monitor capacity with Prometheus, ELK, and Grafana.
  • Lead incident response and postmortems to reduce risk.
  • Architect automation and collaborate with teams for reliability.

🎯 Requirements

  • Production-scale experience across multiple tenants/environments.
  • Terraform & IaC mastery; Ansible; Jsonnet a plus.
  • Kubernetes in production; diagnose deployment issues.
  • Go and/or Ruby code reading and analysis.
  • Large-scale operations and incident response.
  • GitLab platform proficiency and collaboration.

🎁 Benefits

  • Health, finances, and well-being benefits.
  • Flexible paid time off.
  • Equity and ESPP participation.
  • Growth and development funds.
  • Parental leave.
  • Home office support.
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’