Staff Site Reliability Engineer

Added: 10 days ago
Type: Full time

Related skills

Datadog, Ansible, Terraform, AWS, Grafana

📋 Description

  • Design, build, and operate our hybrid cloud/on-prem infrastructure.
  • Implement SRE best practices focusing on users, monitoring, and automation.
  • Build cloud infrastructure patterns in AWS that are secure, reliable, and scalable.
  • Integrate on-prem datacenters with the cloud to enable hybrid cloud operation.
  • Improve reliability via root-cause analysis and design reviews.
  • Participate in platform on-call rotations and incident response.

🎯 Requirements

  • 8+ years of relevant experience.
  • Experience automating toil with scripting, Ansible, and Python/Go.
  • Experience with monitoring in Datadog, Grafana, and Prometheus.
  • Infrastructure as code: Terraform or CloudFormation.
  • Experience with hardware stacks (iDRAC/IPMI/NVIDIA UFM/Juniper).
  • Willingness to travel up to 25%.

🎁 Benefits

  • Annual Pay Range: $165,750 - $224,450
  • Not overtime eligible
  • Eligible for equity
  • Remote (LI-Remote)