Senior Site Reliability Engineer

Added
10 minutes ago
Type
Full time

Related skills

terraform cloudformation aws prometheus python

📋 Description

  • Build monitoring to keep the platform healthy and reliable
  • Implement alerting and runbooks for faster detection/remediation
  • Debug complex issues across multiple components of the stack
  • Participate in on-call rotation and blameless postmortems
  • Design and implement platform components to enable features
  • Build Kubernetes controllers to automate operations

🎯 Requirements

  • Bachelor's in CS/CE/EE/Robotics or a related field with 4+ years of experience (MS with 2+ years, or PhD)
  • Linux internals, TCP/IP networking, storage subsystems
  • Go or Python development for production software
  • Experience scaling and securing services in AWS/GCP or cloud-native environments
  • IaC with Terraform or CloudFormation
  • Kubernetes controllers in Go

🎁 Benefits

  • Health, dental, and vision insurance
  • HSA with employer match
  • 401(k) retirement plan with immediate vesting
  • Paid parental leave
  • Paid medical leave
  • Unlimited vacation and 15 paid holidays

🛃 Visa sponsorship
