Senior Site Reliability Engineer

Added
43 minutes ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

azure linux aws networking python

πŸ“‹ Description

  • Work on our multi-tenant distributed storage systems.
  • Define SLOs, shape capacity plans, ensure reliability.
  • Collaborate to improve Atlas storage stack and confidence in uptime.
  • Join a small, senior SRE team as founding members.
  • Support 24/7 on-call rotation for storage infra.
  • Improve performance from app to kernel with a bias toward software solutions to toil.

🎯 Requirements

  • 6+ years of software development and distributed systems.
  • Proficiency in Python or Go.
  • Experience with stateful storage or DBs at scale.
  • Kubernetes/containerization experience.
  • Cloud platforms: AWS, GCP, or Azure.
  • Linux internals and networking (TCP/IP, DNS, TLS).
  • Customer-focused and automation-driven.
  • On-call experience and reliability mindset.

🎁 Benefits

  • Equity and employee stock purchase program.
  • Generous parental leave policy.
  • Fertility and adoption assistance.
  • RRSP with employer match.
  • Mental health counseling and benefits.
  • Flexible paid time off.
  • Disability accommodations in hiring/interview.
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’