Added
15 days ago
Type
Full time
Related skills
datadog docker terraform aws python
Description
- Drive and refine modern SRE practices across services
- Design observability across metrics, logs, traces, and dashboards
- Partner with product/engineering to design reliable services
- Evolve AWS infrastructure via Terraform IaC
- Contribute code to reliability tooling and health checks
- Participate in incident response and post-incident reviews
Requirements
- 5+ years in SRE, DevOps, or production engineering
- Led multi-sprint reliability/infrastructure initiatives
- Hands-on with SLIs/SLOs, error budgets, and toil reduction
- Proficient in Python or TypeScript/Node.js
- Observability stack experience: Datadog, Prometheus, Grafana
- AWS production experience; Terraform IaC; Docker/Kubernetes
Benefits
- Generous equity grant
- MacBook provided
- Comprehensive benefits package
- Flexible PTO and hybrid work schedules
- Work from home stipend
- Hubs in Los Angeles, San Francisco, Toronto, and Raleigh with hybrid schedules