Related skills
Datadog, SRE, PagerDuty, distributed systems, incident response
📋 Description
- Stay calm and collected under pressure during incidents
- Collaborate with technical and non-technical teams
- Work with large-scale, secure, distributed systems
- Share information to reduce silos and break down barriers between teams
- Learn new technologies and adopt the right tools to manage these services, with attention to SLAs and MTTR
- Plan and lead strategic objectives for the engineering team
🎯 Requirements
- Master’s or bachelor’s degree in computer science, or relevant experience
- 5+ years of experience in an SRE or Software Engineering role
- Experience with production environments at scale
- Strong observability focus with an SLO, SLI, and KPI mindset
- Hands-on experience shepherding services from design to production
- Experience tackling site-wide outages, including capturing lessons learned and preventing recurrence
🎁 Benefits
- Healthcare benefits
- Internet/cell phone reimbursement
- Learning and development stipend
- Potential opportunities to travel to Palo Alto HQ