Added
less than a minute ago
Location
Type
Full time
Salary
Upgrade to Premium to se...
Related skills
github javascript aws sql lambdaπ Description
- Act as primary incident commander for high-severity outages, coordinating response.
- Lead blameless post-mortems to dissect failures and drive improvements.
- Perform in-depth troubleshooting to resolve bugs and improve workflows.
- Own and evolve observability strategy to move from reactive to predictive insights.
- Dive into Ruby code, review PRs, manage feature flags, and run production jobs.
- Lead problem management; identify trends and partner on permanent fixes.
- Drive continuous improvement for tools, workflows, and docs.
- Drive escalations in challenging situations with engineering and IT teams.
π― Requirements
- 5+ years in Production/SRE with AWS (ECS, Lambda, CloudWatch) and SQL
- Expert-level observability/APM tools
- Strong dev background: Ruby/JavaScript; GitHub workflows
- Proven incident command experience under pressure
- Highly organized, detail-oriented, proactive
- Bias for Action; navigate ambiguity and timelines
- Translate complex failures into business narratives
π Benefits
- Considerable employer contributions for health, dental, and vision programs
- Generous PTO, paid holidays, and paid parental leave
- 401(k) matching program
- Merit advancement opportunities
- Career development and training
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!