Related skills
github, javascript, aws, sql, lambda
Description
- Act as primary incident commander for high-severity outages, coordinating response.
- Lead blameless post-mortems and document systemic improvements.
- Perform in-depth troubleshooting to resolve bugs and improve workflows.
- Own and evolve the observability strategy; move from reactive alerts to predictive insights.
- Dive into Ruby code, review GitHub PRs, manage feature flags, and run production jobs.
- Lead problem management; identify systemic trends and push permanent fixes with Product teams.
Requirements
- 5+ years in Production/SRE; deep AWS (ECS, Lambda, CloudWatch) and SQL.
- Expert-level experience with Observability/APM tools.
- Strong development background; comfortable reading/debugging Ruby/JavaScript; GitHub workflows.
- Proven incident command experience: managing the bridge, silencing noise, and driving resolution.
- Bias for Action; navigate ambiguity, manage timelines, stay calm under pressure.
- Translate complex failures into clear business-value narratives for non-technical executives.
Benefits
- Employer contributions for health, dental, and vision programs.
- Generous PTO, paid holidays, and parental leave.
- 401(k) matching program.
- Merit-based advancement opportunities.
- Career development and training.
- A team culture built on belonging.