Related skills
Datadog, Terraform, AWS, Grafana, Python
Description
- Design automated reliability and self-healing systems for production.
- Own and improve incident management tooling and on-call health.
- Develop observability infra: monitoring, alerts, SLOs, and latency visibility.
- Contribute to AI-driven tooling for autonomous remediation.
- Collaborate with product teams to reduce operational burden.
- Define and champion operational excellence practices across engineering.
Requirements
- 8+ years designing and building software in a team.
- Bachelor's degree in Computer Science/Engineering or equivalent.
- 3+ years infra/platform engineering experience.
- Expertise in observability, reliability, and data analysis.
- Go and/or Python; Terraform; AWS or GCP.
- Mentoring engineers; experience with incident management tooling and IaC.
Benefits
- Flexible, employee-led remote model.
- Professional development stipend.
- Comprehensive health and parental leave plans.
- Equity for eligible roles.