Related skills
Datadog, Terraform, GitHub Actions, AWS, Prometheus
Description
- Lead the design of scalable, fault-tolerant multi-region AWS systems
- Define and track SLOs/SLIs to drive decisions
- Conduct blameless post-incident reviews to fix root causes
- Build internal automation to remove manual work
- Create automated runbooks for incident response
- Evolve observability for proactive insights
Requirements
- Bachelor's degree in Computer Engineering or similar
- 5+ years in an SRE or similar role
- 3+ years AWS with container orchestration
- 2+ years Kubernetes
- Observability with Prometheus, Datadog, OpenTelemetry
- Terraform for infrastructure as code
Benefits
- Remote work
- Generous PTO
- Wellness and learning allowances
- Annual Airalo Away retreat