Related skills
azure bash aws grafana prometheus๐ Description
- Develop automation to manage infrastructure rollouts across cloud providers.
- Optimize telemetry to identify events and aid debugging.
- Collaborate with engineering to improve cloud-service performance.
- Debug live site events and perform postmortems and RCA.
- Participate in SLA-driven on-call rotation (after-hours, weekends).
๐ฏ Requirements
- 7+ years of Site Reliability Engineer experience
- Infrastructure automation; scripting (Python, Bash)
- Prometheus stack experience; Grafana, Mimir, Loki a plus
- Kubernetes and container ecosystem knowledge
- Experience with AWS, Azure, or Google Cloud
- B.S. Degree in Computer Science or related field
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!