Related skills
terraform postgresql grafana prometheus kubernetes๐ Description
- Collaborate with infra and product teams to enforce telemetry practices across services.
- Own and operate the observability team's Kubernetes infra; define docs, flows, and standards.
- Ensure industry-standard deployment and reliability practices; develop reliability software for observability.
- Orchestrate and scale VictoriaMetrics, OpenTelemetry Collector, and Vector.
๐ฏ Requirements
- 5+ years in a Site Reliability Engineering role.
- Experience operating and supporting clustered applications in production.
- Hands-on experience deploying and managing applications in Kubernetes.
- Working knowledge of PostgreSQL, including admin, performance tuning, and troubleshooting.
- Proficiency with at least one IaC tool (Terraform, Pulumi, OpenTofu, or equivalent).
- Experience with telemetry tooling such as OpenTelemetry, VictoriaMetrics, Grafana, Prometheus.
๐ Benefits
- Fully Remote
- ESOP
- Tech Allowance
- Health Benefits
- Annual Off-Sites
- Flexible Work
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to DevOps Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!