Related skills
datadog, terraform, github actions, aws, grafana
Description
- Design, implement, maintain scalable observability for cloud-native apps
- Own monitoring across AWS and Kubernetes (EKS) clusters/workloads
- Operate and maintain self-hosted stacks: Prometheus, Grafana, Mimir, Loki, Tempo
- Manage and optimize Datadog metrics, logs, APM, alerts, and cost monitoring
- Improve observability architecture for high availability, scalability, fault tolerance
- Automate observability with IaC (Terraform, Helm) and CI/CD
Requirements
- 5+ years in cloud-native monitoring/observability
- Strong AWS experience; 5+ years with Kubernetes
- Kubernetes monitoring: metrics, logs, traces; alerts; SLOs/SLIs
- Experience operating scalable self-hosted stacks: Prometheus, Grafana, Mimir, Loki, Tempo
- Datadog experience: metrics, logs, APM, alerts, cost monitoring
- IaC/scripting (Terraform, Helm); CI/CD familiarity (GitHub Actions)
Benefits
- Stock grant opportunities based on role/location
- Perks and benefits vary by employment status and country
- Remote work flexibility with optional WeWork access
- Inclusive, equal-opportunity workplace
- Accommodations available on request during recruitment
- Global healthcare and benefits across 109 countries