Related skills
gitops terraform grafana prometheus kubernetes📋 Description
- Ensure reliability and performance of data services (Trino, Iceberg, S3, Kafka, Flink)
- Define SRE practices: SLIs/SLOs, error budgets, observability
- Build/maintain monitoring, alerts, and incident response (Prometheus, Grafana)
- Contribute to migration to VeepeeCloud lakehouse; cloud/on-prem coexistence
- Operate Kubernetes services (GKE/EKS and on-prem)
- Automate infrastructure with Terraform, Atlantis, Crossplane
🎯 Requirements
- Kubernetes in production
- Distributed data systems or willingness to learn
- SRE principles: monitoring, alerting, SLAs/SLOs
- Infrastructure as Code (Terraform or similar)
- GitOps workflows
- Observability tools (Prometheus, Grafana, logging)
🎁 Benefits
- Variable bonus
- Dynamic, international teams
- Self-education courses on our e-learning platform
- Meetups and conferences locally and internationally
- Flexible Office with up to 3 days at home
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!