Added: 2 days ago
Type: Full time
Salary: Not provided

Related skills

GitOps, Terraform, Grafana, Prometheus, Kubernetes

📋 Description

  • Ensure reliability and performance of data services (Trino, Iceberg, S3, Kafka, Flink)
  • Define SRE practices: SLIs/SLOs, error budgets, observability
  • Build/maintain monitoring, alerts, and incident response (Prometheus, Grafana)
  • Contribute to the migration to the VeepeeCloud lakehouse and support cloud/on-prem coexistence
  • Operate Kubernetes services (GKE/EKS and on-prem)
  • Automate infrastructure with Terraform, Atlantis, Crossplane

🎯 Requirements

  • Kubernetes in production
  • Experience with distributed data systems, or willingness to learn
  • SRE principles: monitoring, alerting, SLAs/SLOs
  • Infrastructure as Code (Terraform or similar)
  • GitOps workflows
  • Observability tools (Prometheus, Grafana, logging)

🎁 Benefits

  • Variable bonus
  • Dynamic, international teams
  • Self-education courses on our e-learning platform
  • Meetups and conferences locally and internationally
  • Flexible office with up to 3 days at home
