Staff Engineer - Site Reliability

Added: 1 day ago
Type: Full-time
Salary: Not provided

Related skills

Terraform, Prometheus, Kubernetes, GCP, Kafka

📋 Description

  • Own reliability, availability, and performance of middleware and cloud platforms.
  • Optimize Kafka environments: performance tuning, capacity planning, upgrades.
  • Administer Vector Database platforms; support Weaviate.
  • Manage GCP and GKE environments to ensure scalable infrastructure.
  • Drive infrastructure automation and platform reliability initiatives.
  • Lead production incident response, troubleshooting, and post-incident reviews.

🎯 Requirements

  • 5+ years Kafka production experience.
  • 2+ years Vector Database experience; Weaviate preferred.
  • 5+ years GCP and Kubernetes (GKE) production experience.
  • Incident management, automation, and CI/CD experience (SRE/DevOps).
  • Infrastructure as Code experience, preferably Terraform.
  • Automation and scripting in Python, Go, or Shell.

🎁 Benefits

  • Medical insurance for employee and family.
  • Group term and personal accident insurance.
  • Work-life balance: leave and paid holidays.
  • Provident Fund and gratuity.
  • Employee assistance program and wellness initiatives.
  • Growth: ongoing learning and career development opportunities.
