Senior Site Reliability Engineer

Added: 16 hours ago
Type: Full time
Salary: Not provided

Related skills: Ansible, Terraform, AWS, Grafana, Prometheus

Description

  • Lead migration of legacy apps to scalable Kubernetes clusters across environments.
  • Design and optimize highly available, scalable Kubernetes clusters.
  • Build and maintain CI/CD pipelines on Kubernetes with Spinnaker, ArgoCD, and Jenkins.
  • Automate Kubernetes infrastructure with Helm, Kustomize, and Operators, and build self-healing workflows.
  • Enhance monitoring with Prometheus, Grafana, and ELK to identify bottlenecks and failures.
  • Implement Kubernetes security best practices including secrets management and service mesh.

Requirements

  • Kubernetes core concepts and large-scale clusters.
  • Docker and container orchestration, including Helm, Kustomize, and Operators.
  • Kubernetes networking and service meshes (Istio, Linkerd).
  • Infrastructure automation with Terraform, CloudFormation, or Ansible on AWS.
  • CI/CD pipelines in Kubernetes (Spinnaker, ArgoCD, Jenkins).
  • Centralized logging/monitoring with Prometheus, Grafana, ELK.

Benefits

  • Benefits
  • Social Impact
  • Talent and Fostering Connection + Community at Okta