Senior Site Reliability Engineer

Added
3 days ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

jenkins ansible terraform github actions aws

πŸ“‹ Description

  • Own reliability, scalability, and security of prod apps/platform
  • Design and manage monitoring/logging/alerting stack (Prometheus, Loki, Alloy, Grafana)
  • Define SLIs/SLOs and own alerting
  • Lead incident response and blameless postmortems
  • Automate for scale and security with IaC (Terraform, Ansible); manage Kubernetes
  • Eliminate toil and automate for air-gapped environments

🎯 Requirements

  • Active Top Secret clearance
  • 5+ years in Platform, DevOps, or Site Reliability Engineering
  • Kubernetes design, deployment, and operations
  • Terraform and Ansible for IaC
  • AWS or AWS GovCloud
  • Observability experience with Grafana/ELK or Datadog

🎁 Benefits

  • Relocation assistance
  • Hybrid work arrangement
  • Opportunity to work with DoD environments

🚚 Relocation support

Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’