Senior Site Reliability Engineer, Colorado Springs

Added
1 day ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

ansible terraform aws grafana python

πŸ“‹ Description

  • Own reliability, scalability, and security of the production app/platform.
  • Design, implement, and manage monitoring, logging, and alerting stack (Prometheus, Loki, Alloy, Grafana).
  • Lead incident response and act as incident commander during critical incidents.
  • Automate scale; manage Kubernetes with Terraform/Ansible; embed RMF/STIG controls.
  • Collaborate across teams to reduce toil and improve deployment and ops.

🎯 Requirements

  • Active Top Secret clearance.
  • 5+ years in Platform, DevOps, or Site Reliability Engineering with infra/ops focus.
  • IaC: Terraform (or CloudFormation), Ansible.
  • Kubernetes design, deployment, operations; cloud and on-prem.
  • CI/CD: GitLab CI/CD, Jenkins, GitHub Actions; Scripting: Python/Go/Bash.
  • Cloud: AWS or GovCloud; Observability: Grafana/ELK/Datadog; Networking fundamentals.

🎁 Benefits

  • Relocation assistance

🚚 Relocation support

Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’