Added
1 day ago
Location
Type
Full time
Salary
Upgrade to Premium to se...
Related skills
ansible terraform aws grafana pythonπ Description
- Own reliability, scalability, and security of the production app/platform.
- Design, implement, and manage monitoring, logging, and alerting stack (Prometheus, Loki, Alloy, Grafana).
- Lead incident response and act as incident commander during critical incidents.
- Automate scale; manage Kubernetes with Terraform/Ansible; embed RMF/STIG controls.
- Collaborate across teams to reduce toil and improve deployment and ops.
π― Requirements
- Active Top Secret clearance.
- 5+ years in Platform, DevOps, or Site Reliability Engineering with infra/ops focus.
- IaC: Terraform (or CloudFormation), Ansible.
- Kubernetes design, deployment, operations; cloud and on-prem.
- CI/CD: GitLab CI/CD, Jenkins, GitHub Actions; Scripting: Python/Go/Bash.
- Cloud: AWS or GovCloud; Observability: Grafana/ELK/Datadog; Networking fundamentals.
π Benefits
- Relocation assistance
π Relocation support
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!