Related skills
terraform cloudformation networking mysql python
📋 Description
- Join a 24/7 Production Operations team managing critical infrastructure in multi-cloud environments.
- Oversee staging and production environments to maximize uptime and reliability.
- Implement and maintain observability for real-time monitoring, alerts, and metrics.
- Lead incident response for alerts and outages, coordinating across teams.
- Analyze recurring incidents to identify root causes and reduce toil.
- Design and develop automation tools to detect, triage, and remediate production issues.
🎯 Requirements
- Solid understanding of distributed system concepts.
- Familiarity with Kubernetes and container orchestration.
- Hands-on experience with CloudFormation and Terraform.
- Strong analytical and problem-solving skills for diagnosing and resolving issues.
- Proficient in Python programming.
- Excellent verbal and written communication skills.