Production Engineer/Site Reliability Engineer (Shift Basis)

Added
3 days ago
Type
Full time
Salary
Salary not provided

Related skills

terraform cloudformation networking mysql python

๐Ÿ“‹ Description

  • Join a 24/7 Production Operations team for multi-cloud infrastructure.
  • Oversee staging and production environments to maximize uptime.
  • Implement observability for real-time monitoring and alerting.
  • Lead incident management, coordinating teams for quick resolution.
  • Analyze recurring incidents to identify root causes and reduce toil.
  • Design automation tools to detect, triage, and remediate issues.

๐ŸŽฏ Requirements

  • Solid understanding of distributed system concepts.
  • Experience with production systems in public cloud infrastructures.
  • Familiarity with Kubernetes.
  • Hands-on with CloudFormation and Terraform.
  • Proficient with Python.
  • Strong analytical and problem-solving skills for diagnosing issues.
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’