Team Lead/Reliability Engineer

Added
2 days ago
Type
Full time
Salary
Salary not provided

Related skills

cloud aws leadership monitoring technical documentation

πŸ“‹ Description

  • Lead production monitoring in a 24x7x365 environment.
  • Manage reliability engineers and drive service reliability improvements.
  • Proactively identify issues; coordinate incident resolution.
  • Improve monitoring and health checks across applications.
  • Collaborate with IT and business teams for continuous improvement.
  • Communicate status and incident updates to leadership.

🎯 Requirements

  • Production monitoring & 24x7x365 ops experience.
  • Incident management, root cause analysis for cloud apps.
  • Hands-on AWS and cloud monitoring tools.
  • Build/implement monitoring; automate processes; alerts.
  • System health monitoring and production issue troubleshooting.
  • Strong leadership across IT, business, infra teams.
  • Clear communication for updates and incident reports.
  • Document technical knowledge for production support.

🎁 Benefits

  • Comprehensive health coverage
  • Flexible PTO
  • Federal holidays off
  • Tuition reimbursement
  • Professional development support
  • Wellness stipends
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’