Added: 1 day ago
Type: Full time
Salary: Not provided

Related skills

Datadog, Grafana, Prometheus, Splunk, ELK Stack

📋 Description

  • Participate in 24/7 on-call monitoring; work with Engineering to detect issues.
  • Communicate with merchants in real time during incidents; share updates.
  • Lead initiatives and automate monitoring for better reliability.
  • Investigate alerts and provide feedback to build better logging and alerts.
  • Mitigate merchant impact by acting on alerts with Engineering; document learnings.
  • Prioritize, automate, and scale detection capabilities.

🎯 Requirements

  • 5 to 10 years of experience in incident communication and monitoring operations.
  • Willing to participate in an on-call rotation in a fast-paced environment.
  • Experience with Prometheus, Grafana, ELK Stack.
  • Experience with Datadog, Dynatrace, Splunk.
  • Strong analytical and problem-solving skills.
  • Ability to translate complex concepts for non-technical audiences.