Related skills
datadog grafana prometheus splunk elk stack📋 Description
- 24/7 on-call monitoring of platform and merchant performance.
- Communicate with merchants in real time during incidents.
- Collaborate with Ops, Product, Eng to improve monitoring and reliability.
- Lead initiatives and automate monitoring tooling.
- Investigate alerts and improve logs and alerting.
- Prioritize, automate, and scale detection capabilities.
🎯 Requirements
- 5-10 years in incident communication and platform monitoring ops.
- Willing to join on-call rotation in a fast-paced environment.
- Experience with Prometheus, Grafana, ELK Stack.
- Experience with Datadog, Dynatrace, Splunk.
- Excellent analytical and problem-solving skills.
- Thrive in collaborative, global teams.
- Define and standardize processes; translate tech for non-technical audiences.
- Manage multiple complex tasks and stay calm under pressure.
- Office-based Bengaluru role; no remote option.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!