Site Reliability Engineer

Added
less than a minute ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

datadog javascript docker bash aws

πŸ“‹ Description

  • Improve monitoring, alerting and observability of services
  • Handle critical alerts and incidents; work with R&D on availability
  • Identify root causes and orchestrate outages across teams
  • Improve alerting for infrastructure, services and business logic
  • Collaborate with R&D and Support on integration and monitoring
  • Write RCAs and runbooks; automate actions with Python, Lambda, shell, ArgoCD

🎯 Requirements

  • 3+ years SRE/Infra Backend in SaaS
  • Python, JavaScript, Bash coding
  • DataDog, Splunk, Prometheus, New Relic monitoring
  • Linux kernel to shell experience; Docker/Kubernetes
  • AWS, Google Cloud, Azure
  • Ansible/ArgoCD configuration management

🎁 Benefits

  • Equal opportunity employer with inclusive culture
  • Generous benefits and equity options
  • Opportunity to work on cutting-edge crypto infrastructure
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’