Staff Site Reliability Engineer

Added
less than a minute ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

bash grafana prometheus python firewalls

πŸ“‹ Description

  • Design, code, and deploy software and automation.
  • Create scalable monitoring and end-to-end infrastructure solutions.
  • Monitor services, participate in on-call, and prevent incidents.
  • Document and automate processes; deploy patches and admin tools.
  • Collaborate with cross-functional teams to improve integrations.

🎯 Requirements

  • US Citizenship required; 5+ years in 24/7 NOC or Cloud Ops.
  • Proficiency in Python or Bash.
  • Networking: HTTP, DNS, TCP/IP, ICMP, OSI model.
  • Experience with monitoring tools (Nagios, Grafana, Prometheus) and load balancers.
  • Willing to work after hours/weekends for releases.

🎁 Benefits

  • Health plans
  • Vacation and sick time
  • Parental leave options
  • Retirement options
  • Education reimbursement
  • In-office perks
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’