Systems Reliability Engineer

Added
3 hours ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

ansible puppet linux bash python

πŸ“‹ Description

  • Provide support for MEMX exchange platforms, including on-call and incident response.
  • Isolate and resolve unplanned system outages.
  • Collaborate with cross-functional teams to ensure platform availability.
  • Improve operational processes (deployments/upgrades) by identifying issues.
  • Document actions to create repeatable, automatable processes.
  • Debug issues across services and interaction points.
  • Enhance monitoring and alerting based on symptoms.
  • Run essential nightly exchange processes and automate where possible.

🎯 Requirements

  • Good Linux knowledge and shell proficiency
  • Mid to advanced Linux administration and scripting
  • Bash scripting; Python a plus
  • Config management tools: Ansible, Chef, Puppet
  • 2+ years in operations/support with incident response
  • Incident tracking/escalation procedures

🎁 Benefits

  • Work From Home
  • Health Care Plan (Medical, Dental & Vision)
  • 401k Retirement Plan
  • Life Insurance (Basic, Voluntary AD&D)
  • Unlimited Paid Time Off
  • Training & Development
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to DevOps Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related DevOps Jobs

See more DevOps jobs β†’