Production Support & Reliability Engineer

Added: 9 days ago
Type: Full-time

Related skills

Azure, Helm, Linux, AWS, Kubernetes

📋 Description

  • Act as the primary L3 owner for all on-prem CI deployments and incidents.
  • Own customer issues end-to-end, from triage through root cause or escalation.
  • Serve as the primary contact for high-severity CI incidents with SRE, R&D, and Product.
  • Lead/co-lead on-prem CI deployments and upgrades; validate prerequisites; coordinate maintenance windows.
  • Monitor on-prem CI environments; drive first-line response to health, capacity, and latency issues.
  • Design, build, and maintain a Support-owned on-prem CI lab across AWS, GCP, and Azure.

🎯 Requirements

  • 5+ years of experience in technical support, SRE, or production operations.
  • Hands-on experience with Linux, containers, and Kubernetes (EKS/GKE/AKS or self-managed).
  • Experience deploying and supporting virtual appliances or on-prem products in enterprise environments.
  • Strong networking knowledge (load balancers, TLS, DNS, proxies, firewalls) in hybrid and on-prem environments.
  • Comfortable reading and interpreting logs and metrics from distributed systems; able to form hypotheses quickly.
  • Excellent written and verbal communication; able to turn recurring patterns into runbooks and training material.

🎁 Benefits

  • Salary: $130K – $150K per year (USD).