Site Reliability Engineer

Added
8 hours ago
Type
Full time
Salary
Salary not provided

Related skills

datadog aws grafana prometheus kubernetes

๐Ÿ“‹ Description

  • Own end-to-end technical success of the Managed Gateways Platform and reliability.
  • Architect and operate systems to maintain 99.99% uptime.
  • Shape the technical direction for Managed Gateways and drive innovation.
  • Collaborate with product leadership to define strategy, roadmap, and goals.

๐ŸŽฏ Requirements

  • Bachelor's or Master's in CS or a related field.
  • 3+ years building/operating reliable SaaS/PaaS systems.
  • Hands-on experience with AWS, Azure, or GCP.
  • Strong experience with Kubernetes.
  • Observability tools: Datadog, Prometheus, Grafana, Loki.
  • Designing and developing highly scalable distributed systems.
  • Networking: OSI L4/L7, DNS, TLS/SSL, HTTP, and cloud networks.
  • Incident management and communicating under pressure.
  • Backend development in Go.
  • 3+ years SaaS development with 99.99% reliability.
  • Strong verbal/written communication.
  • Experience with PostgreSQL (bonus).
  • Experience building Kubernetes Controllers (bonus).
  • Experience with L4/L7 proxies (Nginx, HAProxy, Envoy) (bonus).
  • Contributions at technical conferences or meetups as a speaker (bonus).
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’