Senior Site Reliability Engineer (SRE)

Added
7 minutes ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

terraform aws grafana python kubernetes

๐Ÿ“‹ Description

  • Design, implement, maintain, and monitor reliable production systems at scale.
  • Lead incident response, mitigate production issues, and conduct post mortem analysis.
  • Proactively monitor performance, analyze failures, identify bottlenecks, and propose solutions.
  • Create and support observability/monitoring tools and vendor integrations.
  • Promote cross-functional collaboration to improve reliability, scalability, resilience, and security.
  • Train and mentor other engineers.

๐ŸŽฏ Requirements

  • 5+ years of reliability-focused engineering in fast-paced, enterprise environments.
  • Cloud: AWS, Azure, and/or GCP.
  • Infrastructure as code: Terraform or Crossplane.
  • Developing applications in Python, Ruby, or Go.
  • Deploying and operating Kubernetes at scale.
  • Experience with monitoring tools like Grafana, New Relic, or Datadog.

๐ŸŽ Benefits

  • Company-subsidized medical, dental, and vision plans.
  • 401(k) plan with company match.
  • Annual bonus.
  • Flexible PTO (2 weeks strongly encouraged).
  • 16-week paid parental leave and disability benefits.
  • Workplace flexibility and modern work schedules.
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’