Senior Site Reliability Engineer I

Added
37 minutes ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

terraform grafana prometheus kubernetes opentelemetry

๐Ÿ“‹ Description

  • Own and evolve distributed tracing infra (Jaeger, OpenTelemetry).
  • Build and operate log aggregation (Grafana Loki, Alloy).
  • Maintain and improve metrics infra (Cortex, Prometheus, Grafana) for alerts and SLOs.
  • Write internal tooling to make observability self-service.
  • Manage observability infra as code (Terraform, ArgoCD, Helm).
  • Collaborate with engineering to define instrumentation standards and SLOs.

๐ŸŽฏ Requirements

  • Bachelor's Degree in CS/Engineering or equivalent field.
  • 7+ years in SRE, platform or infrastructure engineering.
  • Strong Linux fundamentals; comfortable in Kubernetes environments.
  • Hands-on with LGTM stack: Loki, Grafana, Tempo/Jaeger, or Cortex.
  • IaC experience โ€” Terraform strongly preferred; CDK is a plus.
  • Experience with Go, Python, or Java.
  • United States citizen โ€” able to gain CJIS clearance for US production access.

๐ŸŽ Benefits

  • Competitive salary and 401k with employer match
  • Discretionary time off
  • Paid parental leave for all
  • Medical, Dental, Vision plans
  • Fitness Programs
  • Emotional & Development Programs
  • And snacks in our offices
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’