Senior Site Reliability Engineer - Observability (x/f/m)

Added
13 hours ago
Type
Full time
Salary
Salary not provided

Related skills

prometheus python kubernetes go ruby

๐Ÿ“‹ Description

  • Lead the observability strategy across the platform to scale logging and tracing.
  • Drive large-scale reliability initiatives incl incident detection and postmortems.
  • Participate in on-call rotation and improve alerting and telemetry.
  • Collaborate in a European-scale team to support doctors and patients.

๐ŸŽฏ Requirements

  • 3+ years on a large-scale production platform
  • Experience with cloud platforms: AWS, Azure or Google Cloud
  • Docker and Kubernetes expertise
  • Helm and ArgoCD GitOps experience
  • Observability stack: logging, tracing, metrics
  • Proficient in Ruby, Python or Go; fluent English

๐ŸŽ Benefits

  • Deutschlandticket fully paid by Doctolib
  • 28 vacation days + annual increase up to 30
  • Hybrid work setup (up to 2 remote days per week)
  • Health insurance with Allianz
  • DoctoGrowth long-term employee value sharing plan
  • Mental health and coaching services through Moka.care

๐Ÿšš Relocation support

Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’