Lead Site Reliability Engineer

Added
13 days ago
Type
Full time

Related skills

terraform aws python django kubernetes

📋 Description

  • Set the technical vision and long-term reliability strategy across critical platforms.
  • Lead design of foundational, security-critical services with strong availability and scalability.
  • Drive adoption of SRE practices: SLIs, SLOs, error budgets, and reliability-driven decisions.
  • Identify systemic reliability risks and bottlenecks; lead durable, cross-team fixes.
  • Automate infrastructure and reduce toil; improve reliability at scale.
  • Own observability, alerting, and incident response to shorten detection and recovery times.

🎯 Requirements

  • Senior technical leader with cloud-native SRE experience.
  • Hands-on coding in production (e.g., Python, Go) to build internal platforms.
  • Extensive experience with distributed cloud-native systems and failure modes.
  • Strong experience operating Kubernetes in production and related deployment strategies.
  • Familiarity with infrastructure as code (Terraform) and declarative configuration.
  • Collaborates with product and security teams to build in reliability early and throughout the lifecycle.

๐ŸŽ Benefits

  • Hybrid Dublin-based role.
  • Relocation support available.
  • Generous benefits: health, welfare, wellbeing.
  • Inclusive, ownership-driven culture with continuous learning.

🚚 Relocation support
