Related skills
terraform aws python django kubernetes๐ Description
- Set the technical vision and long-term reliability strategy across critical platforms.
- Lead design of foundational, security-critical services with strong availability and scalability.
- Drive adoption of SRE practices: SLIs, SLOs, error budgets, and reliability-driven decisions.
- Identify systemic reliability risks and bottlenecks; lead cross-team durable fixes.
- Automate infrastructure and reduce toil; improve reliability at scale.
- Own observability, alerting, and incident response to shorten detection and recovery times.
๐ฏ Requirements
- Senior technical leader with cloud-native SRE experience.
- Hands-on coding in production (e.g., Python, Go) to build internal platforms.
- Extensive experience with distributed cloud-native systems and failure modes.
- Strong experience operating Kubernetes in production and related deployment strategies.
- Familiar with infrastructure as code (Terraform) and declarative configs.
- Collaborates with product/security to build reliability in early and throughout the lifecycle.
๐ Benefits
- Hybrid Dublin-based role.
- Relocation support available.
- Generous benefits: health, welfare, wellbeing.
- Inclusive, ownership-driven culture with continuous learning.
๐ Relocation support
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!