Related skills
sre cloud infrastructure terraform aws mlops๐ Description
- Lead and grow SRE, Cloud Infrastructure, and MLOps teams.
- Coach engineering managers and senior ICs, fostering ownership.
- Build a Platform-as-a-Product mindset for infra and ML tooling.
- Own production health: availability, latency, durability.
- Define SLIs/SLOs and error budgets; drive reliability.
- Lead incident response and blameless postmortems.
๐ฏ Requirements
- 8+ years in infrastructure/SRE/cloud engineering; 3+ years leading teams.
- AWS (preferred) and Terraform.
- Proven track record guiding teams through production incidents.
- MLOps: GPU orchestration, model serving, data pipeline stability.
- Large-scale containerized systems; strong observability.
- Strategic communication to align roadmaps with business objectives.
- Experience with FinOps a plus.
- Background in AI agentic workflows or autonomous orchestration is a plus.
๐ Benefits
- Remote-first workplace with global hubs.
- HQ in Canada; flexible coworking in ON/BC.
- MacBook laptop, monthly phone/internet stipend, home-office budget.
- Health and wellness benefits starting day 1.
- Asynchronous collaboration across UK, India, and North America; focus time.
- Employee Resource Groups and award-winning workplace.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!