Overview
Senior Site Reliability / GitOps Engineer at Canonical (remote, worldwide). You will operate the infrastructure that supports Ubuntu and Canonical through code and cloud-native thinking.
What you'll do
- Design, build, and maintain reliable, scalable infrastructure for Canonical's platforms.
- Implement GitOps-based CI/CD pipelines and automated deployment workflows.
- Manage incident response, on-call rotations, and post-incident reviews.
- Improve monitoring, observability, and performance optimization across services.
- Collaborate with Platform, Cloud, and Engineering teams to drive reliability initiatives.
- Mentor engineers and promote best practices in SRE and automation.
Requirements
- 5+ years of Site Reliability Engineering or DevOps experience in production environments.
- Strong Linux/Unix proficiency and experience with cloud platforms (AWS, GCP, or Azure).
- Deep knowledge of containers and orchestration (Kubernetes, Docker).
- Infrastructure as Code using Terraform and/or Ansible.
- Observability tooling (Prometheus, Grafana, and related systems).
- Excellent problem-solving, collaboration, and communication skills.
About Canonical
Canonical is the company behind Ubuntu and a leader in open-source software and cloud-native infrastructure. We design, build, and support the platforms that power modern computing and cloud environments.