Manager Site Reliability Engineer

Added: 43 minutes ago
Type: Full time
Salary: Not provided

Related skills

GitLab, Datadog, Terraform, GitHub Actions, Azure DevOps

📋 Description

  • Lead and refine SRE processes across multiple clients
  • Manage incident resolution, post-mortems, and mitigation planning
  • Design and reinforce change management processes
  • Develop automation for incidents and service requests
  • Proactively mitigate security risks in code, infrastructure, and dependencies
  • Lead client-facing discussions; Azure certifications beneficial during probation

🎯 Requirements

  • Terraform, ARM, and Bicep for IaC across clouds
  • Azure DevOps, GitHub Actions, GitLab; Harness beneficial
  • Monitoring with Datadog, Splunk, New Relic, Azure Monitor
  • .NET, Java, and JavaScript proficiency
  • Troubleshooting and incident resolution of systemic issues
  • Implement fixes/features and reduce toil
  • Lead post-mortems, risk mitigation, and security practices
  • Client-facing SRE discussions and stakeholder engagement