Site Reliability Engineer

Added
5 days ago
Type
Full time
Salary
Salary not provided

Related skills

gitlab datadog terraform github actions splunk

๐Ÿ“‹ Description

\n
    \n
  • Act as a technical escalation point for unresolved data platform issues in the SRE Pod
  • \n
  • Monitor, maintain, and troubleshoot databases and data warehouses and related infra
  • \n
  • Collaborate with the data engineering team to ensure efficient data flow and transformation
  • \n
  • Develop and maintain accurate operational runbooks
  • \n
  • Perform standard pre-approved changes within the client Change Management Process
  • \n
  • Lead incident resolution and post-mortem analysis and mitigation
  • \n
\n

๐ŸŽฏ Requirements

\n
    \n
  • IAC tooling: Terraform preferred; or ARM/Bicep and CloudFront
  • \n
  • Core CI/CD tooling: Azure DevOps, GitHub Actions, GitLab
  • \n
  • Monitoring tooling: DataDog, Splunk, NewRelic, Azure Monitor, AWS CloudWatch
  • \n
  • Experience in multiple core tech: Dotnet, Java, Golang, AI/Data Engineering
  • \n
  • Troubleshooting incidents; identifying systemic failings; implementing fixes
  • \n
  • Automation for incident/service requests; leadership in incident resolution
  • \n
\n

๐ŸŽ Benefits

\n
    \n
  • Flexible locations and remote work options
  • \n
  • Private healthcare for you and your family
  • \n
  • 27 days annual leave, rising to 30
  • \n
  • Sabbatical options at 5 & 10 years
  • \n
  • 5 days study leave; generous pension
  • \n
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’