Senior Cloud Site Reliability Engineer

Added
3 days ago
Type
Full time
Salary
Salary not provided

Related skills

datadog docker ansible terraform grafana

πŸ“‹ Description

  • Create dashboards with observability for app health (SLI/SLO).
  • Consult with dev streams on SRE services to improve reliability.
  • Automate manual tasks to reduce toil and errors.
  • Design, define and scope new solutions; document results.
  • Document findings and share with other SREs.
  • Ensure monitoring is set up and enabled across services.

🎯 Requirements

  • Bachelor's degree in CS/BIS or related field (or equivalent) required.
  • 4+ years programming or scripting experience.
  • 4+ years cloud environments (public/private).
  • 4+ years SRE or related experience.
  • Experience with Agile, Jira, GitHub, monitoring, automation, dashboards.
  • 6+ years communicating in English in a technical field.

🎁 Benefits

  • Prometheus, Datadog, Grafana, Splunk experience.
  • APM tools: Dynatrace, AppDynamics, New Relic.
  • Kubernetes, Docker, microservices, serverless compute.
  • Ansible, Terraform experience.
  • C#, C++, Java, Python, Perl, or Ruby expertise.
  • Self-driven, proactive mindset and continuous learning.
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’