Added
less than a minute ago
Type
Full time

Related skills

azure docker aws grafana prometheus

📋 Description

  • Ensure reliability and performance of business apps and IT services.
  • Monitor system health and identify potential risks.
  • Implement proactive tuning and capacity planning to prevent outages.
  • Maintain and improve SLOs and SLAs across systems.
  • Monitor incidents and support on-call rotations.
  • Conduct root cause analysis and implement long-term fixes.

🎯 Requirements

  • Bachelor's degree in Computer Science, IT, or related field.
  • 7+ years in SRE, DevOps, or systems engineering, with experience in networking, systems, and coding.
  • Experience with Azure, AWS, Google Cloud.
  • Scripting: PowerShell, Python, Go, Bash, or Java.
  • Monitoring tools: Dynatrace, Prometheus, Grafana, Datadog, Splunk.
  • Docker and Kubernetes; CI/CD pipelines and version control (Azure DevOps, Git, GitLab CI).
  • Automation and configuration management: Ansible, Terraform, Puppet, Chef.
  • SQL and NoSQL databases (SQL, Cosmos, MongoDB) preferred.