Added
less than a minute ago
Location
Type
Full time
Salary
Upgrade to Premium to se...
Related skills
azure docker aws grafana prometheus๐ Description
- Ensure reliability and performance of business apps and IT services.
- Monitor system health and identify potential risks.
- Implement proactive tuning and capacity planning to prevent outages.
- Maintain and improve SLOs and SLAs across systems.
- Monitor incidents and support on-call rotations.
- Conduct root cause analysis and implement long-term fixes.
๐ฏ Requirements
- Bachelor's degree in Computer Science, IT, or related field.
- 7+ years in SRE/DevOps/Systems engineering with networking, systems, and coding.
- Experience with Azure, AWS, Google Cloud.
- Scripting: PowerShell, Python, Go, Bash, or Java.
- Monitoring tools: Dynatrace, Prometheus, Grafana, Datadog, Splunk.
- Docker and Kubernetes; CI/CD pipelines and version control (ADO, Git, GitLab CI).
- Automation/config mgmt: Ansible, Terraform, Puppet, Chef.
- SQL and NoSQL databases (SQL, Cosmos, MongoDB) preferred.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!