Service Reliability Engineer (Internal)

Added
less than a minute ago
Type
Full time

Related skills

Azure, Docker, AWS, Grafana, Prometheus

📋 Description

  • Ensure reliability, availability, and performance of business applications, IT services and infrastructure.
  • Monitor system health; identify potential risks.
  • Implement proactive measures such as performance tuning and capacity planning.
  • Maintain SLOs and SLAs across systems and services.
  • Participate in on-call rotations for timely incident resolution.
  • Collaborate with development and operations to improve observability and alerting.

🎯 Requirements

  • Bachelor's degree in CS, IT, or a related field.
  • 7+ years in SRE, DevOps, or systems engineering, with experience across networking, systems, and applications.
  • Cloud: Azure, AWS, or Google Cloud.
  • Scripting: PowerShell, Python, Go, Bash, Java.
  • Monitoring/observability tools: Dynatrace, Prometheus, Grafana, Datadog, Splunk.
  • CI/CD and version control: ADO, Git, GitLab CI.
  • Databases: experience with SQL and NoSQL (Cosmos DB, MongoDB).
  • Agile practices; tools like ADO and Teams or Confluence.