Site Reliability Engineer III (SRE III)

Added
9 days ago
Type
Full time
Salary
Salary not provided

Related skills

azure docker terraform linux aws

πŸ“‹ Description

  • Proactively identify and implement preventative measures to reduce customer impact.
  • Ensure all services are designed for 24/7 availability, scalability, and resilience.
  • Monitor, troubleshoot, and improve site latency, performance, and uptime.
  • Design, develop, and automate reliable cloud infrastructure and platform services.
  • Apply IaC principles to manage large-scale distributed systems.
  • Write and maintain scripts, tools, and automation to improve operations.

🎯 Requirements

  • Bachelor’s degree in Computer Science or STEM field.
  • 6+ years in engineering or operations with focus on reliability and automation.
  • Linux-based distributed environments with hands-on experience.
  • Kubernetes and Docker for containerization and orchestration.
  • AWS and/or Azure; Terraform IaC experience.
  • Python and Bash scripting; OO programming is a plus.
  • Observability with Prometheus, Grafana, and OpenTelemetry.
  • CI/CD and DevOps practices; incident management.

🎁 Benefits

  • Competitive pay
  • Flexible work
  • Inclusive, collaborative environment
  • Opportunity to work on AI-powered finance solutions
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’