Intermediate Site Reliability Engineer (SRE) – AI Reliability & Automation

Added: 27 days ago
Type: Full time

Related skills

Datadog, Azure, Java, Docker, Terraform

📋 Description

  • AI-Driven Observability: Build ML-based anomaly detection and pattern recognition.
  • Enhance telemetry with smart tagging and metadata for AI insights.
  • Intelligent Automation: Develop event-driven workflows and self-healing AI triggers.
  • Automate incident response with generative AI and AI agent orchestration.
  • Predictive Reliability: Apply time-series forecasting to anticipate failures.
  • Implement AI-powered autoscaling and cost-aware resource allocation.

🎯 Requirements

  • 5+ years of software engineering experience.
  • Experience with SRE principles.
  • Experience with AI/ML in production environments.
  • Strong debugging, problem-solving, and system design skills.
  • Passion for automation and intelligent systems.
  • Bonus: Experience with AIOps platforms.

🎁 Benefits

  • Benefits starting from Day 1
  • Retirement Plan Matching
  • Flexible Paid Time Off
  • Wellness Support Programs and Resources
  • Parental & Caregiver Leaves
  • Fertility & Adoption Support