Intermediate Site Reliability Engineer SRE – AI Reliability & Automation
Related skills
datadog azure java docker terraform📋 Description
- AI-Driven Observability: Build ML-based anomaly detection and pattern recognition.
- Enhance telemetry with smart tagging and metadata for AI insights.
- Intelligent Automation: Develop event-driven workflows and self-healing AI triggers.
- Automate incident response with generative AI and AI agent orchestration.
- Predictive Reliability: Time-series forecasting to anticipate failures.
- AI-powered autoscaling and cost-aware resource allocation.
🎯 Requirements
- 5+ years software engineering experience.
- Experience with SRE principles.
- Experience with AI/ML in production environments.
- Strong debugging, problem-solving, and system design skills.
- Passion for automation and intelligent systems.
- Bonus: Experience with AIOps platforms.
🎁 Benefits
- Benefits starting from Day 1
- Retirement Plan Matching
- Flexible Paid Time Off
- Wellness Support Programs and Resources
- Parental & Caregiver Leaves
- Fertility & Adoption Support
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!