Groupon

34 jobs posted

View company profile →

Please mention that you found this job on empllo.com. Thanks & good luck!

Tired of Manually Applying to Jobs?

Let JobCopilot do it for you!

Set your preferences and let your AI copilot handle the job search while you sleep.

Applies for jobs that actually match your skills
Tailors your resume and cover letter automatically
Works 24/7—so you don't have to

Activate JobCopilot

Follow us on LinkedIn!

Principal Site Reliability Engineer (AI-first SRE)

Added

7 days ago

Location

Type

Full time

Salary

Salary not provided

Related skills

terraform grafana prometheus python kubernetes

📋 Description

Architect and maintain self-healing systems with 99.9%+ availability targets.
Use AI/ML to automate infra governance and detect IaC anti-patterns.
Implement adaptive SLIs/SLOs that evolve automatically from real-time data.
Build AIOps-based observability and auto-remediation pipelines.
Apply predictive modeling to forecast failures before they impact users.
Lead chaos, performance, and resilience testing programs.

🎯 Requirements

10+ years in software/systems engineering, with 5+ years in SRE.
Strong experience with GCP (preferred) or AWS, Kubernetes, and Terraform.
Proficiency in Python or Go for automation and tooling.
Observability stacks (Prometheus/Grafana/OpenTelemetry) and service meshes.
Hands-on AIOps experience: anomaly detection, predictive analytics, ML-assisted operations.
Strong communication and influencing skills — data over hierarchy.

🎁 Benefits

Access to cutting-edge technologies in a transformative environment.
Professional growth and leadership development pathways.
A chance to shape reliable and scalable systems with impact.

Apply on employer's website

This employer gathers applications via their own applicant tracking system.

You will be redirected to an external application form.

Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.

Activate JobCopilot