This job is no longer available

The job listing you are looking for has expired.
Please browse our latest remote jobs.

Senior Manager, Site Reliability Engineering

Added: 19 days ago
Type: Full-time
Salary: Not specified

Xometry is seeking a Senior Manager, Site Reliability Engineering to lead and scale our SRE organization. In this senior leadership role, you will own the reliability of critical production systems, drive incident response and post-incident reviews, and partner with product engineering, platform, and security teams to design scalable, resilient infrastructure. You will mentor and grow a high-performing team, define SRE metrics (SLOs/SLIs) and standards, and champion automation and infrastructure-as-code (IaC) practices across the company.

Responsibilities

  • Lead and grow a team of Site Reliability Engineers, setting strategy, goals, and career development plans.
  • Establish and track reliability metrics (SLOs/SLIs), runbooks, on-call rotations, and post-incident reviews.
  • Manage incident response, root cause analyses, and continuous improvement efforts to reduce mean time to recovery (MTTR).
  • Architect scalable, highly available cloud services; implement robust monitoring, alerting, and observability across microservices and infrastructure.
  • Drive automation and infrastructure as code using Kubernetes, containers, Terraform, and related tooling.
  • Collaborate with software engineers, security, and product teams to improve reliability and performance.
  • Mentor engineers, hire top talent, and shape a strong SRE culture within the organization.

Requirements

  • Bachelor's degree in Computer Science, Engineering, or a related field (Master's preferred).
  • 8+ years of experience in Site Reliability Engineering or DevOps, with at least 3 years in a leadership/managerial role.
  • Strong expertise with cloud platforms (preferably AWS), Kubernetes, containers, and microservices.
  • Proficiency in programming languages such as Python and Go; experience with IaC tools like Terraform.
  • Experience with monitoring/observability stacks (Prometheus, Grafana, ELK) and incident management processes.
  • Excellent communication and collaboration skills; ability to lead across teams and levels.

Nice to have

  • Experience with security, data infrastructure, and scale-out architectures.

About Xometry

Xometry is the leading on-demand manufacturing platform that connects customers with a global network of manufacturers. We value reliability, collaboration, and continuous improvement.

What we offer

  • Competitive salary and bonus program
  • Comprehensive healthcare, dental, and vision
  • 401(k) plan with company match
  • Generous paid time off and flexible work options
  • Career growth and leadership development opportunities
