Related skills
kubernetes capacity planning observability job scheduling cluster managementπ Description
- Partner with Infrastructure to build scheduling and capacity systems.
- Impact cluster utilization, cost efficiency, and researcher velocity.
- Define compute platform evolution for training, fine-tuning, inference, and batch eval.
- Own roadmap for scheduling primitives, capacity policies, and observability tooling.
π― Requirements
- 7+ years in product management for compute infra or scheduling.
- Experience scaling infra products for internal or external customers.
- Balance utilization, latency, cost, and fairness for multiple users.
- Internalize schedulers and cluster managers to craft a product vision.
- Cross-functional fluency with Eng, Finance, and Leadership.
- Strong business-outcome focus; link utilization to cost and reliability.
- Scrappy and resourceful in fast-moving environments.
π Benefits
- Competitive compensation and benefits.
- Optional equity donation matching.
- Generous vacation and parental leave; flexible hours.
- Hybrid-friendly office space and collaborative environment.
π Visa sponsorship
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Product Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!