Added
29 days ago
Location
Type
Full time
Related skills
kubernetes, machine learning, ci/cd, cuda, tpu
Description
- Build and lead a high-performing team focused on developer productivity for Inference
- Own accelerator toolchain management across CUDA, TPU, and Trainium
- Build infrastructure for efficient accelerator usage during development (devboxes, validation)
- Establish and drive productivity metrics with dashboards and alerts
- Identify and eliminate inefficiencies across Inference engineering workflows
- Partner with Infrastructure to align on company-wide productivity initiatives
Requirements
- 3+ years of engineering management experience in infrastructure or developer productivity
- Strong background in systems engineering or ML infrastructure
- Experience managing toolchains/dev environments for compute-intensive workloads
- Familiarity with at least one accelerator ecosystem (CUDA/TPU/Trainium) and eagerness to learn others
- Proven track record of defining and using engineering metrics
- Experience partnering across orgs and influencing across a technical team
Benefits
- Competitive compensation and benefits
- Optional equity donation matching
- Generous vacation and parental leave
- Flexible working hours
- Office space for in-person collaboration
Visa sponsorship