Software Engineer, Inference Deployment

Added
27 days ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

rust python kubernetes ci/cd deployment

πŸ“‹ Description

  • Own deployment orchestration across GPU/TPU/Trainium fleets, unattended
  • Improve capacity-aware deployment scheduling within constrained budgets
  • Extend deployment observability β€” dashboards and tooling for deployments
  • Drive down cycle time from merge to production via parallel pipelines
  • Optimize fleet rollout strategies across thousands of chips
  • Evolve self-service model onboarding for new models

🎯 Requirements

  • 5+ years building deployment/infrastructure at scale
  • Design systems with complex state machines and multi-stage pipelines
  • Experience with resource-constrained deployment (fleet capacity, hardware)
  • Automation that measurably improves deployment velocity and reliability
  • Proficiency with Kubernetes-based deployments and container orchestration
  • Comfort across the stack: backend services, databases, CLI tools, and web UIs

🎁 Benefits

  • Competitive compensation and benefits
  • Optional equity donation matching
  • Generous vacation and parental leave
  • Flexible working hours
  • Lovely office space in San Francisco

πŸ›ƒ Visa sponsorship

Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’