Software Engineer, ML Performance Optimization

Added
3 hours ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

python pytorch distributed training tensorrt nsight

πŸ“‹ Description

  • Design ML training/inference performance optimizations for VLM/VLA in robotaxi.
  • Collaborate with ML researchers and engineers to align requirements and architecture.
  • Work with accelerators and distributed training techniques (quantization, distillation, pruning).
  • Build and operate ML tooling, model development, and serving for in- and off-vehicle use.

🎯 Requirements

  • 4+ years total experience, incl. 2+ years on large-scale training or inference.
  • Experience with PyTorch for distributed model training.
  • Experience with GPU-accelerated inference using TensorRT.
  • Experience with Nsight or PyTorch Profiler for bottleneck profiling.
  • Proficient in Python or C++.

🎁 Benefits

  • Paid time off (vacation, sick, bereavement)
  • Unpaid time off
  • Zoox Stock Appreciation Rights
  • Amazon RSUs
  • Health insurance
  • Disability insurance (long-term and short-term)
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’