Design training and inference performance optimizations for vision-language (VLM) and vision-language-action (VLA) models in robotaxis.
Collaborate with ML researchers and engineers to align requirements and architecture.
Work with accelerators, distributed training, and model-compression techniques (quantization, distillation, pruning).
Build and operate ML tooling for model development and serving, both in-vehicle and off-vehicle.

🎯 Requirements

4+ years total experience, incl. 2+ years on large-scale training or inference.
Experience with PyTorch for distributed model training.
Experience with GPU-accelerated inference using TensorRT.
Experience with Nsight or PyTorch Profiler for bottleneck profiling.
Proficient in Python or C++.

🎁 Benefits

Paid time off (vacation, sick, bereavement)
Unpaid time off
Zoox Stock Appreciation Rights
Amazon RSUs
Health insurance
Disability insurance (long-term and short-term)
