Added
less than a minute ago
Type
Full time
Salary
Upgrade to Premium to se...
Related skills
linux cuda tensorrt nsight rocmπ Description
- Build real-time instrumentation for performance monitoring and benchmarking tools.
- Analyze metrics to identify GPU hotspots and root causes.
- Bring serial algorithms to the GPU to maximize compute utilization and latency.
- Design a middleware framework with the Core team for efficient CPU/GPU code.
π― Requirements
- BS in CS or related field; 3+ years experience.
- CUDA knowledge on Ampere/Blackwell; Nsight debugging/optimizing GPU kernels.
- Strong C++ knowledge; large code bases; Linux development.
- Experience with development, debugging, and profiling multiprocess systems (robotics, game engines).
- GPU kernel dev in real-time env; PTX, AVX, TensorRT/XLA.
- ML model optimization or hand-tuning GPU kernels (OpenGL, CUDA, ROCm).
- Proficiency with SQL, DataBricks, Looker or BI tools.
π Benefits
- Zoox RSUs and stock appreciation rights.
- Comprehensive benefits: health, dental, vision; life and disability insurance.
- Paid time off, sick leave, holidays.
- Sign-on bonus may be offered; base salary plus equity.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!