Added
less than a minute ago
Type
Full time
Related skills
cuda tensorrt gpu nsight xla
Description
- Build real-time instrumentation for perf monitoring and benchmarking tools.
- Analyze GPU metrics to identify hotspots and propose solutions.
- Port serial algorithms to the GPU to increase compute throughput and reduce latency.
- Help design a middleware framework for efficient CPU/GPU code.
- Instrument, monitor, and optimize GPU-based CV/DL algorithms.
Requirements
- BS in CS or related field; 7+ years of experience.
- Strong CUDA knowledge on Ampere/Blackwell; debug/optimize kernels with Nsight.
- Strong C++ knowledge; large code bases; Linux development.
- Experience in multi-process systems debugging/profiling (robotics, game engines).
- Real-time GPU kernel dev; PTX and AVX; TensorRT/XLA.
- ML model optimization (PTQ/pruning) or hand-tuning of GPU kernels (CUDA/OpenGL/ROCm).
- BI tools: SQL, Databricks, Looker.
Benefits
- Paid time off including vacation, sick leave, and bereavement.
- Unpaid time off available.
- Zoox Stock Appreciation Rights.
- Amazon RSUs.
- Health insurance.
- Long-term and short-term disability and life insurance.