Added
38 minutes ago
Location
Type
Full time
Related skills
python tensorflow pytorch cuda nvidia nsight systems
Description
- Conduct performance analysis and optimization of DL networks on the autonomous vehicle (AV).
- Optimize software architecture, system performance, and latency for DL apps.
- Deploy DL models on the AV and train at scale in data centers.
- Troubleshoot performance using profiling and roofline techniques.
- Collaborate with cross-functional teams to enhance self-driving tech.
Requirements
- 5+ years of software engineering experience.
- BS, MS, or PhD in Computer Science or related field.
- CUDA, C++, Python; CV/transformer DL architectures.
- Extensive HPC/parallel programming; optimize GPU memory, latency, throughput.
- Nsight Systems/Compute and roofline model for optimization.
- DL/ML framework experience (PyTorch or TensorFlow).
Benefits
- Annual bonus
- Equity compensation
- Comprehensive benefits package
- Hybrid work environment (3 days/week in-office)