Added
38 minutes ago
Location
Type
Full time
Salary
Upgrade to Premium to se...
Related skills
python tensorflow pytorch cuda nvidia nsight systems๐ Description
- Conduct performance analysis of DL networks on the AV.
- Optimize software architecture, performance, and latency for DL apps.
- Deploy DL models on the AV and train at large scale data centers.
- Troubleshoot performance with profiling and roofline techniques.
- Collaborate with cross-functional teams to enhance self-driving tech.
๐ฏ Requirements
- 5+ years of professional software engineering.
- BS/MS/PhD in Computer Science or related field.
- CUDA, C++, Python programming skills.
- HPC and parallel programming; optimize GPU memory, latency, throughput.
- Nsight Systems/Compute; apply roofline model for optimization.
- DL/ML workloads optimization at framework level with PyTorch or TensorFlow.
๐ Benefits
- Hybrid work environment with in-office 3 days per week.
- Annual bonus, equity compensation, and benefits.
- Career growth and learning opportunities.
- Collaborative, cross-functional teams.
- Health, dental, vision benefits.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!