Related skills: Rust, Python, Kubernetes, PyTorch, CUDA
Description
- Build standardized distributed training frameworks for research and production.
- Profile model runtime and memory to pinpoint bottlenecks.
- Identify and evaluate new tech for training/inference (CUDA kernels, quantization, deployment).
- Collaborate with researchers and ML engineers on best practices for resource use.
- Create and improve tooling and dashboards for broad adoption.
Requirements
- MS/PhD or BS with 6+ years in CS/Robotics.
- Proficient in Python, C++, or Rust.
- Experience with PyTorch (or JAX).
- Skilled at profiling CPU/GPU code with PyTorch Profiler and Nsight.
- Experience with NVIDIA embedded platforms (Jetson or Thor).
- Open-minded, collaborative team player.
Benefits
- Competitive pay and equity.
- Medical, Dental, and Vision coverage.
- Unlimited vacation.
- Flexible hours and Work from Home support.
- Snacks and catered meals when in office.
- Team events on-site, off-site and virtual.