Added
12 days ago
Type
Full time
Salary
Upgrade to Premium to se...
Related skills
python pytorch data pipelines cuda tritonπ Description
- Performance profiling and optimization to reduce step time
- Optimize distributed training pipelines with PyTorch Distributed
- Design and maintain GPU kernels in Triton or CUDA
- Build robust data loading pipelines to maximize throughput
π― Requirements
- Education: Bachelor's/Master's/PhD in CS, CE, or related
- Software engineering: Strong Python proficiency
- ML frameworks: Extensive PyTorch experience
- ML knowledge: Optimizing training/inference; ML concepts
- Problem solving: Analytical, data-driven approach
π Benefits
- Medical, dental, and vision coverage
- 401(k) with company match
- Health savings accounts
- Life and pet insurance
- Additional company benefits
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!