Related skills
python machine learning c/c++ cuda gpuπ Description
- Co-design future hardware with vendors for programmability and performance
- Help vendors develop optimal kernels and add support in our compiler
- Develop performance estimates for kernels across hardware configurations
- Build system performance models and analyze scale up/down and networking
- Collaborate with ML engineers, kernel engineers and compiler teams
- Manage communication with internal and external partners
π― Requirements
- 4+ years of experience in compute at scale and ML optimization
- Strong experience in software/hardware co-design
- Deep understanding of GPU or AI accelerators
- Experience with CUDA or Triton
- Strong coding in C/C++ and Python
- Familiar with fundamentals of deep learning computing and chip architecture
π Benefits
- Medical, dental, and vision insurance for you and your family
- Mental health and wellness support
- 401(k) plan with 4% matching
- Unlimited time off and 18+ company holidays per year
- Paid parental leave (20 weeks) and family-planning support
- Annual learning & development stipend ($1,500 per year)
π Relocation support
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!