Related skills
pytorch transformers deepspeed tensorrt tritonπ Description
- Build and maintain end-to-end data pipelines for large-scale image and video datasets.
- Implement diffusion, autoregressive, flow-based, and transformer models; sustain high-throughput PyTorch training.
- Run multi-GPU/multi-node training; debug instabilities and convergence issues.
- Apply quantization, pruning, and knowledge distillation to compress models.
- Translate state-of-the-art research papers into production-ready implementations.
- Build evaluation pipelines for image quality, video consistency, and perceptual metrics.
π― Requirements
- 2β5 years hands-on experience building and training ML systems.
- Fluency in PyTorch: training and inference code.
- Experience training/fine-tuning generative models (diffusion, transformers, VAEs).
- Distributed training know-how; debugging large runs.
- Ability to read/implement CV AI papers; familiarity with current models.
- Experience building data pipelines for large-scale image/video datasets.
π Benefits
- Competitive salary and equity
- Medical, dental, and vision insurance covered
- 42 days PTO incl. 15 vacation days, 10 sick days, 15 holidays, 2 floating holidays
- Generous parental leave, fertility support
- 401(k) retirement plan
- Lifestyle spending account and perks (lunch and One Medical membership)
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!