Added
less than a minute ago
Location
Type
Full time
Salary
Upgrade to Premium to se...
Related skills
pytorch distributed systems performance reinforcement learning jax๐ Description
- Build and improve the RL training infrastructure used daily
- Identify bottlenecks across the RL stack; debug and rearchitect
- Collaborate with researchers and eng teams to ship faster tooling
- Own the reliability and performance of research runs end-to-end
- Contribute to design decisions shaping RL at scale
๐ฏ Requirements
- Strong software engineering fundamentals; build reliable systems
- Experience with ML infrastructure, distributed systems, or research tooling
- Passion for enabling others' work via platforms
- Comfort across the stack from low-level to RL algorithms
- Bias toward shipping and fast iteration with high agency
- Experience with large-scale distributed training (RL, pre/post-training)
๐ Benefits
- Competitive compensation and benefits
- Optional equity donation matching
- Generous vacation and parental leave
- Flexible working hours
- Collaborative office culture
๐ Visa sponsorship
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!