Added
less than a minute ago
Location
Type
Full time
Related skills
pytorch, distributed systems, scaling, performance, jax
Description
- Build and improve the RL training infrastructure used daily by researchers
- Identify and remove bottlenecks across the RL stack (debugging, profiling)
- Partner with researchers and adjacent engineering teams to ship tooling that makes them faster
- Own the reliability and performance of RL research runs end-to-end
- Contribute to design decisions shaping RL at scale
Requirements
- Strong software engineering fundamentals and experience building reliable, performant systems
- Experience in ML infrastructure, distributed systems, or research tooling
- Motivated by enabling others' work through platforms rather than running individual experiments
- Comfortable across the stack, from low-level perf to RL algorithms
- Bias toward shipping and iterating quickly with high agency and low ego
Benefits
- Competitive compensation and benefits
- Optional equity donation matching
- Generous vacation and parental leave
- Flexible working hours
- Office space for collaboration in San Francisco
Visa sponsorship