Added
2 days ago
Location
Type
Full time
Salary
Salary not provided
Related skills
python tensorflow pytorch machine learning rlhf📋 Description
- Design, implement, and deploy RL research at scale.
- Build and deploy RL pipelines at scale.
- Post-train multi-modal models to align with human intent.
- Lifecycle from idea to production deployment.
- Foster external collaborations with academia and industry.
- Collaborate with Engineering, ML Platform, and HPC teams.
🎯 Requirements
- Strong technical background with RL/model alignment to production.
- Practical, creative mindset with real-world impact.
- Solid mathematical background; masters/PhD or equivalent.
- Python + ML frameworks: PyTorch, TensorFlow, or JAX.
- Track record leading self-directed research with tangible results.
- Expertise in deep RL (RLHF/RLAIF/RLVR) is a plus.
- Hands-on scaling/deploying LLMs in real systems is a plus.
🎁 Benefits
- Diverse, international team across multiple countries.
- Hybrid work with in-office twice a week.
- Hack Fridays monthly for side projects.
- 30 days annual leave and mental health resources.
- Virtual Shares and ownership in DeepL.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!