Added
14 days ago
Location
Type
Full time
Salary
Salary not provided
Related skills
python tensorflow pytorch large language models reinforcement learning📋 Description
- Build and deploy state-of-the-art RL pipelines at scale.
- Post-train large models to align with human intent.
- Manage the full lifecycle from idea to production deployment.
- Build external collaborations with academic and industrial partners.
- Follow standards for experimentation, reproducibility, and evaluation.
- Collaborate with Engineering, ML Platform, and HPC teams for robust updates.
🎯 Requirements
- Deep RL or model alignment research with production impact.
- Strong track record leading self-directed research.
- Master's/PhD or equivalent in math/CS/physics.
- Python with PyTorch, TensorFlow, or JAX.
- Experience scaling and deploying LLMs in production.
- Collaborative across Eng, ML Platform, and HPC teams.
🎁 Benefits
- Diverse, globally distributed team across 90+ nationalities.
- Hybrid work with office twice a week.
- Monthly Hack Fridays for cross-team collaboration.
- 30 days of annual leave (plus holidays).
- Competitive, location-tailored benefits package.
- Virtual Shares linking your contribution to DeepL’s growth.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!