Related skills
Python, deep learning, TensorFlow, PyTorch, LLMs
Description
- Conduct literature reviews and implement RL/self-distillation algorithms.
- Design and run experiments to evaluate methods on code/agentic tasks.
- Develop and maintain code for theory and practical implementations.
- Collaborate with researchers to analyze results and publish findings.
- Design mechanisms for large rollouts (summarization and sub-agents).
- Document progress, methodologies, and outcomes clearly.
Requirements
- Strong background in ML, especially RL and deep learning.
- Proficiency in Python; experience with PyTorch and TensorFlow.
- Familiarity with LLMs and their training paradigms.
- Experience with coding tasks, unit testing, or compiler tools is a plus.
- Currently pursuing a Master's or PhD in CS, ML, or a related field.
Benefits
- Open and inclusive culture and work environment
- Work with a team on the cutting edge of AI research
- Weekly lunches and snacks, in-office meals
- Full health and dental benefits, including mental health budget
- 100% Parental Leave top-up for up to 6 months
- Remote-flexible, with offices in major cities and a stipend