Added
9 hours ago
Location
Type
Internship
Salary
Upgrade to Premium to se...
Related skills
python pytorch llm rlhf rlπ Description
- Collaborate with mentor to identify high-impact LLM research directions.
- Independently run end-to-end SFT experiments on LLM-based agents.
- Assist with RL exploration: reward design and training iterations.
- Curate high-quality training datasets (instruction-following, agent trajectories, synthetic data).
- Contribute to public publications; support top-venue submissions during internship.
π― Requirements
- Highly motivated and able to put in extra hours as needed.
- Genuine passion for research; read papers and tinker with models.
- Independently capable of end-to-end model SFT; basic RL post-training methods (RLHF, DPO, PPO, GRPO).
- Excellent taste in model behavior; able to reason about what good looks like.
- Strong Python and PyTorch skills.
π Benefits
- Mentor pairing with a full-time engineer.
- Opportunity to contribute to top-tier publications during internship.
- Hands-on experience with LLMs, agent RL, and NewsBreak products.
- Collaborative, fast-paced research-focused team culture.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!