Added
less than a minute ago
Location
Type
Full time
Salary
Upgrade to Premium to se...
Related skills
open source machine learning large language models reinforcement learning evaluationπ Description
- Lead strategy and execution for evaluating model capabilities.
- Drive original research into new evaluation methods.
- Lead a small team of researchers and engineers.
- Span the full lifecycle of model development and evals.
- Collaborate across RL, Pretraining, Inference, Product, Safeguards.
- Shape evaluation narratives for model releases.
π― Requirements
- Significant experience designing/running evaluations for LLMs or ML systems.
- Led technical projects or teams with ownership of research directions.
- Comfortable designing experiments and writing code across research to production.
- Think strategically about what to measure and why.
- Synthesize information across teams to form a cohesive view of capabilities.
- Communicate complex findings to technical and non-technical audiences.
π Benefits
- Remote-friendly role with travel requirements.
- Generous vacation and parental leave.
- Flexible working hours.
- Optional equity donation matching.
- Collaborative, research-focused culture with office space.
π Visa sponsorship
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!