Related skills
tensorflow pytorch machine learning rlhf jaxπ Description
- Design experiments to evaluate model behavior across reasoning, style, and robustness.
- Develop new metrics and evaluation protocols beyond standard benchmarks.
- Analyze large-scale human voting data to reveal insights on performance.
- Collaborate with engineers to productionize research insights into systems.
- Prototype and rapidly test research ideas with rigor and speed.
- Partner with model providers to shape evaluation questions and responsible testing.
π― Requirements
- PhD or equivalent in ML/NLP or related field.
- Hands-on training of large-scale models (reward/preference models; RLHF/DPO fine-tuning).
- Strong ML/statistics foundation; design novel objectives and evaluation schemes.
- Fluent in full ML stack: dataset design, large-batch training, rigorous evaluation.
- Collaborative; able to productionize research insights with engineers/product teams.
- Experience publishing or contributing to open-source ML/NLP or AI evaluation.
π Benefits
- Competitive compensation and equity aligned to markets where our team members are based.
- Comprehensive health and wellness benefits, including medical, dental, and vision.
- The opportunity to work on cutting-edge AI with a small, mission-driven team.
- A culture that values transparency, trust, and community impact.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!