Tired of Manually Applying to Jobs?

Let JobCopilot do it for you!

Set your preferences and let your AI copilot handle the job search while you sleep.

Applies for jobs that actually match your skills

Tailors your resume and cover letter automatically

Works 24/7—so you don't have to

Design experiments to evaluate model behavior across reasoning, style, and robustness.
Develop new metrics and evaluation protocols beyond standard benchmarks.
Analyze large-scale human voting data to reveal insights on performance.
Collaborate with engineers to productionize research insights into systems.
Prototype and rapidly test research ideas with rigor and speed.
Partner with model providers to shape evaluation questions and responsible testing.

PhD or equivalent in ML/NLP or related field.
Hands-on training of large-scale models (reward/preference models; RLHF/DPO fine-tuning).
Strong ML/statistics foundation; design novel objectives and evaluation schemes.
Fluent in full ML stack: dataset design, large-batch training, rigorous evaluation.
Collaborative; able to productionize research insights with engineers/product teams.
Experience publishing or contributing to open-source ML/NLP or AI evaluation.

Competitive compensation and equity aligned to markets where our team members are based.
Comprehensive health and wellness benefits, including medical, dental, and vision.
The opportunity to work on cutting-edge AI with a small, mission-driven team.
A culture that values transparency, trust, and community impact.

Machine Learning Scientist - Open Source Lead

Meet JobCopilot: Your Personal AI Job Hunter