Scale AI

Research Scientist, Safety Post Training

Location

Type

Full time

Related skills

AI, RLHF, DPO, interpretability, robustness

📋 Description

Design and run post-training pipelines to study safety, robustness, and alignment.
Develop interpretability-informed evaluations to reveal unsafe or undesirable behaviors.
Collaborate with policymakers, engineers, and researchers to translate findings into safety standards, benchmarks, and best practices.

🎯 Requirements

Commitment to safe, secure, and trustworthy AI deployments.
Experience with post-training and RL techniques such as RLHF, DPO, and GRPO.
A track record of published ML research, particularly in generative AI.
At least three years of experience addressing sophisticated ML problems in research or product development.
Strong written and verbal communication in cross-functional teams.
Nice to have: mechanistic interpretability, probing, or adversarial evaluation of post-trained models.

🎁 Benefits

Comprehensive health, dental, and vision coverage.
Retirement benefits.
Learning and development stipend.
Generous PTO.
Commuter stipend.
Equity-based compensation subject to board approval.
