Scale AI

39 jobs posted

Save Job

View company profile →

Please mention that you found this job on empllo.com. Thanks & good luck!

Tired of Manually Applying to Jobs?

Let JobCopilot do it for you!

Set your preferences and let your AI copilot handle the job search while you sleep.

Applies for jobs that actually match your skills
Tailors your resume and cover letter automatically
Works 24/7—so you don't have to

Activate JobCopilot

Scale AI

Research Scientist, Agent Robustness

On site

Engineering

Added

less than a minute ago

Location

Type

Full time

Salary

Upgrade to Premium to se...

Apply Now

Save Job

Related skills

ai rlhf dpo grpo agent_evaluation

📋 Description

Research AI agent capabilities focusing on safety, risk factors, and benchmarking methods.
Design harnesses to test agents' tendency to harmful actions under pressure or manipulation.
Design exploits and mitigations for failure modes as agents gain affordances like coding and web use.
Characterize and design mitigations for risks in multi-agent systems.

🎯 Requirements

Commitment to safe, secure, and trustworthy AI deployments.
Collaborative technical research; build evaluation harnesses and prototypes.
Experience with post-training and RL techniques: RLHF, DPO, GRPO.
Published ML research, especially in generative AI.
At least three years addressing sophisticated ML problems.
Strong written and verbal communication for cross-functional teams.

🎁 Benefits

Comprehensive health, dental, and vision coverage.
Retirement benefits.
Learning and development stipend.
Generous PTO.
Commuter stipend may be available.

Apply on employer's website

This employer gathers applications via their own applicant tracking system.

You will be redirected to an external application form.

Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.

Activate JobCopilot