Anthropic

165 jobs posted

View company profile →

Please mention that you found this job on empllo.com. Thanks & good luck!

Tired of Manually Applying to Jobs?

Let JobCopilot do it for you!

Set your preferences and let your AI copilot handle the job search while you sleep.

Applies for jobs that actually match your skills
Tailors your resume and cover letter automatically
Works 24/7—so you don't have to

Activate JobCopilot

Follow us on LinkedIn!

Research Engineer, Safeguards Labs

Added

less than a minute ago

Location

🇺🇸 San Francisco

Type

Full time

Salary

Upgrade to Premium to se...

Related skills

python machine learning ai llms evaluation

📋 Description

Lead research projects on detecting Claude misuse and strengthening safeguards.
Design offline analyses of model usage to surface abuse patterns.
Develop prototypes to feed signals into real-time safeguards with engineers.
Study methods for detecting abusive behavior in chat-based workflows.
Build evaluations to measure safeguards effectiveness.
Publish findings to inform Trust & Safety, research, and product teams.

🎯 Requirements

Independently drive research projects to concrete AI/ML results.
Scope work and switch between research, engineering, and analysis.
Familiar with how large language models operate: sampling, prompting, training.
Proficient in Python and comfortable with large datasets.
Care about AI's societal impacts and reducing real-world harm.
Experience building ML models for abuse, fraud, or safety.

🎁 Benefits

Optional equity donation matching
Generous vacation and parental leave
Flexible working hours
Office space for collaboration

🛃 Visa sponsorship

Apply on employer's website

This employer gathers applications via their own applicant tracking system.

You will be redirected to an external application form.

Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.

Activate JobCopilot