Anthropic

256 jobs posted

View company profile →

Please mention that you found this job on empllo.com. Thanks & good luck!

Tired of Manually Applying to Jobs?

Let JobCopilot do it for you!

Set your preferences and let your AI copilot handle the job search while you sleep.

Applies for jobs that actually match your skills
Tailors your resume and cover letter automatically
Works 24/7—so you don't have to

Activate JobCopilot

Follow us on LinkedIn!

Research Engineer, Environment Scaling

Added

41 minutes ago

Location

🌍 North America

Type

Full time

Salary

Upgrade to Premium to se...

Related skills

data pipelines large language models reinforcement learning fine-tuning claude

📋 Description

Own end-to-end RL environment creation for new capabilities.
Improve and execute fine-tuning for Claude in new domains.
Manage external data vendors; evaluate data quality and rewards.
Collaborate with domain experts on data pipelines and evals.
Explore RL environment designs for high-value tasks.
Develop QA frameworks to catch reward hacking and env quality.

🎯 Requirements

Experience fine-tuning LLMs for specific domains or real-world use cases.
Experience with RL, reward design, or data curation for LLMs.
Comfortable managing vendor relationships and rapid iteration loops.
Strong project management and interpersonal skills.
Bachelor's degree or equivalent experience.
Excited about a role combining ML research, data ops, and PM.

🎁 Benefits

Competitive compensation and benefits.
Optional equity donation matching.
Generous vacation and parental leave.
Flexible working hours.
Office space for collaboration.

🛃 Visa sponsorship

Apply on employer's website

This employer gathers applications via their own applicant tracking system.

You will be redirected to an external application form.

Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.

Activate JobCopilot