NewsBreak

201-500 employees
8 jobs posted

View company profile →

Please mention that you found this job on empllo.com. Thanks & good luck!

Tired of Manually Applying to Jobs?

Let JobCopilot do it for you!

Set your preferences and let your AI copilot handle the job search while you sleep.

Applies for jobs that actually match your skills
Tailors your resume and cover letter automatically
Works 24/7—so you don't have to

Activate JobCopilot

Follow us on LinkedIn!

Machine Learning Engineer, LLM Post-Training

Added

1 hour ago

Location

🇺🇸 Mountain View

Type

Full time

Salary

Upgrade to Premium to se...

Related skills

pytorch llm rlhf deepspeed cpt

📋 Description

Lead LLM post-training across CPT, SFT, and RL (RLHF).
Design and curate data for each training stage (datasets, rewards).
Collaborate with business/product teams to map use cases to training plans.
Train at scale on mid-to-large GPU clusters with distributed training.
Build evaluation and verifier pipelines to measure model quality.
Stay current with post-training research and ship production-ready code.

🎯 Requirements

Hands-on LLM post-training experience with RL (RLHF/PPO/DPO).
Strong ML data engineering; design data-prep plans for business needs.
Proven large-scale GPU training on mid-to-large hardware; distributed training.
Strong PyTorch fundamentals; TRL/Accelerate/DeepSpeed/FSDP; vLLM.
Solid tokenization, attention knowledge and alignment/failure modes.
Bias toward fast iteration with cross-team communication.

🎁 Benefits

Health, dental, and vision coverage for you and family (employee 100%).
401(k) plan with company matching.
Paid time off and holidays.
FSA/HSA and commuter benefits programs.
Team activity budget.

Apply on employer's website

This employer gathers applications via their own applicant tracking system.

You will be redirected to an external application form.

Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.

Activate JobCopilot