Added
less than a minute ago
Location
Type
Full time
Salary
Upgrade to Premium to se...
Related skills
python pytorch model training large language models reinforcement learningπ Description
- Design and run alignment experiments focused on intent following, honesty, calibration, and robustness.
- Train and evaluate models using reinforcement learning, and other empirical ML methods.
- Develop evaluations for failure modes such as hallucination, instruction-following failures, reward hacking, covert actions, and scheming.
- Study methods that encourage models to verify their behavior and report shortcomings honestly, including confession-style training objectives.
- Build monitoring and inference-time interventions that ensure compliant behavior or surface model issues to users or downstream systems.
- Investigate how alignment methods scale with model capability, compute, data, context length, action length, and adversarial pressure.
- Integrate successful techniques into model training and deployment workflows.
- Produce externally publishable research when results advance the broader science of alignment.
- Collaborate with researchers and engineers across post-training, RL, evaluations, safety, and product-facing teams.
π― Requirements
- Hands-on experience training, evaluating, or debugging large ML models (LLMs).
- Strong Python and PyTorch engineering skills.
- Mathematical rigor and quantitative thinking.
- Experience with RL, post-training, or scalable ML research.
- Ability to work independently with minimal day-to-day guidance.
- Thrives in fast-paced, collaborative research environments.
- Strong track record in rigorous problem solving (competition, systems, etc.).
- Commitment to trustworthy, honest AI in high-stakes settings.
- Motivated to advance alignment methods that can be tested and deployed.
π Benefits
- Hybrid work model: 3 days in office per week.
- Relocation assistance for new employees.
- Exceptional remote candidates considered.
π Relocation support
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!