Related skills
python tensorflow pytorch llms rlhf
Description
- Design, build, and ship ML systems for autonomous underwriting in production
- Build and close feedback loops turning underwriter behavior into training signals
- Develop confidence scoring to determine when the system is ready for more autonomy
- Work with LLMs to build reliable, auditable agentic workflows
- Partner with underwriters to extract domain knowledge and earn trust
- Contribute to observability and guardrails for AI underwriting safety
Requirements
- 4+ years building and shipping ML models end-to-end
- 4+ years with model deployment platforms (e.g., AWS SageMaker)
- Experience fine-tuning SLMs/LLMs with RLHF, DPO, or LoRA
- Proficient in Python and ML frameworks (PyTorch, HuggingFace, TensorFlow)
- Production experience with LLMs: prompts, structured outputs, tools, eval
- Experience building models with limited labeled data (synthetic data, augmentation)
Benefits
- Premium healthcare with 100% top-tier health, dental, and vision
- Fertility benefits and family-building support
- Unlimited PTO
- Daily meals and snacks
- Offices in SF, NYC, Dallas-Fort Worth, Chicago and LA
- Professional development coaching
- 401(k) plan
- Dog-friendly SF office