Related skills
compliance sql spreadsheets dashboards evaluationπ Description
- Run evaluations to support model launch readiness and surface regressions.
- Coordinate with policy experts and Safeguards teams across the evaluation lifecycle to scope evals and keep them current.
- Manage evaluation outcomes with cross-functional stakeholders, interpret results, and drive mitigations.
- Improve eval quality by building scalable processes and high-signal paradigms.
- Develop product-specific evaluation frameworks as Anthropic's product surface expands.
- Design tooling improvements to enable self-serve eval creation for non-technical users.
π― Requirements
- Experience in trust and safety, content operations, or policy enforcement at a tech company.
- Thrive in ambiguous, fast-moving environments
- Experience building processes/workflows from scratch
- Strong program management instincts with timelines and deliverables
- Proficient with data tools (SQL, dashboards, spreadsheets)
- Clear, concise written and cross-functional communication
π Benefits
- Competitive compensation and benefits
- Optional equity donation matching
- Generous vacation and parental leave
- Flexible working hours
- Office space in San Francisco
π Visa sponsorship
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Operations Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!