Related skills
data analysis python llms measurement hci๐ Description
- Lead research on jobs-to-be-done benchmarks for AI systems
- Define task taxonomies grounded in real professional work
- Design benchmarks for AI copilots, not basic QA
- Run empirical studies on how people use AI to solve tasks
- Build AI systems and evaluation infrastructure for task-level measurement
- Collaborate with UX Research to leverage insights
๐ฏ Requirements
- PhD or equivalent in HCI, CS, Cognitive Science
- 3+ years post-PhD research experience
- Strong publication record in top-tier HCI venues CHI
- Experimental design and measurement expertise
- Python and data analysis ML tooling
- Familiarity with LLM APIs, agent frameworks, or AI tooling
๐ Benefits
- Equity in a fast-growing company
- 401(k) match and financial coaching
- Paid parental leave and fertility benefits
- Medical, dental, and vision and mental health support
- 2,000 learning stipend and ongoing development
- Remote and Office: internet reimbursement and SF office perks
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!