Added
7 days ago
Location
Type
Full time
Salary
Upgrade to Premium to se...
Related skills
data analysis python apis llms benchmarking📋 Description
- Lead high-impact research on jobs-to-be-done benchmarks for AI systems.
- Define task taxonomies grounded in real professional activities.
- Develop methods to measure human activity in AI-mediated workflows.
- Design benchmarks to assess AI copilots rather than autonomous agents.
- Build evaluation infrastructure and data pipelines for human–AI interaction.
- Collaborate with UX Research to translate insights and publish work.
🎯 Requirements
- PhD or equivalent in HCI, CS, Cognitive Science, or related field.
- 3+ years of academic or industry research post-PhD.
- Strong publication record in top-tier HCI venues; CHI experience preferred.
- Expertise in experimental design and measurement of human activity.
- Python and data analysis; experience building experimental systems.
- Ability to lead research agendas and collaborate across teams.
🎁 Benefits
- Equity in a fast-growing company.
- 401(k) match and financial coaching.
- Paid parental leave and fertility benefits.
- Medical, dental, and vision coverage; mental health support.
- $2,000 learning stipend and ongoing development.
- Remote and SF office perks and flexible PTO with holidays.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!