Related skills
llms, genai, evaluation, human-in-the-loop, labeling
Description
- Drive high-impact GenAI-powered labeling and evaluation systems
- Identify where LLMs can improve quality, speed, coverage, and cost
- Develop prototypes for prompt optimization, task decomposition, quality estimation, and routing
- Design measurement frameworks to evaluate model performance and tradeoffs
- Partner with eng, product, and data science to productionize approaches
- Establish standards for trustworthiness including bias, calibration, and quality control
Requirements
- 10+ years of post-graduate experience applying scientific methods to large-scale data
- Deep hands-on IC experience solving complex data science or ML problems
- Strong experience applying LLMs or generative AI to practical workflows
- Turn ambiguous problems into rigorous analyses, experiments, and prototypes
- Proven track record of writing high-quality code and shaping product direction
- Strong cross-functional collaboration and influence through data and judgment
Benefits
- PinFlex: flexible, distributed work model
- Equity and competitive compensation
- Inclusive, equitable workplace
- Transparent compensation philosophy
- Opportunity to impact GenAI labeling platforms