Related skills
prompt engineering llms genai quality control evaluationπ Description
- Execute GenAI-powered labeling and evaluation systems
- Identify opportunities where LLMs improve quality, speed, cost
- Build prototypes for prompts, task decomposition, quality estimation, and routing
- Design experiments and metrics to evaluate model performance and outcomes
- Partner with engineering, product, and data science to productionize approaches
π― Requirements
- 6+ years of post-grad/industry experience applying scientific methods to large-scale data (or PhD + 3 years)
- Strong hands-on IC experience solving complex data science/ML problems
- Experience applying LLMs or generative AI to workflows, systems, or products
- Ability to turn ambiguous problems into rigorous analyses, experiments, and prototypes
- Track record of writing high-quality code and influencing product direction
- Cross-functional collaboration and strong business/product sense
- Self-directed learner in a rapidly evolving technical landscape
π Benefits
- Hybrid work with in-office 1-2 times per quarter
- Location anywhere in the country under PinFlex
- PinFlex details available on the PinFlex page
- Inclusive, equal-opportunity employer
- Equity and benefits described on Pinterest careers pages
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Data Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!