Related skills: Looker, Snowflake, SQL, Testing, Python

Description
- Lead the AI Evaluation team, including staffing, coaching, and delivery of evaluation frameworks.
- Oversee AI evaluation lifecycle from pre-launch testing to post-deployment health monitoring.
- Operationalize human-in-the-loop testing and feed reviewer feedback into improvement loops.
- Oversee simulation environments to stress-test LLMs and detect hallucinations.
- Partner with AI Platform & Governance to implement metrics, reporting, and health signals.
- Develop dashboards and reporting to track evaluation coverage, accuracy, and confidence.
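As a rough illustration of the "evaluation coverage, accuracy, and confidence" reporting mentioned above, the sketch below aggregates human-reviewer labels into simple dashboard metrics. All names here (`prompt_id`, `reviewed`, `label`, `evaluation_metrics`) are hypothetical and not taken from the posting; real pipelines would likely compute these in SQL against a warehouse such as Snowflake.

```python
# Hypothetical sketch: roll up human-reviewer labels into coverage and
# accuracy metrics for an LLM evaluation dashboard. Field names are
# illustrative assumptions, not part of any real schema.

def evaluation_metrics(records):
    """records: list of dicts with keys 'prompt_id', 'reviewed' (bool),
    and 'label' ('pass' or 'fail', present only when reviewed)."""
    total = len(records)
    reviewed = [r for r in records if r["reviewed"]]
    # Coverage: share of prompts that received a human review.
    coverage = len(reviewed) / total if total else 0.0
    # Accuracy: share of reviewed prompts the reviewers marked as passing.
    passes = sum(1 for r in reviewed if r["label"] == "pass")
    accuracy = passes / len(reviewed) if reviewed else 0.0
    return {"coverage": coverage, "accuracy": accuracy}

sample = [
    {"prompt_id": 1, "reviewed": True, "label": "pass"},
    {"prompt_id": 2, "reviewed": True, "label": "fail"},
    {"prompt_id": 3, "reviewed": False},
]
print(evaluation_metrics(sample))
```

In practice these two numbers would feed a Looker dashboard alongside trend lines and confidence intervals, but the aggregation logic stays this simple at its core.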
Requirements
- 7+ years in AI/ML operations, quality, or evaluation; 2+ years people leadership.
- Deep understanding of LLM behavior, prompt testing, and evaluation methodologies.
- Familiarity with human-in-the-loop frameworks and prompt testing tools.
- Strong program management and stakeholder communication skills.
- SQL and Python proficiency; Looker or Snowflake experience.
- Experience collaborating with Engineering, Data Science, and Risk/Compliance on AI initiatives.
Benefits
- Four days in-office per week; Fridays from home for employees near an office.
- Backup child, elder, and pet care, plus subsidized commuter benefit.
- Competitive salary based on experience.
- 401k match and medical/dental/vision benefits.
- Generous vacation policy and company-wide events.
- Parental leave, Maven program, and community involvement.