Researcher: Agent Post-Training, API & Power-Users

Type
Full time

Related skills

machine learning, APIs, training, LLMs, AI tools

📋 Description

  • Design and run experiments to improve model behavior in API workflows.
  • Build evals, graders, and environments from real developer workflows.
  • Turn observed failures into training data and model improvements.
  • Partner with API users and power users to identify high-leverage behavior gaps.
  • Own end-to-end model behavior projects from data generation to launch readiness.
  • Improve large-scale training machinery: speed, reliability, observability.

🎯 Requirements

  • Strong technical fundamentals in ML, software engineering, systems, statistics, or applied research.
  • Hands-on experience with LLMs, post-training, evals, graders, synthetic data, or production ML.
  • Ability to analyze transcripts/evals and form concrete hypotheses about model behavior.
  • Experience tackling ambiguous capability problems where signal is noisy and failures are qualitative.
  • Deep care for developer and expert-user experience with API products and agent harnesses.
  • Comfortable working across research, product, infrastructure, data, evals, and safety, and able to communicate clearly.

🎁 Benefits

  • Offers equity
  • Hybrid work environment
  • Opportunity to work on frontier AI research at OpenAI