Senior Software Engineer - AI Interaction Evaluator (Codex / Claude Code, up to $200/hr)
Related skills
javascript, python, typescript, cursor, claude code
Description
- Evaluate AI-generated coding interactions end-to-end
- Judge usefulness, correctness, and engineering judgment
- Assess explanations and reasoning quality
- Distinguish levels of response quality (e.g., a 2 vs. a 4)
- Provide direct, opinionated feedback on what worked, what didn't, and what felt off
- Help define what great interaction looks like when using Cursor and AI tools
Requirements
- Staff / Principal-level engineer (or equivalent)
- Strong background in TypeScript / JavaScript
- Python experience
- Hands-on with OpenAI Codex
- Hands-on with Claude Code
- Cursor or similar AI-first IDEs
Benefits
- Take-home evaluation exercise
- One behavioral interview
- Remote contract engagement
- Start ASAP
- ~10-20 hours/week
- Through early May (possible extension)