Related skills
python apis langchain telemetry langraph๐ Description
- Design and implement RL environments for large-scale agent evaluation.
- Build task generation pipelines and dynamic datasets with controlled stochasticity.
- Develop verifiers and reward models to score trajectories and evaluate model reasoning.
- Collaborate with infra and systems engineers to ensure scalable, instrumented environments.
- Design APIs and orchestration frameworks for running, resetting, and evaluating agents.
- Optimize environment performance, logging, and reward reproducibility across distributed setups.
๐ฏ Requirements
- Strong experience in Python software engineering.
- Minimum 3 years in Data Scientist, ML/Environment Engineering, or similar.
- Ability to work from 6 AM - 2 PM Pacific time.
- Bachelor's degree in Computer Science or related field.
- Practical knowledge of AI frameworks (Langchain, Langraph, mcp-server).
- Extensive AI experience, including prompt engineering.
- Familiarity with instrumentation, metrics, and data pipelines for RL evaluation.
- Expertise in planning your own work.
๐ Benefits
- 100% Remote
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!