Related skills: Python, AI, code review, large language models, RLHF
Description
- Help train LLMs to write production-grade code across languages.
- Compare and rank code snippets, explaining which option is best and why.
- Repair and refactor AI-generated code for correctness and style.
- Inject feedback into the RLHF pipeline and reward signals.
- End result: model learns to propose, critique, and improve code.
- Work fully remote in an asynchronous environment.
Requirements
- 3+ years of professional software engineering experience in Python.
- Strong code-review instincts; able to spot logic errors, performance issues, and security flaws.
- Extreme attention to detail and excellent written communication.
- Enjoy reading documentation and language specs; thrive asynchronously.
- No prior RLHF or AI training experience required.
- No deep machine learning knowledge required; we'll teach you the rest.
- Constraint programming experience is a bonus.
Benefits
- Location: Fully remote; work from anywhere.
- Hours: 15 to 40 hours per week; flexible scheduling.
- Engagement: 1099 contract.
- Straightforward impact, zero fluff.