Related skills
c rlhf๐ Description
- Train LLMs to write production-grade code across languages.
- Compare and rank code snippets, explain best choice.
- Repair and refactor AI-generated code for quality.
- Inject feedback into the RLHF pipeline.
- End result: model learns to propose, critique, and improve code as you would.
- RLHF in one line: Generate code โ rank, edit, justify โ reward signals.
๐ฏ Requirements
- 3+ years of professional software engineering in C.
- Strong code-review instincts.
- Extreme attention to detail and strong written communication.
- Thrive in asynchronous, low-oversight environments.
๐ Benefits
- Location: Fully remote โ work from anywhere.
- Hours: 15โ40 hrs/week.
- Engagement: 1099 contract.
- Straightforward impact, zero fluff.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!