Related skills
Go, LLMs, RLHF, constraint programming
Description
- Train LLMs to write production-grade code across languages.
- Compare and rank code snippets; explain best choice.
- Repair and refactor AI-generated code for correctness.
- Inject feedback into RLHF pipeline; keep it running smoothly.
- End result: model learns to propose, critique, and improve code.
- RLHF: Generate code → rank/edit → reward signals → tuning.
Requirements
- 3+ years of Go software engineering.
- Strong code-review instincts.
- Extreme attention to detail and clear written communication.
- Comfortable reading docs and specs; asynchronous work.
- Identity verification is required to work as an independent contractor.
- No prior RLHF or AI training experience required.