Related skills
ruby llm rlhf asynchronous
Description
- Help train LLMs to write production-grade code across languages
- Compare and rank code snippets, explaining the best choice
- Repair and refactor AI-generated code for correctness and style
- Inject feedback into the RLHF pipeline and keep it running
- End result: the model learns to propose, critique, and improve code
- RLHF in one line: the model generates code; reviewers rank, edit, and justify; those judgments become reward signals
Requirements
- 3+ years of professional Ruby software engineering
- Strong code-review instincts to spot errors
- Extreme attention to detail and clear written communication
- Comfortable reading docs and working asynchronously
- Identity verification required for independent contractor status
Benefits
- Fully remote: work from anywhere in accepted locations
- Compensation: $30-$70/hr based on location and seniority
- Hours: 15-40+ hrs/week, project-dependent
- Engagement: 1099 independent contractor
- Paid weekly via PayPal or Stripe