Related skills
python ruby on rails dashboards rag llm๐ Description
- Build evaluation infrastructure for AI speed and accuracy offline and online.
- Create observability dashboards surfacing week-over-week quality metrics.
- Diagnose quality gaps and trace causes in retrieval, ranking, or prompting.
- Experiment with models and agent configurations using data to guide decisions.
- Prototype and validate RAG pipeline improvements such as chunking, retrieval, and re-ranking.
- Analyze how customers use AI features to identify improvements or new areas.
๐ฏ Requirements
- Production AI systems experience: RAG pipelines, search/retrieval, LLM apps, evaluative frameworks.
- Strong Python proficiency for prototyping.
- Open to learning Ruby on Rails and integrating with Rails codebase.
- Comfortable building infra and tooling (eval pipelines, dashboards, data processing).
- Deep understanding of RAG architecture: chunking, embeddings, retrieval, re-ranking, context.
- Proficient in English (CEFR C2 / ILR 5).
๐ Benefits
- Fully remote โ work from anywhere in the world.
- 35 days PTO annually plus a sabbatical after 5 years.
- Equity plus competitive cash compensation.
- 100% medical coverage for you and your family (or reimbursement where applicable).
- Parental leave and home office stipend.
- Learning and development stipend, plus annual bonus potential and company retreats.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!