Related skills
aws python typescript rag langgraph๐ Description
- Design and ship an end-to-end AI evaluation framework (offline and production).
- Define metrics: task completion, hallucination rates, and quality.
- Build eval datasets, test harnesses, and automated scoring.
- Architect reusable agent infra: multi-turn workflows and LangGraph.
- Scale RAG pipelines and vector store retrieval quality.
- Improve production AI systems with reliability and observability.
๐ฏ Requirements
- 5+ years of professional software engineering, AI/ML focus.
- Hands-on with LLM-based systems: prompts, RAG, and orchestration.
- Proven ability to work with data and statistics in experiments.
- Production-grade agent AI systems: multi-turn workflows and topologies.
- Strong AI evaluation: built eval frameworks; avoid vanity metrics.
- Python engineering (maintainable, testable); TypeScript a plus.
๐ Benefits
- Medical, dental, life, AD&D, and disability insurance.
- Wellness apps and remote work support.
- Remote stipend and one-time WFH setup stipend.
- Retirement plan, financial planning, and learning & development budget.
- Accommodations available on request.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!