Related skills
node.js docker python kubernetes typescriptπ Description
- Build agents that investigate incidents; surface anomalies; explain production issues.
- Write reusable skills and libraries for debugging and incident response.
- Own agent stack end-to-end: context, tooling, evals, tracing, cost.
- Build MCP servers, SDKs, and integrations to read telemetry and act.
- Collaborate with OSS contributors and customers; debug problems with them.
- Address latency, cost, context window limits, eval coverage, and telemetry quirks.
π― Requirements
- 5+ years software engineering; 1β2 years on LLM-powered systems or agents.
- Strong backend skills in TypeScript/Node.js and/or Python.
- Hands-on agent building: multi-step tool use, planning, memory, error recovery.
- Experience designing skills (Markdown workflows) and when to use a skill vs a tool.
- Experience with MCP: servers, tool design, auth, scoping, observability.
- Strong evals practice: golden sets, LLM-as-judge, regression detection.
- SQL proficiency; write ClickHouse queries directly.
- Familiarity with Docker and Kubernetes.
π Benefits
- Flexible, remote-friendly, globally distributed.
- Healthcare contributions by employer
- Equity / stock options
- Flexible time off
- $500 home office setup for remote employees
- Global gatherings β company-wide offsites
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!