Added
8 days ago
Location
Type
Full time
Related skills
grafana prometheus kubernetes go langchain
Description
- Design and deploy highly available, multi-tenant telemetry APIs.
- Modernize data interactions with agentic experiences and MCP servers.
- Develop agentic observability capabilities for guided debugging and optimization.
- Improve health of telemetry pipelines with correlation primitives and aggregation.
- Improve performance, security, and reliability, and reduce latency; participate in on-call.
- Collaborate with internal teams to embed observability as a product.
Requirements
- Six+ years of software or infrastructure engineering experience, including distributed APIs.
- Customer-obsessed, with a product-minded approach to developer surfaces such as SDKs and CLIs.
- Reliability engineering concepts; error budgets; LLM evaluation datasets; multi-tenant design.
- Familiar with ClickHouse, Loki, VictoriaMetrics, Prometheus, Grafana.
- Experience building agentic apps/LLM features; grounding, tool calling, safety.
- Go as primary language; able to collaborate on Python components.
Benefits
- Medical, dental, and vision insurance - 100% paid by CoreWeave
- Company-paid Life Insurance
- Short and long-term disability insurance
- 401(k) with generous employer match
- Flexible PTO
- Tuition Reimbursement