Related skills
rust python kubernetes go faiss📋 Description
- Design multi-tenant inference platform components with Atlas
- Collaborate to productionize embedding models and rerankers
- Add latency-aware routing, versioning, health, and observability
- Improve autoscaling, GPU utilization, and cloud-native efficiency
- Coordinate with product/infra/ML teams for Atlas scale
- Hands-on with vLLM and Kubernetes for orchestration
🎯 Requirements
- 5+ years building backend or infra systems at scale
- Go, Rust, Python, or C++, with performance focus
- Cloud-native architectures, distributed systems, multi-tenant design
- Familiar with ML model serving and inference runtimes
- Vector search knowledge (Faiss, HNSW) is a plus
- Comfortable collaborating across product/infra/ML teams
🎁 Benefits
- Base salary range: $118,000–$231,000 USD
- Equity and employee stock purchase program
- Generous parental leave (20 weeks)
- Fertility and adoption assistance
- 401(k) plan
- Mental health resources
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!