Related skills
rust python kubernetes go faiss๐ Description
- Design and build components of a multi-tenant inference platform integrated with Atlas
- Collaborate with AI engineers to productionize inference for embedding models and rerankers
- Contribute to latency-aware routing, model versioning, health monitoring, and observability
- Improve performance, autoscaling, GPU utilization, and resource efficiency in a cloud-native env
- Work across product, infrastructure, and ML teams to meet Atlas scale, reliability, and latency needs
๐ฏ Requirements
- 2+ years of experience building backend or infrastructure systems at scale
- Strong software engineering in Go, Rust, Python, or C++, focused on performance and reliability
- Experienced in cloud-native architectures, distributed systems, and multi-tenant service design
- Familiar with ML model serving and inference runtimes
- Knowledge of vector search systems (Faiss, HNSW, ScaNN) is a plus
- Comfortable working across cross-functional teams and with Atlas
๐ Benefits
- Be part of building the AI foundation for MongoDB Atlas
- Collaborate with ML researchers from Voyage.ai on scalable systems
- Tackle inference, observability, and distributed infrastructure challenges
- Culture of mentorship, ownership, and technical excellence
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!