Added
24 minutes ago
Type
Full time

Related skills

rust python kubernetes go faiss

📋 Description

  • Design and build components of a multi-tenant inference platform integrated with Atlas
  • Collaborate with AI engineers to productionize inference for embedding models and rerankers
  • Contribute to latency-aware routing, model versioning, health monitoring, and observability
  • Improve performance, autoscaling, GPU utilization, and resource efficiency in a cloud-native environment
  • Work across product, infrastructure, and ML teams to meet Atlas scale, reliability, and latency needs

🎯 Requirements

  • 2+ years of experience building backend or infrastructure systems at scale
  • Strong software engineering skills in Go, Rust, Python, or C++, with a focus on performance and reliability
  • Experience with cloud-native architectures, distributed systems, and multi-tenant service design
  • Familiar with ML model serving and inference runtimes
  • Knowledge of vector search systems (Faiss, HNSW, ScaNN) is a plus
  • Comfortable working across cross-functional teams and with Atlas

๐ŸŽ Benefits

  • Be part of building the AI foundation for MongoDB Atlas
  • Collaborate with ML researchers from Voyage.ai on scalable systems
  • Tackle inference, observability, and distributed infrastructure challenges
  • Culture of mentorship, ownership, and technical excellence