Senior Software Engineer, Inference Platform

Added: 15 days ago
Type: Full time

Related skills

rust python kubernetes go faiss

📋 Description

  • Design multi-tenant inference platform components with Atlas
  • Collaborate to productionize embedding models and rerankers
  • Add latency-aware routing, versioning, health, and observability
  • Improve autoscaling, GPU utilization, and cloud-native efficiency
  • Coordinate with product, infra, and ML teams to support Atlas at scale
  • Work hands-on with vLLM and Kubernetes for orchestration

🎯 Requirements

  • 5+ years building backend or infra systems at scale
  • Proficiency in Go, Rust, Python, or C++, with a focus on performance
  • Cloud-native architectures, distributed systems, multi-tenant design
  • Familiarity with ML model serving and inference runtimes
  • Vector search knowledge (Faiss, HNSW) is a plus
  • Comfortable collaborating across product/infra/ML teams

🎁 Benefits

  • Base salary range: $118,000–$231,000 USD
  • Equity and employee stock purchase program
  • Generous parental leave (20 weeks)
  • Fertility and adoption assistance
  • 401(k) plan
  • Mental health resources