Staff Backend Software Engineer- (AI Platform)

Added
2 days ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

distributed systems apis scalability gpu vllm

πŸ“‹ Description

  • Design and build high-throughput, low-latency GPU inference systems.
  • Shape architecture for foundation model API across teams.
  • Design core systems and APIs powering Foundation Model Serving with scalability and reliability.
  • Drive architectural trade-offs to optimize performance and autoscaling for GPU serving.
  • Contribute to key components across serving infra including vLLM and SGLang.
  • Collaborate across product/platform/research; mentor engineers.

🎯 Requirements

  • 10+ years building and operating large-scale distributed systems.
  • Experience leading high-scale, operationally sensitive backend systems.
  • Track record of elevating teams' engineering excellence.
  • Strong foundation in algorithms, DS, and system design for low-latency serving.
  • Proven ability to deliver technically complex, high-impact initiatives.
  • Strong communication and cross-team collaboration in fast-moving environments.

🎁 Benefits

  • Comprehensive, region-specific benefits.
  • Career growth and mentorship opportunities.
  • Inclusive, diverse engineering culture.
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’