Related skills
kubernetes, distributed systems, observability, routing, llm
Description
- Develop infrastructure and orchestration for large-scale LLM inference
- Work across the stack from customer features to low-level infrastructure
- Build platform capabilities for routing, autoscaling, scheduling, observability
- Improve reliability, scalability, and usability of our inference stack
- Collaborate with Model Performance engineers to enable inference optimizations for customers
- Define best practices for testing, release automation, benchmarking
Requirements
- Bachelor's, Master's, or Ph.D. in Computer Science, Engineering, or related field
- Strong background in distributed systems, backend infrastructure, or platform engineering
- Experience building and operating production systems where reliability, latency, and scale are first-class concerns
- Strong sense of developer experience: you think about how systems are used, not just how they work
- Motivated and willing to learn new languages, frameworks, and systems as needed
- Ability to debug complex systems across multiple layers of the stack
Benefits
- Competitive compensation with meaningful equity
- 100% medical, dental, and vision coverage for dependents
- Flexible PTO including company Winter Break
- Paid parental leave
- Fertility and family-building stipend through Carrot
- Company-facilitated 401(k)