Related skills
rust aws python kubernetes gcp
Description
- End-to-end work on inference infrastructure for Claude.
- Address blockers to serve millions of users.
- Drive performance, scaling, and orchestration of services.
- Support multi-accelerator deployments across cloud platforms.
- Familiarity with LLM inference optimization encouraged.
Requirements
- Significant software engineering experience with distributed systems.
- Experience with performance optimization and large-scale orchestration.
- LLM inference optimization, batching, and caching strategies.
- Kubernetes and cloud infrastructure (AWS, GCP).
- Python or Rust.
- Strong problem-solving and impact-driven mindset.
Benefits
- Competitive compensation and benefits.
- Optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours.
- Office space in London.
Visa sponsorship