Staff Software Engineer, Inference

Added
1 month ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

rust aws python kubernetes gcp

πŸ“‹ Description

  • End-to-end work on inference infrastructure for Claude.
  • Address blockers to serve millions of users.
  • Drive performance, scaling, and orchestration of services.
  • Support multi-accelerator deployments across cloud platforms.
  • Familiarity with LLM inference optimization encouraged.

🎯 Requirements

  • Significant software engineering experience with distributed systems.
  • Experience with performance optimization and large-scale orchestration.
  • LLM inference optimization, batching, caching strategies.
  • Kubernetes and cloud infrastructure (AWS, GCP).
  • Python or Rust.
  • Strong problem-solving and impact-driven mindset.

🎁 Benefits

  • Competitive compensation and benefits.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
  • Office space in London.

πŸ›ƒ Visa sponsorship

Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest β€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs β†’