CoreWeave

11-50 employees
69 jobs posted

Staff Software Engineer, Inference

Added

20 days ago

Location

Type

Full time

Related skills

Python, Kubernetes, Go, CUDA, Triton

📋 Description

Lead architecture, performance, and reliability across services.
Drive cross-team design initiatives for inference workloads.
Optimize latency, throughput, and GPU utilization in production.
Tackle scheduling, batching, and memory optimization in Kubernetes infrastructure.
Provide hands-on technical leadership shaping engineering direction.
Work on distributed systems and Kubernetes-based infrastructure.

🎯 Requirements

8–12+ years building large-scale distributed systems or cloud platforms.
Proven cross-team leadership across multiple services.
Strong Go, Python, or C++ programming skills.
Production-scale Kubernetes expertise.
Experience with inference frameworks: vLLM, Triton, TorchServe.
GPU systems experience: CUDA, NCCL, RDMA, NUMA.

🎁 Benefits

Medical, dental, and vision insurance – 100% paid by CoreWeave.
401(k) with generous employer match.
Flexible PTO.
Tuition reimbursement.
Employee Stock Purchase Program (ESPP).
Parental leave and family-forming support.
