Tired of Manually Applying to Jobs?

Let JobCopilot do it for you!

Set your preferences and let your AI copilot handle the job search while you sleep.

Applies for jobs that actually match your skills
Tailors your resume and cover letter automatically
Works 24/7—so you don't have to

Activate JobCopilot

Staff Software Engineer, Ads ML Inference Infrastructure

Hybrid

Engineering

Added

2 hours ago

Location

Type

Full time

Salary

Upgrade to Premium to se...

Apply Now

Save job

Related skills

tensorflow pytorch cuda inference triton

📋 Description

Lead next-gen model inference and feature serving for 100x larger models.
Design low-latency, high-throughput inference pipelines to meet SLOs.
Collaborate to productionize new model architectures (LLMs, ranking) and scale globally.
Evolve online feature platform for coverage, freshness, consistency.
Evaluate GPU acceleration, model compression, Triton, vLLM, Dynamo.
Partner with infra/ML teams to boost reliability and velocity.

🎯 Requirements

BS degree in Computer Science or related field.
~8+ years designing/operating large-scale ML or distributed infra.
Deep knowledge of Java, C++, Python.
Distributed systems or ads infra (routing, storage, caching).
Hands-on with PyTorch or TensorFlow.
Proven track record leading complex projects and mentoring.

🎁 Benefits

Hybrid work model; in-person 1-2 days per week near Palo Alto/SF/Seattle.
PinFlex flexible working options and information page.

Apply on employer's website

This employer gathers applications via their own applicant tracking system.

You will be redirected to an external application form.

Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.

Activate JobCopilot