Saviynt

501-1,000 employees
42 jobs posted

AI Platform Engineer – Training & Inference

Added: 5 days ago
Location: 🇺🇸 San Francisco
Type: Full-time
Salary: Not provided

Related skills

PyTorch, TensorRT, Ray, GKE, NVIDIA Triton

📋 Description

Own Ray ecosystem end-to-end on GKE
Operate Ray Train on multi-node H100 clusters
Build LLM inference mesh with Ray Serve
Optimize inference: fractional GPUs, batching, autoscaling
Design model routing layer for multi-tenant LLMs
Build RL training infra with Flyte and RLlib

🎯 Requirements

ML engineering experience focused on ML platforms or MLOps
Production Ray depth: Train, Serve, Core, Data
LLM serving engines: vLLM, SGLang, NVIDIA Triton
Distributed training: DDP, FSDP, NCCL, mixed precision BF16/FP8
RL knowledge: PPO, policy gradient, RLHF
Model lifecycle ops: MLflow registry, shadow/A/B/canary, auto rollback
Vector databases: pgvector or Qdrant
Python and PyTorch; Flyte or equivalent ML orchestrator

🎁 Benefits

Competitive total rewards package
Opportunities for growth and advancement
