Type: Full time
Related skills: distributed systems, observability, autoscaling, GPU, model serving

Description
- Design and implement core systems and APIs powering Model Serving.
- Drive architecture decisions to optimize CPU/GPU performance and autoscaling.
- Contribute to components across serving infra: containers, routing, observability.
- Collaborate with product, platform, and research teams to build reliable systems.
- Lead technical initiatives to improve latency, availability, and cost.
- Establish best practices for code quality, testing, and readiness; mentor engineers.
Requirements
- 5+ years building and operating large-scale distributed systems.
- Experience in model serving, inference systems, or related infra (routing, autoscaling).
- Strong foundation in algorithms, data structures, and system design for low-latency serving.
- Proven ability to deliver technically complex, high-impact initiatives with measurable value.
- Experience architecting large-scale CPU/GPU inference systems.
- Strong communication and collaboration across teams in fast-moving environments.
Benefits
- Pay range transparency is provided; see the pay zone mapping page.
- Benefits vary by region; visit the Databricks benefits site for details.