Related skills
Golang, Python, Distributed Systems, LLM, Ray Serve

Description
- Technical Leadership: Lead design and delivery of data plane components for large generative AI models.
- System Design: Architect high-scale, multi-tenant AI inference cloud components.
- Performance Optimization: Optimize distributed inference with tensor/data parallelism and caching.
- Collaboration: Align roadmaps with PMs, customer teams, and engineers.
- Mentorship: Mentor junior engineers and foster technical excellence.
- Operational Excellence: Maintain high-scale services with observability and SLOs.
Requirements
- Distributed Systems Expertise: Experience building distributed systems with microservices, messaging, databases, and infrastructure as code.
- AI/ML Domain Knowledge: Hosting large language or multimodal models with inference engines (vLLM, SGLang, Modular).
- Inference Frameworks: Distributed inference serving frameworks (llm-d, NVIDIA Dynamo, Ray Serve).
- Hardware & Interconnects: GPU optimization and interconnects (NVLink, xGMI, RoCE).
- Architecture Proficiency: LLM architectures and optimizations (batching, quantization).
- Software Engineering: Expert in Go or Python; experience with gRPC.
- Cloud Operations: Shaping customer-facing software in high-scale environments.
- Open Source Mindset: Building with open-source software.
Benefits
- Innovate with purpose; build cloud/AI for builders.
- Career development resources, conferences, training, LinkedIn Learning.
- Well-being programs and global benefits (vary by region).
- Competitive compensation with equity and ESPP options.
- Equal-opportunity employer and inclusive culture.
- Employee assistance, local meetups, and growth opportunities.