Design, build, and maintain AI inference infra with high throughput and low latency.
Own end-to-end deployment pipelines for real-time vision and LLMs.
Architect and scale GPU-enabled Kubernetes clusters with autoscaling.
Build WebRTC-based infra for real-time AI agents (STT/TTS) at low latency.
Drive inference scaling with speculative decoding, batching, and model parallelism.
Implement Terraform and GitOps for GPU AI environments.

5+ years of infra engineering experience, with 2+ years of AI/ML in production.
Strong Kubernetes experience with GPU workloads, including scheduling and autoscaling.
Hands-on experience with model serving and inference optimization for CV and LLM workloads.
Inference optimization: speculative decoding, batching, quantization, scaling.
Experience provisioning infra for real-time AI systems including WebRTC clusters.
Familiarity with real-time video CV inference pipelines and low-latency STT/TTS.
IaC (Terraform or similar) and GitOps for GPU environments.
Fluent in English.

Senior AI Infrastructure Engineer (Europe based - Remote)