Related skills
CUDA, TensorRT, ROCm, Transformer, MoE

Description
- Lead benchmarking and perf optimizations for inference engines.
- Engineer solutions for memory bandwidth and compute bottlenecks.
- Implement cutting-edge optimization techniques to lead in the Gen AI landscape.
- Improve performance across batch sizes; tune AITER CK/ASK kernels for FP8/BF16.
- Identify kernel fusion opportunities in GLM-5 Transformer blocks.
- Tune router (gating) kernels for MoE models like Qwen3-235B.
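To make the router-kernel bullet concrete, here is a minimal sketch of what an MoE router computes: top-k expert selection plus renormalized gate weights. This is an illustrative NumPy reference, not this team's implementation; the function name `topk_route` and the shapes are assumptions, and production engines fuse this logic into a single GPU kernel.

```python
import numpy as np

def topk_route(logits, k=2):
    """Toy MoE router: per token, pick the k experts with the largest
    logits and softmax-normalize the gate weights over only those k."""
    # indices of the k largest logits per token (ascending within the k)
    idx = np.argsort(logits, axis=-1)[:, -k:]
    sel = np.take_along_axis(logits, idx, axis=-1)
    # numerically stable softmax over the selected experts only
    sel = sel - sel.max(axis=-1, keepdims=True)
    w = np.exp(sel)
    w /= w.sum(axis=-1, keepdims=True)
    return idx, w

logits = np.array([[0.1, 2.0, -1.0, 0.5]])  # 1 token, 4 experts
idx, w = topk_route(logits, k=2)            # experts 3 and 1 selected
```

In a real engine this per-token argmax/softmax is latency-critical at large batch sizes, which is why the posting calls out tuning these kernels specifically.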
Requirements
- 5+ years in HPC or AI infra solving compute and memory bottlenecks.
- Gen AI literacy across the LLM/VLM/LMM landscape.
- Optimization expertise in attention layers and distributed GPU parallelism.
- Hardware fluency with NVIDIA/AMD GPUs and CUDA/ROCm.
- Open-source mastery, with a track record of contributing to OSS projects.
- Systems design skills: low-level GPU programming and memory access patterns.
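Since the requirements single out attention-layer optimization, a compact reference for scaled dot-product attention helps frame what gets optimized. This is an unfused NumPy baseline under assumed 2-D shapes (no batching, no masking); fused kernels such as FlashAttention compute the same result without materializing the full score matrix.

```python
import numpy as np

def sdpa(q, k, v):
    """Reference scaled dot-product attention: softmax(q k^T / sqrt(d)) v.
    Deliberately naive; optimized engines fuse these steps on the GPU."""
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)                 # (seq_q, seq_k) score matrix
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
    p = np.exp(scores)
    p /= p.sum(axis=-1, keepdims=True)            # row-wise softmax
    return p @ v                                  # (seq_q, d_v) output

k = np.eye(3)
v = np.arange(9.0).reshape(3, 3)
out = sdpa(np.eye(3), k, v)
```

The memory-bandwidth cost of reading and writing that `(seq_q, seq_k)` matrix is exactly the kind of bottleneck the role targets.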
Benefits
- We innovate with purpose and ship impactful AI tech.
- Career development resources including conferences and courses.
- Well-being support: EAP, local meetups, flexible time off.
- Equal opportunity employer; inclusive, diverse culture.
- Global remote-friendly culture with ownership and accountability.