Hardware, Research Engineer

Added
1 minute ago
Type
Full time

Related skills

rust python pytorch jax cuda

📋 Description

  • Build a roofline simulator to track workloads and analyze the impact of architecture choices.
  • Debug gaps between performance simulation and real measurements; identify root causes.
  • Write emulation kernels for low-precision numerics and lossy compression.
  • Prototype numerics modules by pushing RTL through synthesis; own RTL end-to-end.
  • Proactively pull in new ML workloads; evaluate opportunities or risks.
  • Understand the full path from ML science to hardware optimization; ship near-term deliverables.

🎯 Requirements

  • Strong Python, plus C++ or Rust; ability to write clean, extensible code.
  • Experience writing Triton, CUDA, or similar kernels, and mapping tensor ops to functional units.
  • Working knowledge of PyTorch or JAX; large ML codebases a plus.
  • Practical understanding of floating-point numerics and ML quantization tradeoffs.
  • Deep understanding of transformer models, rooflines, and sharded training/inference.
  • Experience writing RTL for floating-point logic; familiarity with PPA tradeoffs a plus.

🎁 Benefits

  • $230K–$460K USD plus equity.
  • Relocation assistance available.
  • Hybrid work: 3 days per week onsite in San Francisco.
