Related skills
rust c tensorrt quantization pruningπ Description
- Lead edge deployment strategy for STT/TTS on-device, on-prem, air-gapped across hardware targets.
- Optimize models for edge: quantization, pruning, distillation, and runtime tuning for latency, memory, and power.
- Partner with Qualcomm, Motorola, and vendors for efficient chipsets, SDK integration, benchmarking, and joint go-to-market.
- Support defense customers via AWS NatSec partnerships; translate mission needs into engineering deliverables.
- Design and build edge runtime infrastructure: packaging, deployment pipelines, OTA updates, telemetry for low-connectivity devices.
- Harden deployments for security-sensitive environments: secure boot, encrypted storage, tamper detection, audit logging.
π― Requirements
- 5+ years of experience in systems engineering, embedded computing, or edge AI deployment with production hardware.
- Strong proficiency in C, C++, and/or Rust; writing performance-critical code for constrained environments.
- Hands-on experience with edge model optimization: quantization, pruning, distillation, or architecture-specific compilation.
- Familiarity with ONNX Runtime, TensorRT, TFLite, or vendor SDKs (Qualcomm SNPE/QNN, MediaTek NeuroPilot).
- Experience with security-conscious development practices: secure boot, encrypted storage, code signing, and secure deployment pipelines.
- Strong understanding of hardware-software interaction: CPU/GPU/NPU architectures, memory hierarchies, power management, and impact on inference performance.
π Benefits
- Medical, dental, vision benefits
- Annual wellness stipend
- Mental health support
- Unlimited PTO
- Generous paid parental leave
- 401(k) plan with company match
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!