Related skills
linux kernel ebpf tensorrt perf flamegraphsπ Description
- Own system- and workload-level performance across CIQ products.
- Define and maintain benchmarking frameworks for OS, kernel, and apps.
- Profile CPU, memory, I/O, network, and accelerator subsystems.
- Drive AI-first performance improvements for inference and training.
- Integrate workloads into CI/CD pipelines via Fuzzball.
π― Requirements
- Deep OS internals knowledge: Linux scheduler, memory, I/O, networking.
- Proven profiling/tracing skills: perf, eBPF/bpftrace, Flamegraphs.
- Experience with AI/ML workloads: inference/training, GPU utilization.
- HPC experience: MPI, OpenMP, parallel filesystems, RDMA/InfiniBand.
- CI/CD automation: automated perf tests and regression pipelines.
- Strong analytical and communication skills; collaborative.
π Benefits
- Medical, dental, and vision insurance.
- Flexible paid time off.
- Employee stock options.
- Remote work; no travel required for most positions.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!