Related skills
python go infiniband rdma linux networking
Description
- Design, build, and operate networking systems for large-scale AI training and inference
- Improve performance, reliability, and scalability across host networking and WAN
- Develop automation for provisioning, updates, and lifecycle management
- Build tooling for observability, debugging, and automated remediation
- Optimize RDMA, RoCE, InfiniBand, Ethernet, and GPU interconnects
- Define networking protocols and continuous validation criteria
Requirements
- Have experience building or operating large-scale networking or distributed systems
- Comfortable working close to the hardware/software boundary
- Experience with Linux networking, kernel systems, NICs, and RDMA
- Have worked with InfiniBand, RoCE, DPDK, or Ethernet fabrics
- Experience with datacenter networking, WAN systems, or host networking stacks
- Enjoy debugging complex systems and performance bottlenecks
Benefits
- Equal opportunity employer
- Reasonable accommodations for applicants with disabilities
- Background checks conducted in accordance with applicable law
- OpenAI privacy and compliance policies observed