Related skills
machine learning, distributed systems, performance metrics, benchmarking, hpc
📋 Description
- Build and lead measurement and performance intelligence for OpenAI’s compute platforms
- Define platform performance metrics, benchmarks, and acceptance criteria
- Partner with training, inference, and infrastructure teams to evaluate workloads
- Translate performance data into actionable recommendations for architecture, capacity, and vendor engagement
- Drive continuous performance validation across hardware, software, networking, and storage
- Collaborate with external partners to evaluate performance and improve reliability
🎯 Requirements
- Experience leading performance engineering, benchmarking, systems architecture, or infra optimization at scale
- Strong technical depth in distributed systems, AI/ML infrastructure, HPC, datacenter systems
- Experience defining performance metrics and validation frameworks for complex hardware
- Ability to operate across hardware, software, and infrastructure boundaries to drive improvements
- Ability to translate technical data into clear business, operational, and strategic recommendations
- Strong communication and leadership to influence senior technical and business stakeholders
🎁 Benefits
- Hybrid work model with 3 days in the office per week
- Relocation assistance for new employees
- Reasonable accommodations for applicants with disabilities