Related skills
reliability engineering root cause analysis cooling systems generators upsπ Description
- Lead day-to-day ops of mission-critical facility infrastructure across AI compute campuses.
- Own operational readiness for new campus deployments and expansions.
- Partner with commissioning teams to transition facilities from construction to steady-state.
- Develop and implement operating procedures, maintenance programs, and response plans.
- Lead incident response and coordinate recovery during critical events.
- Drive root cause analysis and corrective actions to improve reliability.
π― Requirements
- 8+ years operating mission-critical facilities, data centers, or large-scale ops.
- Strong knowledge of electrical distribution, generators, UPS, cooling, and controls.
- Experience supporting commissioning, operational readiness, or infrastructure turnover.
- Led facility ops teams, contractors, and third-party vendors.
- Comfortable responding to incidents and making decisions under pressure.
- Experience developing maintenance strategies, SOPs, and reliability programs.
π Benefits
- Equal opportunity employer; inclusive hiring practices.
- OpenAI Affirmative Action and Equal Employment Opportunity Policy Statement.
- Reasonable accommodations for applicants with disabilities.
- OpenAI Global Applicant Privacy Policy.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!