Added
1 minute ago
Type
Full time
Salary
Salary not provided

Related skills

ansible terraform linux kubernetes hpc

๐Ÿ“‹ Description

  • Design, operate, and scale Mistral's AI infra across HPC and cloud
  • Manage large-scale Linux environments (bare metal, clusters, cloud)
  • Monitor system health, troubleshoot incidents, high availability
  • Scale clusters toward hundreds to thousands of nodes
  • Automate ops with Python, Bash, Ansible, Terraform
  • Collaborate with infrastructure, HPC and research teams

๐ŸŽฏ Requirements

  • Strong Linux systems administration experience (core requirement)
  • Experience in large-scale HPC or cloud infrastructure
  • Experience with schedulers (Slurm)
  • Solid troubleshooting across systems, hardware, and networks
  • Containers/orchestration such as Kubernetes
  • Storage systems: Ceph, Lustre, NFS

๐ŸŽ Benefits

  • Impact: scale Mistral's AI infrastructure
  • Growth: shape data centre ops in a high-growth startup
  • Collaboration: cross-functional AI team
  • Flexibility: competitive compensation and benefits
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’