Staff Software Engineer - AI Research Infrastructure

Added
14 hours ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

rust kubernetes go ray gpus

📋 Description

  • Design infrastructures for large-scale experiments, data processing, and model training.
  • Enable rapid experiments by building abstractions for job submission, scheduling, and monitoring.
  • Build tooling to boost researcher productivity (experiments, CI, workflows).
  • Impact the long-term roadmap for research compute and model workflows.
  • Mentor other engineers in compute, infra, and AI systems.

🎯 Requirements

  • BS/MS or PhD in Computer Science or related field.
  • 5+ years in software engineering, with large-scale distributed systems or infrastructure.
  • Deep experience building/operating distributed systems and data pipelines (GPUs, clusters, cloud).
  • Proficient in one or more systems programming languages (C++, Rust, Go, Java, Scala).
  • Built or contributed to cluster schedulers or job orchestration systems (Kubernetes, Slurm, Ray).
  • Understand ML training/inference workflows (distributed training, model parallelism).

🎁 Benefits

  • Comprehensive benefits and perks for all employees.
  • Regional benefits information available for your location.
  • Career development and learning opportunities.
  • Inclusive, diverse team culture.
  • Flexible work options and modern office amenities.
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs →