Related skills
rust kubernetes go ray gpus📋 Description
- Design infrastructures for large-scale experiments, data processing, and model training.
- Enable rapid experiments by building abstractions for job submission, scheduling, and monitoring.
- Build tooling to boost researcher productivity (experiments, CI, workflows).
- Impact the long-term roadmap for research compute and model workflows.
- Mentor other engineers in compute, infra, and AI systems.
🎯 Requirements
- BS/MS or PhD in Computer Science or related field.
- 5+ years in software engineering, with large-scale distributed systems or infrastructure.
- Deep experience building/operating distributed systems and data pipelines (GPUs, clusters, cloud).
- Proficient in one or more systems programming languages (C++, Rust, Go, Java, Scala).
- Built or contributed to cluster schedulers or job orchestration systems (Kubernetes, Slurm, Ray).
- Understand ML training/inference workflows (distributed training, model parallelism).
🎁 Benefits
- Comprehensive benefits and perks for all employees.
- Regional benefits information available for your location.
- Career development and learning opportunities.
- Inclusive, diverse team culture.
- Flexible work options and modern office amenities.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!