Software Engineer, ML Systems & Training Architecture

Added
less than a minute ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

machine learning distributed systems training infrastructure code review

๐Ÿ“‹ Description

  • Review and improve code across training frameworks.
  • Identify risky changes before landing; raise code quality.
  • Debug ML training systems, GPUs, and infrastructure.
  • Help unblock training jobs and workflows.
  • Improve reliability and usability of the training framework.
  • Move quickly on engineering problems affecting velocity.

๐ŸŽฏ Requirements

  • Strong software engineering fundamentals and code review judgment.
  • Experience with ML systems, training frameworks, GPUs, or distributed infra.
  • Read and debug unfamiliar code quickly; root cause analysis.
  • Ship high-quality code with velocity and pragmatic judgment.
  • Low-ego, responsive, and helpful to researchers.
  • Experience reviewing messy or AI-generated codebases.

๐ŸŽ Benefits

  • Relocation assistance to new employees.

๐Ÿšš Relocation support

Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’