Added
18 days ago
Type
Full time
Related skills
java, python, kubernetes, distributed systems, go
Description
- Build and improve systems that orchestrate ML jobs across the company.
- Design reliable software handling failures, retries, and high load.
- Build and maintain backend services and APIs for job submission.
- Investigate production issues, identify root causes, and implement fixes.
- Review designs and code to ensure simplicity, reliability, and scalability.
- Collaborate with infrastructure, ML, and data science teams to optimize compute.
Requirements
- Bachelor's in CS/CE/EE/Robotics with 4+ years of experience (2+ years with an MS; PhD optional)
- Experience designing high-scale distributed systems
- Experience building/operating large-scale job scheduling platforms
- Backend languages: Kotlin, Java, Go, Python; strong command of their frameworks
- Distributed systems design for reliability, coordination, asynchronous processing
- 6+ years coding/design in related field (80% of time)
Benefits
- Health insurance (medical, dental, vision)
- Health Savings Account with employer match
- Employer-matched 401(k) retirement plan
- Paid parental leave and paid medical leave
- Unlimited vacation and 15 paid holidays
- Daily lunches and snacks in all office locations
Visa sponsorship