Related skills
aws, python, kubernetes, pytorch, distributed systems
Description
- Design tooling for researchers to deploy and evaluate models
- Build and maintain high-performance, cost-efficient inference pipelines
- Identify bottlenecks and scope improvements for speed and reliability
- Develop and maintain user-facing APIs for ML systems
- Implement observability to monitor model performance and health
- Troubleshoot production issues across distributed systems
Requirements
- Strong backend engineering experience with Python
- Experience building and operating distributed, containerized apps, preferably on AWS
- Proficiency implementing observability (monitoring, logging, alerting, tracing)
- Ability to design and implement resilient, scalable architectures
- Track record delivering complex projects from problem to production
- Comfort navigating ambiguity and making pragmatic technical decisions