Related skills
docker python kubernetes ci/cd observability📋 Description
- Design, build, and operate core AI platform components to train/deploy ML models.
- Own model serving and inference workflows end-to-end; improve reliability and performance.
- Lead efforts to optimize inference for throughput, latency, and cost on CPU/GPU.
- Design GPU-based inference and training workloads; tune performance and capacity.
- Own model lifecycle: packaging, versioning, testing, and deployment automation.
- Implement observability practices (metrics, logging, tracing, alerts) for ML services.
🎯 Requirements
- Bachelor’s with 4–6 years in ML or Master’s with hands-on production ML experience.
- Strong Python experience for ML systems, backends, or distributed data processing.
- Proven experience deploying and operating ML workloads in cloud environments.
- Solid understanding of model serving architectures, inference pipelines, and tradeoffs.
- Hands-on experience with GPU-based workloads in production settings.
- Experience designing CI/CD pipelines and workflows for reliable ML deployment.
🎁 Benefits
- Generous performance-based bonus plans.
- Rich medical, dental, and vision coverage.
- Generous retirement contributions with 100% immediate vesting.
- Quarterly all-company wellness days.
- Country-specific holidays plus a day off for your birthday.
- One-time home office stipend.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!