Related skills
java terraform aws python kubernetesπ Description
- Design, build, and operate scalable ML infra on AWS
- Develop distributed training and batch processing with Ray
- Build and maintain infrastructure-as-code with Terraform
- Support and evolve the feature store and feature pipelines
- Develop data ingestion and streaming systems (Kinesis, Kafka, Spark)
- Improve CI/CD workflows for ML models and platform components
π― Requirements
- 5+ years in ML infrastructure, platform engineering, or production ML systems
- ML lifecycle knowledge: data preprocessing, training, evaluation, deployment
- Distributed systems and cloud computing or large-scale data processing
- Docker and Kubernetes with orchestration
- AWS, Spark, and Ray experience
- Strong Python/Go/Scala/Java programming skills
π Benefits
- Four days in-office; Fridays from home for those near offices
- Backup child/elder/pet care plus commuter benefit
- Competitive salary based on experience
- 401k match plus medical/dental/vision/life/disability
- Generous vacation and company-wide days off
- Parental leave: up to 24 weeks birthing and 12 weeks non-birthing
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!