Related skills
s3 python kubernetes airflow spark📋 Description
- Build and maintain petabyte-scale storage infrastructure
- Tackle networking and performance challenges at scale
- Collaborate daily with researchers and engineers
- Build data layer for training and evaluation workloads
🎯 Requirements
- 4+ years in data storage infrastructure
- Strong Python programming
- Kubernetes, storage-focused (PV, CSI)
- Transform unstructured data to datasets across S3, GCS, POSIX
- Experience with Beam, Spark, or Flink
- Nice-to-have: BigQuery, Airflow, or dbt
🎁 Benefits
- Open and inclusive culture
- Work with cutting-edge AI research
- Weekly lunch stipend and snacks
- Health and dental benefits including mental health budget
- Parental leave top-up up to 6 months
- Remote-flexible with offices in major cities
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Data Jobs. Just set your
preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!