Related skills
redshift aws python gcp pysparkπ Description
- Design PySpark pipelines and collaborate on ML models for lakehouse
- Architect data DNA and enable AI-driven insights for biopharma
- Build and maintain end-to-end HCO data pipelines
- Govern and optimize data infrastructure; monitor performance
- Advance the long-term architectural roadmap; ensure data quality
- Adopt AI tooling to streamline development and solve data problems
π― Requirements
- Experience with Python and PySpark for large datasets
- Cloud-native software on AWS or GCP
- Design/maintain Data Lakes, Lakehouses, Warehouses (DeltaLake, Redshift)
- Operate LLM systems in production; multi-model orchestration
- Agile execution with strong English communication
π Benefits
- Comprehensive benefits package
- Fitness reimbursement
- Veeva Work Anywhere
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Data Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!