Related skills
redshift aws python gcp apache sparkπ Description
- Architect the 'Data DNA' to power AI-driven insights for biopharma.
- Lead global AI initiatives in high-stakes, high-impact environments.
- Design PySpark pipelines and integrate ML models into Lakehouse data.
- Identify, implement, and maintain end-to-end HCO data pipelines.
- Refine data structures and processing logic for changing market needs.
- Apply solid principles and clean patterns to data tasks.
π― Requirements
- Experience with Python and Apache Spark/PySpark on large datasets.
- Cloud-native software development in AWS or GCP.
- Architectures: Data Lakes, Lakehouses, Warehouses (DeltaLake, Redshift).
- Experience operating LLMs in production with third-party models and multi-model orchestration.
- Drive technical execution in Agile teams with strong English communication.
π Benefits
- Comprehensive benefits package.
- Fitness reimbursement.
- Veeva Work Anywhere.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!