Related skills
java postgresql sql python spring bootπ Description
- Own data platform end-to-end: ingestion, transform, storage, query, access.
- Build batch/streaming pipelines (PySpark/EMR, Kinesis) into an Iceberg lake.
- Model the warehouse: design SCD and event tables and data conventions.
- Collaborate across teams to turn data requests into reliable pipelines and docs.
- Own security/compliance backbone: audit logs, access control, temp access workflows.
- Optimize data queries at the data/product boundary; routing, indexing, caching in Java/Spring.
π― Requirements
- 5+ years building production data pipelines and platforms
- Deep Python (PySpark) and SQL fluency; tune Spark jobs at scale
- Experience with Java, Spring Boot around data layer services
- Experience with Spark/EMR, Kinesis, and S3
- Postgres fundamentals: query optimization, indexing, replication, bottleneck sense
- Experience with Iceberg and Trino (or similar)
π Benefits
- Collaborate across teams to turn data requests into reliable pipelines and docs.
- Accelerated career growth in a fast-growing company
- Coaching from experienced engineering leaders
- Direct access to executives and a transparent culture
- Opportunity to impact healthcare for millions
- Medical/Dental/Vision coverage
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Data Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!