Related skills
aws python kafka spark pysparkπ Description
- Lead Spark/PySpark pipelines for entity resolution and healthcare data processing.
- Own automatching, identity mapping, deduplication, and enrichment workflows.
- Build scalable processing frameworks for PubMed, clinical trials, ct.gov, and other data sources.
- Drive infrastructure optimization to improve throughput, runtime, observability, and cost efficiency.
- Partner with AI/ML teams to integrate matching models into EMERALD and improve precision and recall.
- Lead complex technical initiatives from architecture through deployment; mentor engineers and promote best practices.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!