Lead Spark/PySpark pipelines for entity resolution and healthcare data processing.
Own automatching, identity mapping, deduplication, and enrichment workflows.
Build scalable processing frameworks for PubMed, clinical trials, ct.gov, and other data sources.
Drive infrastructure optimization to improve throughput, runtime, observability, and cost efficiency.
Partner with AI/ML teams to integrate matching models into EMERALD and improve precision and recall.
Lead complex technical initiatives from architecture through deployment; mentor engineers and promote best practices.

Apply on employer's website

This employer gathers applications via their own applicant tracking system.

You will be redirected to an external application form.

Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.

Activate JobCopilot