Related skills
sql python dbt airflow elasticsearchπ Description
- Build and own data pipelines powering search (ingestion, processing, enrichment, indexing).
- Design and maintain feature pipelines for ML models and ranking.
- Own embedding pipeline infrastructure at scale.
- Extend feature store infra for training and inference features.
- Establish data quality monitoring, validation, and alerting.
- Collaborate with ML Engineering on index refresh and schema evolution.
- Partner with Data Science for clean, reliable data for experimentation.
- Make pragmatic infra decisions and reuse existing systems where possible.
π― Requirements
- 6+ years of data engineering experience, 2-3+ years supporting ML or search.
- Built and operated batch and streaming pipelines at scale on AWS, GCP, or Azure.
- Strong SQL with Spark, Airflow, dbt, or similar.
- Familiar with ML infra: feature stores, vector databases, embeddings; experience with search systems (Elasticsearch, Solr, Vespa).
- Data ownership: schema design, validation, monitoring, and debugging.
π Benefits
- Cash compensation range: $128,500 - $231,500 USD.
- Medical plans with 100% covered premiums.
- Fertility and adoption benefits.
- Headspace mindfulness app subscription.
- Global Employee Assistance Program.
- Retirement benefits with employer match.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Data Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!