Related skills
aws postgresql sql python dbt📋 Description
- Build and own production data infrastructure and ETL pipelines.
- Ingest data from REST APIs, XML, and files into clean analytical layers.
- Model and curate data assets; enforce quality across sources.
- Support AI-native workflows with vector tooling for GenAI/LLM.
- Lead code/design reviews; establish testing and observability practices.
🎯 Requirements
- 5+ years building and operating production data pipelines end-to-end.
- Python and SQL fluency; production code; DAGs with Airflow or Prefect.
- Cloud data stack on AWS or GCP; Postgres; Spark or Trino; IAM basics.
- Vector databases: Pinecone or Weaviate; embedding retrieval.
- Unix, Docker, and CI/CD (GitHub Actions).
- Quick, action‑oriented mindset; strong communication.
🎁 Benefits
- Equity and comprehensive healthcare.
- Generous PTO.
- Hybrid/remote options aligned to SF or Boston hubs.
- Learn from top thinkers across disciplines.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!