Related skills
AWS, ETL, SQL, NoSQL, Spark
📋 Description
- Design scalable data architectures for AI pipelines
- Build and optimize ETL pipelines for AI data
- Ingest historical case data and truth sets for model training
- Ensure data quality, validation, cleansing, and anonymization
- Enforce data governance and regulatory compliance (HIPAA/SOC2)
- Enable analytics and deployment support for AI models
🎯 Requirements
- Advanced degree (MS/PhD) in CS, Engineering, or related field
- 5+ years of data engineering experience with AI data pipelines
- Proficiency in SQL, NoSQL, Spark, and AWS, including ETL for AI data
- Strong skills in data modeling, warehousing, and data integration
- Familiarity with data governance, quality assurance, HIPAA, and SOC2
- Experience with Agile development and team collaboration using Git