Related skills
airflow kafka apache spark delta lake trino๐ Description
- Lead the development and operation of a data lake for cyber data.
- Design schemas, partitions, and indexes for fast, cost-efficient queries.
- Partner with engineers and analysts to define query patterns and data products.
- Build and evolve ETL pipelines that are observable, recoverable, and resilient.
- Drive technical initiatives end-to-end from architecture to production.
- Establish data quality, documentation, and operational ownership.
๐ฏ Requirements
- 8+ years of experience in data engineering and/or data architecture.
- Mastery-level expertise building ETL pipelines and operating them in production.
- Deep experience with data lake architecture and systems used to query data lakes.
- Strong schema and index design: partitioning, indexing, and clustering.
- Experience with column-oriented databases in production environments.
- Proven leadership experience mentoring engineers and driving technical initiatives.
๐ Benefits
- Health: Medical, dental, and vision plans; life/AD&D; disability.
- Family: Paid parental leave for eligible full-time employees.
- Vacation: Paid holidays and flexible PTO. Take what you need.
- Retirement: 401(k) with pre-tax and Roth options; HSA/FSA options.
- Office: On-site facilities including parking, bike storage, fitness center, desk stipend.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Data Jobs. Just set your
preferences and Job Copilot will do the rest โ finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!