Related skills
spark clickhouse trino druid apache hudi📋 Description
- Own and evolve the data platform (StarRocks, Hudi, Trino, Spark, Ranger) at scale.
- Build AI-optimised data layer for NLP queries and AI features.
- Own in-product data features: exports, dashboards, analytics, Custom Reports.
- Enable self-service pipelines for internal teams to scale data access.
- Enforce robust data security: RBAC, Ranger policies, AI output guardrails.
- Lead design reviews and set engineering standards for the data team.
🎯 Requirements
- 6+ years data engineering, with 2+ years in senior/lead roles.
- OLAP databases: StarRocks, ClickHouse, Druid.
- Data lake tech: Hudi, Iceberg, Delta Lake.
- Distributed query engines: Trino/Presto and Spark.
- Data security, RBAC, and Apache Ranger expertise.
- Hybrid AWS + open-source self-managed environment.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Data Jobs. Just set your
preferences and Job Copilot will do the rest — finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!