Related skills
sql python airflow kafka sparkπ Description
- Building scalable, fault-tolerant batch/streaming data pipelines
- Designing robust data solutions for self-service models
- Developing pipelines with data quality and resilience
- Defining mappings, transformations and data quality standards
- Debugging production clusters and improving performance
- Influencing architecture discussions and product roadmap
π― Requirements
- Strong SQL skills
- Proficiency in Python
- OO language experience (Java/Scala)
- Experience with HDFS/YARN/MapReduce/Hive/Kafka/Spark/Airflow/Presto
- AWS or GCP experience; Looker advantageous
- BS in Computer Science (MS preferred)
π Benefits
- Comprehensive health, dental and vision coverage
- Mental health and financial wellness resources
- Retirement options and generous leave
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!