Added
10 days ago
Type
Full time
Salary
Salary not provided
Related skills
AWS, Apache Airflow, PySpark, Amazon QuickSight, AWS Glue
Description
- Build scalable ETL/ELT pipelines with PySpark on distributed frameworks
- Orchestrate workflows using Apache Airflow (DAG design, scheduling)
- Develop data ingestion and transformation jobs with AWS Glue
- Manage AWS Lake Formation for secure data access and governance
- Maintain Glue Data Catalog for metadata, schema, and table management
- Build dashboards using Amazon QuickSight to visualize data
Requirements
- Strong hands-on experience with PySpark for large-scale data processing
- Deep knowledge of Airflow DAGs, scheduling, and CI/CD integration
- Expertise in AWS data services: Glue, Lake Formation, Data Catalog
- Experience building dashboards with Amazon QuickSight
- Familiarity with data modeling, partitioning, and optimization
- Ability to design secure, scalable data pipelines with governance
Benefits
- Experience with S3, Athena, Redshift, or EMR
- Knowledge of Python-based automation and testing
- Exposure to cloud-native DevOps (IaC, Terraform/CloudFormation)