Location
Type
Full time
Related skills
AWS, SQL, Python, Scala, Spark
📋 Description
- Build graph-based algorithms and data pipelines for identity graph and KYC.
- Analyze large datasets for entity resolution and anomaly detection.
- Develop ETL pipelines using Spark/PySpark and AWS (EMR, S3).
- Support feature engineering and A/B testing with senior data scientists.
- Evaluate new data sources and their impact on coverage and model performance.
- Collaborate with Product, Engineering, and Compliance teams.
🎯 Requirements
- Master’s degree with 2+ years of experience, or PhD with 1+ years of experience, in data science/analytics.
- Python or Scala programming.
- Strong SQL for large datasets; data lake/warehouse experience.
- Hands-on Spark or PySpark and ML libraries (scikit-learn, XGBoost, etc.).
- UNIX environments and AWS (EMR, S3); graph technologies such as Neo4j or GraphFrames are a plus.
- Bonus: Elasticsearch or DynamoDB; Airflow for data pipelines.