Added
less than a minute ago
Location
Type
Full time
Salary
Upgrade to Premium to se...
Related skills
python gcp airflow pyspark statistical modelingπ Description
- Design and implement scalable batch and real-time data processing systems across large datasets.
- Build ETL and streaming pipelines using modern GCP big data technologies.
- Lead decisions on data architecture, modeling, pipeline orchestration, and production systems.
- Develop statistical models and analytics capabilities for product intelligence.
- Design production-grade data workflows with Airflow, Dataflow, Pub/Sub, PySpark.
- Collaborate with Engineering and Product to deliver data-driven features and insights.
π― Requirements
- 7β10+ years as Data Scientist/ML Engineer with ownership of production systems.
- Strong experience building and operating large-scale data pipelines.
- GCP data services: Dataproc, Dataflow, Pub/Sub.
- Strong proficiency in Python and PySpark.
- Experience with real-time/streaming systems and orchestration (Airflow).
- Statistical modeling and applied data science techniques.
- Ability to collaborate across product, engineering, and analytics to deliver data-driven features.
π Benefits
- Stock Options and equity participation.
- Generous paid time off, holidays, and parental leave; health insurance options.
- 401(k) and Roth retirement accounts; wellness program and other benefits.
- Employee assistance programs and life insurance options.
- Flexible work environment and supportive culture.
- Competitive compensation package and opportunities for growth.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Data Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!