Related skills
docker terraform linux bash pythonπ Description
- Be scrappy to find new audio data sources and add them to our ingestion pipeline
- Operate and extend the cloud infrastructure for our ingestion pipeline (GCP) and Terraform
- Build high-quality datasets for model training at petabyte scale
- Collaborate with Scientists to shift cost, throughput, and quality for bigger-scale data
- Collaborate with the AI team and leadership to craft the dataset roadmap for next-generation products
- Liaise across disciplines to align data roadmap with model needs
π― Requirements
- BS/MS/PhD in Computer Science or a related field
- 5+ years of industry experience in software development
- Proficiency with bash/Python scripting in Linux
- Proficiency with Docker and infrastructure-as-code; experience with GCP
- Experience with web crawlers and large-scale data processing workflows is a plus
- Ability to handle multiple tasks and adapt to changing priorities
- Strong communication skills, both written and verbal
π Benefits
- Fast-growing environment where you can shape the company and product
- Entrepreneurial-minded team that supports risk, intuition, and hustle
- Hands-off management approach so you can focus and do your best work
- Opportunity to make a big impact in a transformative industry
- Competitive salaries, friendly and laid-back atmosphere, and asynchronous culture
- Work on a life-changing product millions use
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!