Software Engineer, Data Infrastructure & Acquisition - Busan, South Korea
Speechify is seeking a data-focused software engineer to design, build, and operate the data pipelines and infrastructure that power our products. This on-site role is based in Busan, South Korea.
Overview
As part of the Data Infrastructure & Acquisition team, you will design and maintain data ingestion pipelines, data lakes/warehouses, and supporting tooling to ensure scalable, high-quality data for experiments and product features.
Responsibilities
- Design, implement, and maintain data ingestion pipelines to acquire and process data from multiple sources.
- Build and maintain data infrastructure (data lake/warehouse, metadata catalog, streaming pipelines) to support analytics and product features.
- Ensure data quality, provenance, governance, and security across data platforms.
- Optimize data models and storage for performance and cost efficiency.
- Collaborate with data science, analytics, and platform teams to meet data needs.
- Monitor pipeline reliability, performance, and cost; document architectures and write automated tests.
Requirements
- 3+ years of experience in data engineering or data infrastructure.
- Strong Python and SQL skills.
- Experience with data warehouses (e.g., BigQuery, Snowflake, Redshift) and ETL/ELT processes.
- Experience with a cloud provider (AWS, GCP, or Azure) and a workflow orchestration tool (e.g., Airflow, Dagster).
- Familiarity with streaming data and batch processing; knowledge of data governance, privacy, and security best practices.
- Excellent communication and collaboration skills; ability to work in a fast-paced startup environment.
Nice-to-have
- Hands-on experience applying data privacy regulations and governance policies in production data systems.
- Familiarity with ML/AI data pipelines or NLP data.
Location
Busan, South Korea (On-site)
What Speechify offers
- Competitive compensation
- Health benefits
- Flexible, collaborative work culture
- Opportunity to work on cutting-edge speech AI products