Software Engineer, Data Infrastructure Acquisition
Speechify is seeking a Software Engineer to help build and scale our data infrastructure in Campinas, Brazil. You will design and implement data ingestion pipelines, manage data acquisition from internal and external sources, and enable data-driven decisions across products.
What youll do
- Design, implement, and maintain scalable data ingestion pipelines for streaming and batch data.
- Build and optimize data infrastructure components (data lake, data warehouse, metadata catalogs).
- Partner with data scientists, product teams, and backend engineers to understand data requirements and deliver high-quality datasets.
- Develop data acquisition from external sources and APIs; monitor and troubleshoot ingestion processes.
- Ensure data quality, integrity, and governance; implement monitoring, alerting, and data lineage.
- Collaborate on data modeling, schema design, and performance tuning for analytics workloads.
Qualifications
- 3+ years of experience in data engineering or software engineering with a data focus.
- Strong Python and SQL skills.
- Experience with cloud platforms (AWS, GCP, or Azure).
- Experience with data pipeline orchestration tools (Airflow, Dagster, or Prefect).
- Familiarity with data warehousing concepts and tools (BigQuery, Redshift, Snowflake, PostgreSQL).
- Strong problem-solving, communication, and collaboration skills.
- Bachelor's degree in Computer Science or a related field (or equivalent experience).
Nice to have
- Experience with streaming technologies (Kafka, Kinesis) and data formats (Parquet, Avro).
- Experience with data quality frameworks and instrumentation.
Location
Campinas, Brazil (On-site).
About Speechify
Speechify is a leading AI-powered text-to-speech platform that helps people consume content more efficiently. Join a fast-growing team building data-driven features that scale globally.