Software Engineer, Data Infrastructure & Acquisition
Speechify is seeking a Software Engineer, Data Infrastructure & Acquisition to join our team in Vitória, Brazil. This role focuses on building and maintaining scalable data ingestion pipelines and data infrastructure to power analytics and product features.
Responsibilities
- Design, implement, and maintain data ingestion pipelines to collect, transform, and store data from multiple sources.
- Build and optimize data infrastructure to support analytics, product features, and data science initiatives.
- Collaborate with data engineers, software engineers, and product teams to understand data needs and ensure data quality and reliability.
- Instrument data pipelines with monitoring and observability; troubleshoot data quality issues and performance bottlenecks.
- Work with cloud-based data platforms and modern tooling (e.g., AWS, GCP, Airflow, Spark) to enable scalable data workflows.
Requirements
- 2+ years of experience in data engineering or in a data-focused software engineering role.
- Strong programming skills in Python and solid SQL proficiency.
- Experience with data pipeline tooling (e.g., Airflow), ETL/ELT processes, and cloud platforms (AWS, GCP).
- Familiarity with data modeling, data warehouses, and batch/streaming data processing.
- Excellent problem-solving skills and the ability to collaborate with cross-functional teams.
Nice to Have
- Experience with Spark or similar big data frameworks; knowledge of BigQuery, Redshift, Snowflake, or equivalent data warehouses.
- Experience with streaming data platforms such as Kafka or Kinesis, and with data governance and security practices.
This is an on-site role based in Vitória, Brazil. Join Speechify to help shape how data drives product decisions and user experiences.