Overview
Speechify is seeking a Senior Software Engineer, AI Model Serving, to join our team in Campinas, Brazil. You will help design, build, and maintain the scalable AI model serving infrastructure that powers production inference for our products.
Responsibilities
- Design and implement scalable AI model serving infrastructure (inference endpoints using REST and gRPC).
- Build and operate microservices on Kubernetes in the cloud (AWS).
- Optimize performance, latency, and throughput; monitor and troubleshoot production systems.
- Collaborate with ML engineers to containerize and serve models (TensorFlow, PyTorch).
- Develop and maintain CI/CD pipelines and automated testing for ML workloads.
- Improve telemetry, logging, tracing, and observability for reliability.
- Ensure security, cost-efficiency, and compliance in model serving platforms.
Qualifications
- 5+ years of software engineering experience.
- Experience with AI model serving, ML inference, or MLOps.
- Proficiency in Python and Go.
- Experience with Kubernetes and Docker.
- Familiarity with REST and gRPC APIs.
- Experience with cloud platforms (AWS preferred; others, such as GCP, are also acceptable).
- Strong problem-solving, communication, and teamwork skills.
- Bachelor's degree in Computer Science or a related field (or equivalent practical experience).
About Speechify
Speechify is an AI-powered platform that makes information accessible and actionable through advanced text-to-speech and other AI tools.