Overview
Speechify is seeking a Senior Software Engineer to design, build, and scale AI model serving infrastructure for our products. This role focuses on delivering fast and reliable model inference services to power Speechify's AI features.
About Speechify
Speechify builds AI-powered reading and text-to-speech tools used by millions of users worldwide. We value engineering excellence, collaboration, and impact.
Responsibilities
- Design, develop, and maintain scalable services for AI model serving
- Build high-performance APIs for model inference and real-time processing
- Optimize latency, throughput, and reliability of inference workloads
- Collaborate with ML researchers to deploy experiments into production
- Write tests, monitor production systems, and troubleshoot issues
- Mentor junior engineers and contribute to architectural decisions
Qualifications
- 5+ years of software engineering experience with production systems
- Proficiency in Python and/or Go; experience with REST APIs
- Experience deploying ML models and serving architectures (TensorFlow Serving, TorchServe) is a plus
- Familiarity with Docker, Kubernetes, and CI/CD pipelines
- Cloud experience (AWS, GCP, or Azure)
- Strong communication and collaboration skills
- Bachelor's or Master's degree in Computer Science or related field (or equivalent)
What we offer
- Competitive salary and benefits
- Flexible remote work with a global team
- Opportunities for growth and impact