Job Overview
Speechify is seeking a Senior Software Engineer, AI Model serving to build and scale AI model serving infrastructure for Speechify's AI-powered features. This on-site role is based in Bandar Seri Begawan, Brunei.
Responsibilities
- Design, implement, and maintain AI model serving infrastructure and APIs.
- Deploy models to production using Docker, Kubernetes, and cloud platforms.
- Monitor latency, throughput, and reliability; implement observability and alerting.
- Collaborate with ML researchers to translate research into scalable production features.
- Participate in code reviews, testing, and documentation.
Requirements
- Strong experience with Python and building production-grade ML model serving systems.
- Experience deploying ML models to production (MLOps) and serving infrastructure.
- Hands-on experience with Docker, Kubernetes and cloud platforms (AWS/GCP/Azure).
- Familiarity with API design, security, and performance optimization.
- BS/MS in Computer Science or related field, or equivalent practical experience.
About Speechify
Speechify builds AI-powered reading and listening experiences to help people learn more efficiently.