Overview
Speechify, a leader in AI-powered text-to-speech technology, is hiring a Senior Software Engineer to build and scale AI model serving infrastructure in Yokohama, Japan. You will collaborate with data scientists and machine learning engineers to deploy models to production, reduce latency, and improve reliability.
Responsibilities
- Design, implement, and maintain scalable AI model serving infrastructure for production workloads.
- Collaborate with ML engineers to deploy, monitor, and optimize models for latency and throughput.
- Build robust APIs and services (gRPC/REST), containerize workloads with Docker, and orchestrate them on Kubernetes.
- Improve CI/CD pipelines and observability (metrics, tracing, logging).
- Mentor junior engineers and contribute to the technical roadmap.
Qualifications
- Bachelor's or Master's degree in Computer Science or a related field.
- 5+ years of software engineering experience, with a focus on large-scale systems.
- Hands-on experience with AI model serving, inference pipelines, and ML frameworks.
- Strong programming skills (Python, Go, or similar).
- Experience with distributed systems, containers, and cloud infrastructure.
- Excellent problem-solving, debugging, and communication abilities.
Location
Yokohama, Japan (on-site)