Job Description
Speechify is looking for a Senior Software Engineer, AI Model Serving, to join its team in Riga, Latvia. In this role you will design, build, and maintain the scalable AI model serving infrastructure that powers Speechify's real-time text-to-speech and other AI features. You will collaborate with machine learning researchers, software engineers, and data teams to deploy robust inference services, optimize latency, ensure reliability, and monitor performance in production.
Responsibilities
- Build and scale model serving infrastructure for AI models (TensorFlow, PyTorch) in production environments.
- Develop APIs and data pipelines for model deployment, versioning, monitoring, and rollback.
- Optimize latency and throughput through batching, caching, autoscaling, and resource tuning.
- Collaborate with ML researchers to convert research models into reliable services.
- Design and maintain CI/CD pipelines, testing, and observability (metrics, tracing, logging).
- Ensure that serving infrastructure meets security, privacy, compliance, and data governance requirements.
Requirements
- 5+ years of software engineering experience.
- Strong Python backend development skills and experience building scalable services.
- Hands-on experience with AI model serving frameworks (TensorFlow Serving, TorchServe, NVIDIA Triton) and deploying on Kubernetes.
- Experience with cloud platforms (AWS, Google Cloud, Azure).
- Proficiency with REST and gRPC APIs; containerization with Docker; familiarity with CI/CD.
- Excellent communication and collaboration skills; self-motivated and proactive.
Nice-to-have
- Experience with MLOps and ML workflows; familiarity with large language models and serving them at scale.
Benefits
- Competitive salary and equity where applicable.
- Comprehensive health, vision, and dental insurance.
- Opportunities for professional development and career growth.
- Relocation assistance may be available for the right candidate.