Speechify is seeking a Senior Software Engineer to join our AI Model Serving team in Sofia, Bulgaria. You will design, implement, and operate scalable AI model serving infrastructure powering Speechify's real-time features.
Responsibilities
- Design, implement, and operate scalable AI model serving services in production
- Collaborate with ML researchers and backend engineers to deploy models and APIs
- Optimize latency, throughput, and reliability; monitor production systems
- Build and maintain CI/CD pipelines for ML models and services
- Mentor junior engineers and participate in code reviews
- Ensure security and data privacy best practices
Requirements
- 5+ years of software engineering experience
- Strong Python proficiency
- Experience with distributed systems and microservices
- Hands-on experience with Kubernetes and Docker
- Cloud experience (AWS or GCP)
- Experience with ML model serving frameworks (TensorFlow Serving, TorchServe, or similar)
- Familiarity with monitoring tools (Prometheus, Grafana) and logging
- Excellent problem-solving and communication skills
Nice to have
- Experience with ML pipelines or large language models
- Familiarity with real-time streaming data