Job Overview

Speechify is seeking an AI Engineer & Researcher, Inference to join our team in San Francisco, USA. This role blends research and engineering to build and optimize high-performance inference systems for speech models. You will collaborate with the ML research and product teams to push the state of the art in model latency and memory efficiency and to deploy models reliably to production. The ideal candidate is proficient in modern ML frameworks and fluent in turning research insights into production-ready inference tooling.

Responsibilities

- Design, implement, and optimize scalable inference pipelines for speech models.
- Conduct applied research on efficient inference techniques (e.g., quantization, pruning, distillation) and integrate them into production systems.
- Evaluate models for latency, accuracy, and memory usage; develop benchmarks and monitor performance in production.
- Collaborate with ML researchers and product teams to deploy reliable, real-time speech solutions.
- Build tooling to automate experiments, track results, and support model lifecycle management.

Requirements

- Strong background in machine learning and systems, with a focus on inference for deep models.
- Proficiency in Python and at least one ML framework (PyTorch or TensorFlow).
- Experience with C++/CUDA and performance-oriented programming is a plus.
- Familiarity with distributed systems, GPUs, and cloud-based deployments.
- Excellent collaboration skills and the ability to translate research ideas into production-ready solutions.

Nice to Have

- Experience in speech recognition, speech synthesis, or related audio processing domains.
- Publications or open-source contributions in ML research.
- Experience with quantization, pruning, distillation, or other model compression techniques.