AssemblyAI

51-200 employees

Research Engineer, Evaluations

Added

less than a minute ago

Location

🌍 North America

Type

Full time

Related skills

Cloud, SQL, Python, ML, benchmarking

📋 Description

Own end-to-end evaluation across accuracy, latency, and other metrics.
Build and maintain benchmarking pipelines against competitors.
Design experiments to measure the impact of model changes.
Onboard, curate, and maintain evaluation datasets (public and internal).
Create evaluation subsets to stress-test capabilities and edge cases.
Collaborate with research and engineering teams to align with customer needs.

🎯 Requirements

ML fundamentals: understand model training and evaluation.
Strong Python skills; write evaluation scripts and data pipelines.
SQL and cloud infrastructure experience.
Metric intuition: define metrics capturing real-world performance.
Voice agent stack familiarity: VAD, ASR, turn detection, LLM, TTS.
Time zone: 3-4 hours of overlap with US Eastern Time required.

🎁 Benefits

Fully remote team.
Shape product through research.
Pay transparency and pay equity.
Collaborative, diverse team.
