-
Research
Research Engineer, Audio & Speech Models
Zyphra · Palo Alto
Open-weight TTS lab behind Zonos (trained on 200K+ hours); postgraduate degree in CS, EE, physics, or ML preferred.
Apply →
-
Research
Sr. Researcher, Multimodal AI
Dolby · Sydney
Spatial audio and multimodal AI for entertainment in Dolby's Advanced Technology Group.
Apply →
-
Research
Research Staff, Voice AI Foundations
Deepgram · San Francisco
Foundation model research across STT, TTS, and full speech-to-speech; requires a strong statistical learning theory background and prior deployment experience.
Apply →
-
Engineering
Research Engineer
ElevenLabs · Remote
Apply →
-
Engineering
Research Engineer, Machine Learning Systems
Deepgram · Remote
Apply →
-
Engineering
Senior Machine Learning Engineer, Voice AI
Together AI · San Francisco
$200,000–$260,000; own the model serving stack for Whisper, Parakeet, Orpheus, and Kokoro across STT, TTS, and speech-to-speech.
Apply →
-
Engineering
Applied Audio ML Engineer
David AI · San Francisco
Apply →
-
Engineering
ML Engineer, Audio
Sandbar · New York
Small team building Stream, a private voice ring covered by WSJ, Bloomberg, Wired, and Fast Company; role spans cloud and on-device audio ML.
Apply →