MLOps.community  cover image

Voice and Language Tech // Catherin Breslin // Coffee Sessions #129

MLOps.community

00:00

Streaming Up the Audio as Soon as a Person Talks

Usually I think the speech recognition part is the bit which is the slowest. And that takes most of the time when you're talking about interacting with a pipeline like this. Most voice assistants don't use those, they really try and respond quickly. But in other scenarios, the back end just isn't responsive enough for that. So there is a lot of engineering trying to architect these systems to be as fast as possible.

Play episode from 32:22
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app