MLOps.community  cover image

Large Language Models in Production Round-table Conversation

MLOps.community

00:00

The Divergence of Real-Time Use Cases

I think we potentially have like a divergence right where you're actually don't get the one like larger model mega model that it just gets smaller and cheaper. We actually get to back to its highly specialized models or you know I've actually seen implemented recently. You can go into a use case and then route from there into something a lot more specialized that accumulates latency though, he says. He thinks truly real-time use cases are kind of out of the question in point today and time at least for now.

Play episode from 41:06
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app