MLOps.community  cover image

Large Language Models in Production Round-table Conversation

MLOps.community

00:00

The Cost, Quantity and Latency Triangle in Software Development

The latency of an end-to-end user experience is going to be king. You have to stay within a flow state for your user experience such that you know we could they can make those functionalities better but they don't want you because they want to maintain a very very low latency. That's how we think we need to start thinking about this. We've talked about when in our conversations with companies is a cost, quantity and latency triangle right depending on your use case you care about one of those more than the other two.

Play episode from 31:49
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app