MLOps.community  cover image

Machine Learning SRE // Niall Murphy // MLOps Coffee Sessions #54

MLOps.community

00:00

Oncall for Machine Learning

I'm trying to think of coming up with a separate oncall process for the data signs team. We have our own a incident response management system, but now that we're going to be getting more models being used it's time to look at this differently. I can tell you two possible approaches. The first one is to decide that the peculiarities of the m l situation was impracticable to expect an essere team,. even when composed of a bunch of dits e statistics, to actually master in a useful period of time. This almost always means model rollback. And obviously, in the gl internal production system, m roll back. In the classic semy space, we

Play episode from 29:32
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app