Last Week in AI cover image

#235 - Sonnet 4.6, Deep-thinking tokens, Anthropic vs Pentagon

Last Week in AI

00:00

Masking Updates Improves Adaptive Optimizers

They review research showing masked updates and MAGMA yield big optimizer gains in training stability.

Play episode from 43:31
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app