Cheeky Pint cover image

Reiner Pope of MatX on accelerating AI with transformer-optimized chips

Cheeky Pint

00:00

Latency differences: HBM vs SRAM

Reiner quantifies HBM latency (~20ms) versus SRAM (~1ms) and why that impacts user-facing responsiveness.

Play episode from 17:11
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app