
Breaking the Memory Wall in the Age of Inference
The Data Exchange with Ben Lorica
Building SRAM-First Inference Engines
Sid describes packing more SRAM close to compute, and d-Matrix's SRAM-based accelerator, which offers higher capacity for models.
Transcript


