Semi Doped cover image

Can Pre-GPT AI Accelerators Handle Long Context Workloads?

Semi Doped

00:00

Why Cerebras built a wafer-scale engine

Austin and Vikram explain wafer-scale design choices to maximize on-chip compute and reduce off-chip data movement.

Play episode from 20:10
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app