260: Nvidia GTC, AI Inference Explained, Jensen new Steve Jobs & Super Micro’s $2.5B Smuggling Scheme

Not Investment Advice

Throughput vs Token Speed Tradeoff

Trung breaks down the tension between throughput and per-token latency, and explains why Grok targets high-quality, fast inference.

Play episode from 25:28
