TR 40: InferenceMAX, Video games, Rare earths

62 snips

Oct 10, 2025

The hosts dive into the groundbreaking InferenceMAX, exploring its role in establishing open benchmarks for AI performance. They analyze how historical GPU tactics shaped today’s standards and discuss the significance of performance-per-dollar metrics over simple FLOPS. Transitioning to gaming, they examine EA's recent acquisition and the trend of sovereign funds investing in studios. The intriguing dynamics of rare earths and their geopolitical implications highlight the concentration of processing power in China, raising questions about the future of tech supply chains.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

INSIGHT

Performance Is A Moving Target

Inference performance shifts often due to drivers, frameworks, compilers, and model changes.
Continuous testing captures those moving parts better than annual point-in-time comparisons.

INSIGHT

Cost And Power Beat Peak FLOPS

Throughput-per-dollar and tokens-per-megawatt matter more to AI infra builders than raw peak performance.
Inference Max shows scenarios where AMD is cost-competitive while Nvidia often leads performance and energy efficiency.

ADVICE

Run Regular Automated Benchmarks

Use frequent, automated benchmarking to catch software and driver regressions and improvements in real time.
Update models and test stacks daily to reflect actual deployment behavior and inform purchasing.

Get the Snipd Podcast app to discover more snips from this episode

Get the app