Transistor Radio

TR 40: InferenceMAX, Video games, Rare earths

Oct 10, 2025
The hosts discuss InferenceMAX and its role in establishing open benchmarks for AI inference performance. They analyze how historical GPU tactics shaped today's standards and explain why performance-per-dollar metrics matter more than raw FLOPS. Turning to gaming, they examine the recent EA acquisition and the trend of sovereign funds investing in studios. They close with rare earths and their geopolitical implications: processing capacity is concentrated in China, which raises questions about the future of tech supply chains.
INSIGHT

Performance Is A Moving Target

  • Inference performance shifts often because drivers, frameworks, compilers, and models keep changing.
  • Continuous testing captures those moving parts better than annual point-in-time comparisons.
INSIGHT

Cost And Power Beat Peak FLOPS

  • Throughput-per-dollar and tokens-per-megawatt matter more to AI infra builders than raw peak performance.
  • InferenceMAX shows scenarios where AMD is cost-competitive, while Nvidia often leads on performance and energy efficiency.
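
To make the two metrics concrete, here is a minimal Python sketch of how throughput-per-dollar and tokens-per-megawatt could be computed from a single benchmark run. The `BenchResult` type, its field names, and every number below are hypothetical placeholders for illustration; they are not InferenceMAX data, and the benchmark's own definitions may differ.

```python
from dataclasses import dataclass


@dataclass
class BenchResult:
    """One benchmark run. All fields are illustrative assumptions."""
    name: str
    tokens_per_second: float   # measured inference throughput
    cost_per_hour_usd: float   # hourly cost of the system under test
    power_watts: float         # average power draw during the run


def tokens_per_dollar(r: BenchResult) -> float:
    # Tokens generated in one hour divided by the cost of that hour.
    return r.tokens_per_second * 3600 / r.cost_per_hour_usd


def tokens_per_second_per_megawatt(r: BenchResult) -> float:
    # Throughput normalized to one megawatt of power draw.
    return r.tokens_per_second * 1e6 / r.power_watts


# Made-up numbers: B is faster and more power-efficient, A is cheaper per token.
a = BenchResult("accelerator-A", tokens_per_second=1000.0,
                cost_per_hour_usd=2.0, power_watts=500.0)
b = BenchResult("accelerator-B", tokens_per_second=1500.0,
                cost_per_hour_usd=4.0, power_watts=700.0)

for r in (a, b):
    print(f"{r.name}: {tokens_per_dollar(r):,.0f} tokens/$, "
          f"{tokens_per_second_per_megawatt(r):,.0f} tokens/s per MW")
```

With these placeholder inputs the ranking flips depending on the metric, which is the kind of trade-off the snip describes: the system with lower raw throughput can still win on cost.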
ADVICE

Run Regular Automated Benchmarks

  • Use frequent, automated benchmarking to catch software and driver regressions and improvements in real time.
  • Update models and test stacks daily to reflect actual deployment behavior and inform purchasing.
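
As a rough illustration of the advice above, the following Python sketch compares the latest automated run against the median of recent runs and flags a regression or improvement. The function name `flag_change`, the 5% threshold, and the sample throughput values are assumptions made for this example, not part of InferenceMAX or any specific benchmarking tool.

```python
from statistics import median


def flag_change(history: list[float], latest: float,
                threshold: float = 0.05) -> str:
    """Classify the latest throughput against the median of past runs.

    `threshold` is the relative change (5% by default, an arbitrary
    choice) beyond which a run counts as a regression or improvement.
    """
    baseline = median(history)
    change = (latest - baseline) / baseline
    if change <= -threshold:
        return "regression"
    if change >= threshold:
        return "improvement"
    return "stable"


# Hypothetical tokens-per-second results from previous daily runs.
recent_runs = [100.0, 102.0, 98.0]

print(flag_change(recent_runs, 90.0))   # well below the baseline
print(flag_change(recent_runs, 110.0))  # well above the baseline
print(flag_change(recent_runs, 101.0))  # within the noise band
```

Using the median rather than the last run as the baseline keeps one noisy result from triggering a false alarm; a real pipeline would likely also pin and record the driver, framework, and model versions for each run.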