
ProductLed Podcast The GPU Gold Rush: How Vast.ai Scaled With AI Demand
Mar 27, 2026
Travis Cannell, CEO and early employee at Vast.ai, runs a two-sided marketplace for on-demand GPUs. He explores why inference is driving massive GPU demand. He explains marketplace dynamics, pricing strategies, and how software and support defend a low-cost model. He also discusses rapid growth triggers and how AI reshapes hiring and org design.
AI Snips
Chapters
Transcript
Episode notes
Inference Is Fueling GPU Demand
- Inference is the primary driver of current GPU demand as models must run on GPUs to respond to prompts.
- Travis explains training produces a model file, but inference loads that model into GPU RAM whenever a user interacts with it, making ongoing compute essential.
When Vast.ai Hit Hypergrowth
- Vast.ai's hypergrowth began around November and accelerated in early 2026 tied to Opus 4.6 and CloudCode releases.
- Travis compares the moment to a bigger ChatGPT 3.0 moment and notes worldwide pickup across APAC and Europe.
Balance Low Price With Strong Verification
- Compete on price plus platform experience: keep marketplace bare-bones, verify hosts, and let suppliers set prices to maintain low cost.
- Vast.ai doesn't own hardware; verification and bandwidth tests preserve trust while enabling low GPU prices.
