
Catalyst with Shayle Kann Live from Transition-AI 2026: Inside Google’s massive AI CapEx
Apr 23, 2026. Amin Vahdat is Google's chief technologist for AI infrastructure, responsible for designing data centers, chips, and power systems. He talks about Google's massive 2026 CapEx and the shift from training to distributed inference. The conversation covers rethinking reliability in favor of more compute, using on-site power as a bridge, microgrids and software control, and co-designing chips, buildings, and models.
Episode notes
Inference Doesn't Need Gigawatt Data Centers
- Inference workloads generally don't require individual gigawatt-scale data centers and can run effectively in much smaller deployments.
- Amin notes that racks are trending toward hundreds of kilowatts and that serving can be done in the tens of megawatts with co-located compute, storage, and networking.
Capacity Will Shift From Training To Serving
- We're entering an age where most capacity shifts from training to serving as models proliferate and efficiency improves.
- Amin compares it to search: compute once went mostly to building the index but quickly shifted to serving the index at scale.
Reliability Tradeoffs Shift With Compute Cost
- High-reliability requirements took hold when compute was a small share of cost; now that compute dominates, customers often prefer more capacity over ultra-high availability.
- Amin says internal customers will, in many cases, trade fewer nines of availability for double the capacity.

