The New Stack Podcast

Google AI Infrastructure PM On New TPUs, Liquid Cooling and More

May 13, 2025
Chelsie Czop, Senior Product Manager for AI Infrastructure at Google Cloud, discusses recent developments in AI hardware. She covers the new Ironwood TPUs, which deliver 42.5 exaflops at full pod scale, and advances in liquid cooling, which is essential for managing heat. She also addresses the choice between TPUs and GPUs, noting significant performance gains for some users, and highlights the collaboration with DeepMind to stay ahead of evolving model architectures, along with the sustainability innovations shaping the future of data centers.
INSIGHT

Synergy of Hardware and Software

  • Close collaboration between hardware and software teams is vital for TPU usability.
  • Software like Google's Pathways framework unlocks hardware potential for large model training and inference.
INSIGHT

Bridging Hardware and Model Evolution

  • Models evolve rapidly while TPU hardware releases follow a steady annual cadence.
  • Software optimizations help bridge the gap between fast model changes and slower hardware iterations.
ANECDOTE

Faster Hardware Cadence at Google

  • Chelsie Czop came from IBM, where hardware design took 3 years, to Google, where TPU development cycles are much faster.
  • She admired Google's philosophy of designing on the assumption that hardware will fail and relying on software to maintain reliability.