Tech Talks Daily

How DDN And NVIDIA Are Rethinking AI Infrastructure For The Rubin Era

Mar 24, 2026
Alex Bouzari is the founder and CEO of Data Direct Networks (DDN), a leader in AI and HPC data infrastructure. He explains why data movement and memory architecture are now the bottlenecks in AI, how rack-scale systems like Rubin change operations, and why industrial engineering metrics such as cost-per-token, rack utilization, and time-to-value will decide the winners.
INSIGHT

Data Movement Is The New AI Bottleneck

  • The bottleneck in modern AI has shifted from GPUs to the data layer and architecture.
  • Alex Bouzari warns that if data movement is chaotic, expensive GPUs will stall and investments become costly science experiments.
ADVICE

Industrialize AI Instead Of Hoarding GPUs

  • Industrialize and operationalize AI infrastructure rather than just buying more GPUs.
  • Alex Bouzari recommends focusing on predictable, efficient operation because owning GPUs alone won't deliver value or ROI.
INSIGHT

Inference Economics Drive AI Value

  • Inference economics is now the primary value driver, not just model size or training.
  • DDN redesigned the data path, tiered KV cache, and offloaded overhead to DPUs to reach 95–99% GPU utilization and faster first-token times.
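
To make the link between GPU utilization and cost-per-token concrete, here is a minimal Python sketch. It is not from the episode: the function name, the $3.00 GPU-hour price, and the 1,000 tokens/s peak throughput are hypothetical, and it assumes served throughput scales linearly with utilization.

```python
def cost_per_million_tokens(gpu_hour_cost, peak_tokens_per_sec, utilization):
    """Dollar cost of serving one million tokens on a single GPU.

    Assumption (not from the episode): effective throughput equals
    peak throughput multiplied by the utilization fraction.
    """
    if not 0 < utilization <= 1:
        raise ValueError("utilization must be in (0, 1]")
    tokens_per_hour = peak_tokens_per_sec * utilization * 3600
    return gpu_hour_cost / tokens_per_hour * 1_000_000


if __name__ == "__main__":
    # Hypothetical inputs chosen only for illustration.
    GPU_HOUR_COST = 3.00         # dollars per GPU-hour
    PEAK_TOKENS_PER_SEC = 1000   # tokens/s at full utilization

    for utilization in (0.50, 0.95, 0.99):
        cost = cost_per_million_tokens(
            GPU_HOUR_COST, PEAK_TOKENS_PER_SEC, utilization
        )
        print(f"utilization {utilization:.0%}: ${cost:.2f} per million tokens")
```

Under these assumptions the cost per token is inversely proportional to utilization, so the same hardware run at 95% instead of 50% utilization serves each token for roughly half the cost.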