MLOps.community  cover image

Performance Optimization and Software/Hardware Co-design across PyTorch, CUDA, and NVIDIA GPUs

MLOps.community

00:00

Grace Blackwell architecture implications

Chris explains Grace Blackwell MCM, CPU memory pools, arithmetic intensity, and data movement limits.

Play episode from 26:31
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app