

Chris Fregly
AI systems performance engineer with experience at AWS, Databricks, and Netflix, and author of 'AI Systems Performance Engineering'. He specializes in GPU, CUDA, and PyTorch optimization.
Best podcasts with Chris Fregly
Ranked by the Snipd community

59 snips
Mar 10, 2026 • 1h 12min
973: AI Systems Performance Engineering, with Chris Fregly
Chris Fregly, AI systems performance engineer and author with experience at AWS, Databricks, and Netflix, discusses GPU-centric performance engineering, with a focus on memory bandwidth over FLOPS. Topics include full-stack hardware–software co-design, low-level profiling and CUDA, inference optimizations such as the KV cache, and practical use of AI coding assistants and continuous evals.

58 snips
Feb 24, 2026 • 1h 26min
Performance Optimization and Software/Hardware Co-design across PyTorch, CUDA, and NVIDIA GPUs
Chris Fregly, AI performance engineer, founder, and author, walks through software/hardware co-design for PyTorch, CUDA, and NVIDIA GPUs. He covers mechanical sympathy, GPU generations, NVLink and networking, kernel tuning with coding agents, and infrastructure trade-offs for training versus inference. The discussion is short, technical, and focused on building scalable, high-performance AI systems.


