

Kyle Kranen
Engineering leader and architect of NVIDIA Dynamo, a datacenter-scale inference framework; specializes in inference systems, scale-out serving, and optimizing cost/latency/quality tradeoffs.
Best podcasts with Kyle Kranen
Ranked by the Snipd community

359 snips
Mar 10, 2026 • 1h 24min
NVIDIA's AI Engineers: Agent Inference at Planetary Scale and "Speed of Light" — Nader Khalil (Brev), Kyle Kranen (Dynamo)
Kyle Kranen, an engineering leader behind NVIDIA Dynamo who builds datacenter-scale inference systems. Nader Khalil, a DevRel leader focused on GPU developer UX and Brev’s developer onboarding. They discuss Dynamo’s scale-out inference approach, prefill vs decode disaggregation, Kubernetes-based scaling, SOL (Speed of Light) urgency culture, model‑hardware co-design, long‑context limits, and agent security and tooling.

Mar 3, 2022 • 1h 57min
Becoming a deep learning researcher without a PhD, graph neural network(GNN), time series, recommender system with Kyle Kranen - The Data Scientist Show#028
Exploring deep learning research topics with Kyle Kranen, a Deep Learning Software Engineer at Nvidia. Topics include Graph Neural Network (GNN), Temporal Fusion Transformer (TFT), time series, and other insights into his career journey.


