Machine Learning Street Talk (MLST)

Why Your GPUs are underutilised for AI - CentML CEO Explains

28 snips
Nov 13, 2024
Gennady Pekhimenko, CEO of CentML and associate professor at the University of Toronto, dives into the intricacies of AI system optimization. He illuminates the challenges of GPU utilization, revealing why many companies only harness 10% efficiency. The conversation also touches on 'dark silicon,' the competition between open-source and proprietary AI, and the need for strategic refinement in enterprise AI infrastructure. Pekhimenko's insights blend technical depth with practical advice for enhancing machine learning applications in modern businesses.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ANECDOTE

Low GPU Utilization

  • Gennady Pekhimenko observed only 10% GPU utilization in early ML workloads at Microsoft Research.
  • This highlighted the gap in understanding compute between ML and systems communities.
INSIGHT

Open Source for Enterprise AI

  • Open-source models are crucial for enterprise AI adoption, allowing companies to build internal IP.
  • This approach reduces reliance on external providers and addresses data sensitivity concerns.
INSIGHT

Enterprise AI Adoption Challenges

  • Enterprises recognize GenAI's value but struggle to identify the right initial use cases and implementation strategies.
  • Building internal AI expertise and finding cost-effective solutions are key challenges.
Get the Snipd Podcast app to discover more snips from this episode
Get the app