
Mixture-of-Experts and Trends in Large-Scale Language Modeling with Irwan Bello - #569
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
00:00
Navigating Language Model Optimization
This chapter explores the complexities of optimizing large-scale language models, highlighting the challenges of reproducibility and the gap between academic research and industry needs. It discusses the evolution of research methodologies and the shift in focus from novel architectures to effectively scaling existing ones, particularly in the realm of computer vision.
Play episode from 40:04
Transcript


