The Nonlinear Library

LW - Research Post: Tasks That Language Models Don't Learn by Bruce W. Lee

Feb 23, 2024
This episode discusses how large language models struggle with the sensory aspects of language, as measured by the H-Test benchmark. Key findings: performance plateaus even as models grow stronger, and the number of examples provided has minimal impact.