
AI #140: Trying To Hold The Line
Don't Worry About the Vase Podcast
What language models do well and where they fail
Zvi contrasts mundane utility with notable failure modes and discusses the BBC evaluation of assistants' errors.
Transcript


