
768: Is Claude 3 Better than GPT-4?
Super Data Science: ML & AI Podcast with Jon Krohn
00:00
Observation during testing of language models and the need for improved testing methods
Exploring the unexpected presence of a pizza topping response in a test of language models, sparking a discussion on their ability to adapt during testing and the necessity for improved evaluation strategies.
Play episode from 10:00
Transcript


