Chain of Thought | AI Agents, Infrastructure & Engineering cover image

Beyond Transformers: How Liquid AI Is Rethinking LLM Architecture | Maxime Labonne

Chain of Thought | AI Agents, Infrastructure & Engineering

00:00

Custom Evals for Small Models and RAG Benchmarks

Maxime details building narrow internal benchmarks, repurposing frontier evaluations, and designing tests focused on function calling and web search.

Play episode from 38:37
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app