Everything Hard About Building AI Agents Today

Jun 13, 2025

Guest

Willem Pienaar

Guest

Shreya Shankar

Willem Pienaar, CTO of Cleric and creator of Feast, and PhD student Shreya Shankar tackle the toughest challenges in building AI agents. They discuss why 'ground truth' is ambiguous in evaluations and describe three gulfs of human-AI interaction that hinder success. Both emphasize moving humans out of the feedback loop and using implicit signals for faster learning. They also cover practical techniques such as heat maps of task failures, the complexities of simulated environments, and the inevitable performance ceiling of AI.

AI Snips

ANECDOTE

DocETL's LLM MapReduce Pipeline

Shreya's DocETL system uses LLMs as map and reduce operators to process large volumes of unstructured data.
Verification is challenging because users cannot easily tell whether the LLM missed anything in the data.
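
To make the map and reduce idea concrete, here is a minimal Python sketch. It is not DocETL's API: `llm_map_reduce`, `fake_extract`, and `fake_merge` are hypothetical names, and the two "LLM" steps are plain functions standing in for model calls so the example runs offline.

```python
from typing import Callable, List

def llm_map_reduce(
    docs: List[str],
    map_fn: Callable[[str], List[str]],
    reduce_fn: Callable[[List[List[str]]], List[str]],
) -> List[str]:
    """Apply map_fn to each document, then combine the partial results."""
    partials = [map_fn(doc) for doc in docs]
    return reduce_fn(partials)

# Hypothetical stand-in for an LLM "map" call: extract capitalized words.
def fake_extract(doc: str) -> List[str]:
    return [word for word in doc.split() if word.istitle()]

# Hypothetical stand-in for an LLM "reduce" call: merge and deduplicate.
def fake_merge(partials: List[List[str]]) -> List[str]:
    return sorted({item for partial in partials for item in partial})

docs = ["Alice met Bob", "Bob called Carol"]
result = llm_map_reduce(docs, fake_extract, fake_merge)
print(result)
```

The verification problem shows up even in this toy: the merged output gives no indication of what the map step failed to extract, so a reader of the result cannot tell what is missing without going back to the documents.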

INSIGHT

Bridging AI Communication Gulfs

Successful AI products must bridge the "gulfs" of specification and generalization that arise when users communicate their intent.
Tools for detailed prompt engineering improve specification; other strategies address generalization errors.

ADVICE

Leverage Implicit User Feedback

Use implicit feedback signals, such as clicks on expandable content, to gauge how useful an AI response is.
These interactions provide valuable metrics without forcing users to perform explicit evaluations.
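
One way such a signal could be turned into a metric is sketched below in Python. The event schema (`response_id`, `kind`) and the `expand_rate` function are assumptions made for illustration, not something described in the episode.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List

@dataclass
class Event:
    response_id: str  # hypothetical identifier for one AI response
    kind: str         # assumed values: "shown" or "expanded"

def expand_rate(events: List[Event]) -> Dict[str, float]:
    """Fraction of impressions in which the user expanded the response."""
    shown: Dict[str, int] = defaultdict(int)
    expanded: Dict[str, int] = defaultdict(int)
    for event in events:
        if event.kind == "shown":
            shown[event.response_id] += 1
        elif event.kind == "expanded":
            expanded[event.response_id] += 1
    # Only responses that were shown at least once get a rate.
    return {rid: expanded[rid] / count for rid, count in shown.items()}

events = [
    Event("r1", "shown"),
    Event("r1", "expanded"),
    Event("r1", "shown"),
    Event("r2", "shown"),
]
rates = expand_rate(events)
print(rates)
```

A rate like this is only a proxy: a click may reflect curiosity or confusion as much as usefulness, so it is best read alongside other signals.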
