MLOps.community

Everything Hard About Building AI Agents Today

158 snips
Jun 13, 2025
Join Willem Pienaar, CTO of Cleric and creator of Feast, along with PhD student Shreya Shankar, as they tackle the toughest challenges in building AI agents. They discuss the ambiguity of 'ground truth' in evaluations, revealing three key gulfs of human-AI interaction that hinder success. The duo emphasizes the importance of moving humans out of the feedback loop, using implicit signals for faster learning. Practical techniques like heat maps for task failures and the complexities of simulated environments are also explored, shedding light on the inevitable performance ceiling of AI.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ANECDOTE

Doc ETL's LLM MapReduce Pipeline

  • Shreya's Doc ETL system uses LLMs as map and reduce operators to process vast unstructured data.
  • Verification is challenging because users don't know if the LLM missed anything in the data.
INSIGHT

Bridging AI Communication Gulfs

  • Successful AI products must bridge the "gulfs" of specification and generalization in user intent communication.
  • Tools for detailed prompt engineering improve specification; other strategies address generalization errors.
ADVICE

Leverage Implicit User Feedback

  • Use implicit user feedback signals like clicks on expandable content to gauge AI response usefulness.
  • These interactions provide valuable metrics without forcing users to perform explicit evaluations.
Get the Snipd Podcast app to discover more snips from this episode
Get the app