
How can you test your code when you don’t know what’s in it?
The Stack Overflow Podcast
00:00
Reproducibility, evals, and LLM-based testing
Fitz outlines two approaches: workflow-skeleton tests and open-ended evals using LLMs to assess other LLM outputs probabilistically.
Play episode from 05:23
Transcript


