Open Source Startup Podcast

E186: Unlocking Your Unstructured Data with Typedef

12 snips
Nov 20, 2025
Kostas Pardalis and Yoni Michael, co-founders of Typedef, discuss their journey from major data infrastructure firms to creating solutions for unstructured data. They reveal how traditional systems like Spark fail under current AI demands and introduce Fennec, a tailored DataFrame library for LLMs. Insights on optimizing agentic systems and early go-to-market strategies add depth to their conversation. They also ponder the future of AI in data workflows and critique the explosion of benchmarks in the industry.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ADVICE

Remove Ops Burden Early

  • Abstract operational burden so engineers focus on business logic instead of cluster sizing or partitioning.
  • Build serverless-style ergonomics to remove ops friction from data teams' workflows.
INSIGHT

Make Inference Planner-Aware

  • Inference changes workload characteristics from CPU-bound to I/O-heavy and black-box UDFs break optimizer value.
  • Treat inference as part of the planner so semantic operators and optimizers can reason about models.
INSIGHT

DataFrame API For LLM Workflows

  • Fennec exposes a DataFrame API extended with semantic operators like semantic filter and semantic join.
  • It batches inference, handles rate limits, and unifies relational and LLM transforms ergonomically.
Get the Snipd Podcast app to discover more snips from this episode
Get the app