

Josh Wills
Member of technical staff at Datology AI focused on data curation and foundation-model training, with a 25-year career in data engineering including roles at Cloudera and Slack and contributions to dbt and DuckDB.
Best podcasts with Josh Wills
Ranked by the Snipd community

18 snips
May 7, 2026 • 55min
AI Agents Can't Fix Data - Josh Wills on Where AI Breaks in Data Engineering
Josh Wills, a 25-year data engineering veteran now at Datology AI who helped build tools like dbt and DuckDB. He talks about why AI agents misdiagnose messy pipelines, the rise of petabyte-scale multimodal datasets, fragile $200K vibe-coded pipelines with no training data, the enduring role for classical ML, and why managing unreliable agents is now part of the job.


