
The Test Set by Posit Emily Riederer: Column selectors, data quality, and learning in public
Jan 26, 2026
Emily Riederer, a data science manager at Capital One and cross-language tool author, talks about her journey through R, Python, and SQL. She dives into messy real-world data, the rise of dbt and better SQL tooling, and why column selectors (yes, really) change ergonomics. She also discusses learning in public, imposter syndrome, and solving boring but high-impact problems.
AI Snips
Chapters
Transcript
Episode notes
Practical Pipeline Breakdown
- Emily frames the data pipeline as extract, load, transform plus a prior logging/encoding step.
- Organization choices (files vs. database) trade off discoverability and constraints that help downstream users.
Discovering dbt Felt Like Serendipity
- Emily felt daily pain points that dbt later aimed to solve, like testing SQL and orchestration headaches.
- Hearing an early dbt interview felt serendipitous because it addressed the same problems she had been solving.
Selectors Unlock Scalable Column Ops
- Column selectors let you target groups of variables by name or type to apply transformations at scale.
- Selectors remove repetitive, error-prone copy-paste work and improve code clarity when tables have many columns.
