How AI Is Built

#007 Navigating the Modern Data Stack, Choosing the Right OSS Tools, From Problem to Requirements to Architecture

23 snips
May 17, 2024
Data engineering expert Nicolay Gerold and software-defined assets expert Jon Erich Kemi Warghed discuss selecting the right tools, implementing data governance, and the concept of software-defined assets. They highlight the importance of data governance, open source tooling, agile data platforms, and software-defined assets like Dagster for simplifying data orchestration and creating business value.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ADVICE

Practical Open-Source Due Diligence

  • Do validate open-source tools before adoption by checking commit frequency, backing, and community responsiveness.
  • Try to get a tool running within three to four hours as a practical litmus test for viability.
INSIGHT

From Problem To Requirements To Architecture

  • Move from problem interviews to clear capability requirements before selecting architecture and tools.
  • Measure tool impact by whether key user metrics actually improve after adoption.
ADVICE

Make Governance A Transparency Tool

  • Do treat data governance as transparency: catalog assets, provenance, usage, and quality.
  • Avoid treating governance as policing; use it instead as the organization's guiding star.
Get the Snipd Podcast app to discover more snips from this episode
Get the app