

DataNation - Podcast for Data Engineers, Analysts and Scientists
Alex Merced Podcasts
Welcome to "Datanation," the podcast where your host, Alex Merced, takes you on a captivating journey through the fascinating world of data. In each episode, we explore a wide range of data topics, from data engineering and data analytics to the art and science of data-driven decision-making.
In the age of information, data is the currency that drives innovation and progress. "Datanation" is your passport to this ever-evolving landscape, where we unravel the mysteries, dissect the trends, and celebrate the breakthroughs shaping the data-driven future.
Join Alex Merced, a seasoned data enthusiast and educator, as he engages in enlightening discussions, informative interviews, and thought-provoking explorations of data concepts and practices. Whether you're a seasoned data professional, a curious tech enthusiast, or someone simply intrigued by the power of data, this podcast offers valuable insights and knowledge.
Find all episodes at: https://host.alexmercedpodcast.com/series/datanation/
Follow Alex on Twitter @amdatalakehouse
Find Alex's Blogs and Social Links at AlexMerced.com
Episodes

Jun 28, 2024
60 – Interoperability of Data Lake Table Format (Apache Iceberg, Apache Hudi, Delta Lake)
Discussion on interoperability of data lake table formats like Apache Iceberg, Apache Hudi, and Delta Lake, highlighting challenges and unique features. Emphasis on making informed architectural decisions in data lake environments.

Jun 25, 2024
#59 – Apache Iceberg Catalogs (Nessie) vs Enterprise Data Catalogs (Collibra)
Dive into the intriguing world of data catalogs as different frameworks are explored. Learn how enterprise data catalogs like Alation serve as knowledge bases, while Apache Iceberg catalogs optimize metadata for efficient querying. Discover the unique features of Iceberg, including versioning and branching capabilities that enhance data governance. The discussion highlights the convergence of governance features in both catalog types and the distinctions in their purposes for users and tools. Stay updated on the evolving catalog ecosystem and promising projects!

Jun 12, 2024
58 – Databricks Announcements (Open Source Unity Catalog, Liquid Clustering, Nvidia)
Alex Merced discusses some of the Databricks announcements at the Data + AI Summit. Follow Alex by visiting https://bio.alexmerced.com/data

Jun 5, 2024
57 – Databricks buys Tabular
I talk about the big news of the day. Follow me on Twitter @amdatalakehouse

Jun 4, 2024
56 – Open Source Apache Iceberg Catalogs (Nessie, Polaris, Gravitino)
Alex Merced discusses the value of open source Apache Iceberg catalogs in creating a truly open lakehouse environment without vendor lock-in. Check out my article on the subject: https://open.substack.com/pub/amdatalakehouse/p/open-source-table-format-open-source?r=h4f8p&utm_campaign=post&utm_medium=web&showWelcomeOnShare=true Follow me on Twitter at @amdatalakehouse

May 16, 2024
55 – Discussing the Apache Iceberg Kafka Connect Connector
In this episode, we delve into the Apache Iceberg Kafka Connector, a critical tool for streaming data into your data lakehouse. We’ll explore how this connector facilitates seamless data ingestion from Apache Kafka into Apache Iceberg, enhancing your real-time analytics capabilities and data lakehouse efficiency. We’ll cover: Join us to understand how the Apache Iceberg […]

Apr 20, 2024
54 – Major Architectural Differences between Apache Iceberg and Delta Lake (Partition Evolution and Hidden Partitioning)
Alex Merced discusses some of the major differences in how Apache Iceberg and Delta Lake work that lead to: Follow me on social https://bio.alexmerced.com/data

Apr 17, 2024
53 – Why Do Snowflake Bills Get So Large?
Alex Merced discusses the mistakes that make Snowflake bills get so large.
Hands-On Lakehouse Laptop Exercises:
– MongoDB with Dremio: https://bit.ly/am-mongodb-dashboard
– SQLServer with Dremio: https://bit.ly/am-sqlserver-dashboard
– Postgres with Dremio: https://bit.ly/am-postgres-to-dashboard
https://bio.alexmerced.com/data

Mar 28, 2024
52 – Apache Iceberg, Dremio and PuppyGraph
Discussing the benefits of Apache Iceberg's open data ecosystem. Exploring graph data processing with Dremio, PuppyGraph, and Apache Iceberg. Efficiency and flexibility of Apache Iceberg for data lakes, overcoming data duplication challenges and enabling diverse data modeling possibilities.

Mar 25, 2024
#1 – intro to catalogs, manifests and metadata. Oh my!
Alex Merced introduces his new podcast exploring open-source data projects like Apache Iceberg. The episode discusses the importance of catalogs, manifests, and metadata in developing advanced data systems affordably. Listeners are encouraged to subscribe for future in-depth explorations of open source project architectures.


