

The Data Exchange with Ben Lorica
Ben Lorica
A series of informal conversations with thought leaders, researchers, practitioners, and writers on a wide range of topics in technology, science, and of course big data, data science, artificial intelligence, and related applications. Anchored by Ben Lorica (@BigData), the Data Exchange also features a roundup of the most important stories from the worlds of data, machine learning and AI. Detailed show notes for each episode can be found on https://thedataexchange.media/ The Data Exchange podcast is a production of Gradient Flow [https://gradientflow.com/].
Episodes
Mentioned books

Apr 15, 2021 • 35min
How Technology Companies Are Using Ray
In this episode of the Data Exchange, I speak Zhe Zhang, Engineering Manager at Anyscale where he leads the team that works on the Ray and its ecosystem of libraries and partners. Ray is an open source, general purpose framework for building distributed applications (more details in this post and video).Subscribe: Apple • Android • Spotify • Stitcher • Google • RSS.Detailed show notes can be found on The Data Exchange web site.Subscribe to The Gradient Flow Newsletter.

Apr 8, 2021 • 41min
Data quality is key to great AI products and services
In this episode of the Data Exchange, I speak with Abe Gong, CEO and co-founder at Superconductive, a startup founded by the team behind the Great Expectations (GE) open source project. GE is one of a growing number of tools aimed at improving data quality through tools for validation and testing. Other projects in this area include TensorFlow DV, assertr, dataframe-rules-engine, deequ, data-describe, and Apache Griffin.Subscribe: Apple • Android • Spotify • Stitcher • Google • RSS.Detailed show notes can be found on The Data Exchange web site.Subscribe to The Gradient Flow Newsletter.

Apr 1, 2021 • 43min
Machine Learning in Healthcare
In this episode of the Data Exchange, I speak with Parisa Rashidi, Associate Professor at the Department of Biomedical Engineering at University of Florida. Parisa is a computer scientist and machine learning researcher who specializes in applications of ML to healthcare and biomedical domains.Subscribe: Apple • Android • Spotify • Stitcher • Google • RSS.Detailed show notes can be found on The Data Exchange web site.Subscribe to The Gradient Flow Newsletter.

Mar 25, 2021 • 41min
Measuring the Impact of AI and Machine Learning Research
In this episode of the Data Exchange, our special correspondent and managing editor Jenn Webb organized a mini-panel composed of myself and Simon Rodriguez, Data Research Assistant at the Center for Security and Emerging Technology (CSET) at Georgetown University. Through a series of reports and data briefs, CSET provides policymakers with data rich material to inform and guide public policy.Subscribe: Apple • Android • Spotify • Stitcher • Google • RSS.Detailed show notes can be found on The Data Exchange web site.Subscribe to The Gradient Flow Newsletter.

Mar 18, 2021 • 44min
The Mathematics of Data Integration and Data Quality
In this episode of the Data Exchange, I speak with Ryan Wisnesky, CTO and co-founder of Conexus, a startup that uses techniques from mathematics and incorporates them into novel tools for data integration, data management, and knowledge management.Subscribe: Apple • Android • Spotify • Stitcher • Google • RSS.Detailed show notes can be found on The Data Exchange web site.Subscribe to The Gradient Flow Newsletter.

Mar 11, 2021 • 46min
Pricing Data Products
In this episode of the Data Exchange, I speak with Jian Pei, Professor, School of Computing Science, Simon Fraser University. His research spans data science, big data, data mining, and database systems. But in this podcast we talk about tools for estimating the economic value of data. Subscribe: Apple • Android • Spotify • Stitcher • Google • RSS.Detailed show notes can be found on The Data Exchange web site.Subscribe to The Gradient Flow Newsletter.

Mar 4, 2021 • 52min
Challenges, Opportunities, and Trends in EdTech
In this episode of the Data Exchange, our special correspondent and managing editor Jenn Webb and I speak with Sharon Zhou, a PhD student in Computer Science at Stanford University. Sharon has been teaching very popular courses on GANs (generative adversarial networks) on Coursera. In this conversation we examine the state of Education Technology (EdTech), learning platforms, and other tools for teaching online. A year into the global pandemic, we discuss advantages and disadvantages of various technologies for delivering classes, as well as broader issues in education.We also took the opportunity to discuss Sharon’s work on deep learning, including her work using GANs to help the general public and policy makers to better understand the implications of climate change.Subscribe: Apple • Android • Spotify • Stitcher • Google • RSS.Detailed show notes can be found on The Data Exchange web site.Subscribe to The Gradient Flow Newsletter.

Feb 25, 2021 • 42min
Towards Simple, Interpretable, and Trustworthy AI
In this episode of the Data Exchange I speak with Sheldon Fernandez, CEO at Darwin AI, and Alex Wong, Professor at the University of Waterloo, Co-Founder of DarwinAI (Chief Scientist) and Euclid Labs.Subscribe: Apple • Android • Spotify • Stitcher • Google • RSS.Detailed show notes can be found on The Data Exchange web site.Subscribe to The Gradient Flow Newsletter.

Feb 18, 2021 • 30min
The Rise of Metadata Management Systems
In this episode of the Data Exchange, our special correspondent and managing editor Jenn Webb organized a mini-panel composed of myself and Assaf Araki, investment manager at Intel Capital. Assaf and I have written a series of articles and this interview took place shortly before the release of our most recent collaboration: The Growing Importance of Metadata Management Systems. We devote this episode to how metadata management will impact many enterprise data systems.Subscribe: Apple • Android • Spotify • Stitcher • Google • RSS.Detailed show notes can be found on The Data Exchange web site.Subscribe to The Gradient Flow Newsletter.

Feb 11, 2021 • 42min
Tools for building robust, state-of-the-art machine learning models
In this episode of the Data Exchange I speak with Michael Mahoney, a researcher at UC Berkeley’s RISELab, ICSI, and Department of Statistics. Mike and his collaborators were recently awarded one of the best papers awards at NeurIPS 2020, one of leading research conferences in machine learning.Subscribe: Apple, Android, Spotify, Stitcher, Google, and RSS.Download the 2021 Trends Report: Data, Machine Learning, AI and learn emerging technologies for data management, data engineering, machine learning, and AI.Detailed show notes can be found on The Data Exchange web site.Subscribe to The Gradient Flow Newsletter.


