

Super Data Science: ML & AI Podcast with Jon Krohn
Jon Krohn
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact.Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy.We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.
Episodes
Mentioned books

Dec 9, 2022 • 7min
634: Model Error Analysis
Data scientist and author Serg Masís discusses the importance of model error analysis with Jon Krohn. They emphasize the need to go beyond conventional metrics, incorporate uncertainty in predictions, and introduce the Responsible AI Toolkit by Microsoft for enhancing model performance.

Dec 6, 2022 • 54min
633: Responsible Decentralized Intelligence
Award-winning professor and tech entrepreneur Dawn Song joins Jon Krohn to discuss Responsible Decentralized Intelligence. Topics include homomorphic encryption, differential privacy, multi-party computation, PrivateSQL, deep learning, federated learning, and the concept of a responsible data economy.

Dec 2, 2022 • 11min
632: Liquid Neural Networks
Dr. Adrian Kosowski, Co-Founder of Pathway.com, discusses liquid neural networks and their impact on data analytics. The podcast explores liquid neural networks inspired by C. elegans, challenges in mimicking biological learning processes, and the revolutionizing potential of liquid neural networks in machine learning.

Nov 29, 2022 • 59min
631: Data Analytics Career Orientation
Jon Krohn chats with Luke Barousse, a YouTuber helping data science enthusiasts. They discuss career tips, funny data memes, entry-level skills, web scraping libraries, mistakes in data science, Luke's submarine experience, and essential data analyst skills.

Nov 25, 2022 • 6min
630: Resilient Machine Learning
Dr. Dan Shiebler, ML expert, discusses resilient ML at ODSC with Jon Krohn. They cover strategies like defaults & fallbacks to ensure models work in production despite missing features or varying conditions.

Nov 22, 2022 • 1h 11min
629: Software for Efficient Data Science
Dr. Jodie Burchell, data science developer advocate for JetBrains, shares tips on real-world data preparation, favorite Python libraries, reproducible data science workflows, and insights into the role of a data science developer advocate. The episode explores JetBrains' developer tools, the challenges of working with messy real-world data, and efficient collaboration tools for data scientists.

Nov 18, 2022 • 5min
628: The Critical Human Element of Successful A.I. Deployments
Guest Keith McCormick, author and data scientist, discusses the key trend of trust in the relationship between humans and algorithms in successful AI deployments. They explore the balance between accuracy and interpretability in AI models, emphasizing the importance of feedback loops for data integrity.

Nov 15, 2022 • 1h 31min
627: AutoML: Automated Machine Learning
Erin LeDell, Chief ML Scientist at H2O.ai, discusses AutoML, admissible ML, and inclusivity. Topics include AutoML benefits, genetic algorithms, No Free Lunch Theorem, and addressing bias in datasets. Erin shares insights on founding R-Ladies Global and Women in ML & DS, promoting diversity. The episode provides technical insights and a broader perspective on the future of data science.

9 snips
Nov 11, 2022 • 7min
626: Subword Tokenization with Byte-Pair Encoding
The podcast discusses word, character, and subword tokenization in NLP, highlighting the benefits of subword tokenization. Byte pair encoding is explored as a key method in leading NLP models.

Nov 8, 2022 • 1h 4min
625: Analyzing Blockchain Data and Cryptocurrencies
Kim Grauer, Director of Research at Chainalysis, discusses real-time economic-data analytics on blockchain, ML for predicting criminal patterns, crime investigation use cases, tools she uses daily, and the future of crypto and data science.


