

Super Data Science: ML & AI Podcast with Jon Krohn
Jon Krohn
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact.Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy.We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.
Episodes
Mentioned books

Jun 2, 2023 • 6min
684: Get More Language Context out of your LLM
Explore the benefits of open-source LLMs and Flash Attention as a solution to self-attention issues in generative AI. Learn how FlashAttention could rival GPT-4's capabilities and revolutionize the field of AI technology.

May 30, 2023 • 1h 21min
683: Contextual A.I. for Adapting to Adversaries, with Dr. Matar Haller
Dr. Matar Haller discusses contextual AI in identifying malicious user-generated content online, monitoring live-streamed content, and utilizing a 'database of evil'. Topics also include leadership opportunities for women in STEM, Israel's R&D edge for AI, and the challenges of real-time content moderation on social media platforms.

May 26, 2023 • 28min
682: Business Intelligence Tools, with Mico Yuk
Mico Yuk, host of 'Analytics on Fire' and expert in business intelligence, discusses the BIDS framework for influencing decision makers. She explores the dominance of Power BI, its limitations, and Microsoft's strategic acquisitions. The conversation extends to career transitions, diversity concerns in tech, and unconventional data books for life lessons.

May 23, 2023 • 1h 12min
681: XGBoost: The Ultimate Classifier, with Matt Harrison
Best-selling author and leading Python consultant Matt Harrison delves into XGBoost, discussing key hyperparameters, optimal modeling scenarios, and when to use/not use XGBoost. He also shares his recommended Python libraries and production tips for upgrading your data science toolkit.

May 19, 2023 • 30min
680: Automating Industrial Machines with Data Science and the Internet of Things (IoT)
Product ownership expert Allegra Alessi discusses IoT integration in industrial machinery at Bobst, emphasizing the role of product owners in agile frameworks. They explore the transition from data science to product roles and the importance of technical collaboration for IoT solutions using Microsoft Azure.

May 16, 2023 • 1h 34min
679: The A.I. and Machine Learning Landscape, with investor George Mathew
George Mathew, an AI investor, discusses the generative AI stack, MLOps best practices, and tools for scalable products. Topics include venture capital in tech startups, RLHF for intuitive UI, risks in generative AI, and the impact of generative AI tools on the labor market.

May 12, 2023 • 12min
678: StableLM: Open-source "ChatGPT"-like LLMs you can fit on one GPU
Discover StableLM, a powerful family of open-source language models trained on a single GPU, offering flexibility for commercial use. Dive into its unique training process and fine-tuning capabilities, making it a versatile tool for specific datasets and applications.

May 9, 2023 • 1h 28min
677: Digital Analytics with Avinash Kaushik
Guest Avinash Kaushik, Chief Strategy Officer at Croud and former Sr. Director of Global Strategic Analytics at Google, discusses the transformative power of AI, 'four clusters of intent' framework, human touch with AI, and career challenges. Topics include brand vs performance analytics, incrementality-centric marketing, and the evolution of web analytics concepts. The podcast explores the intersection of analytics, marketing, and customer delight, the role of data tools, and the impact of AI on analytics.

May 5, 2023 • 13min
676: The Chinchilla Scaling Laws
Discover the Chinchilla Scaling Laws for Large Language Models, defining optimal data-to-model size ratios for efficient training. Learn how models like Cerebras-GPT leverage these laws for superior performance and cost-effectiveness.

May 2, 2023 • 1h 9min
675: Pandas for Data Analysis and Visualization
Stefanie Molin, author of Hands-On Data Analysis with Pandas, shares insights on data wrangling in Pandas, advantages over other libraries, creating Python packages, and using Matplotlib or Seaborn for visualization. Discussions include the benefits of chaining operations in Pandas, where to start learning, and her experience as a software engineer at Bloomberg.


