

Super Data Science: ML & AI Podcast with Jon Krohn
Jon Krohn
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact. Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy. We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship: everything you need to crush it with data science.
Episodes

Sep 15, 2023 • 37min
714: Using A.I. to Overcome Blindness and Thrive as a Data Scientist
Tim Albiges discusses how blind individuals can excel in data science, using machine learning in healthcare. Topics include adaptive communication tools, applying AI to diagnose respiratory diseases, and empowering the visually impaired with cutting-edge technologies like object recognition and OCR.

Sep 12, 2023 • 1h 26min
713: Llama 2, Toolformer and BLOOM: Open-Source LLMs with Meta's Dr. Thomas Scialom
Dr. Thomas Scialom discusses the open-source LLMs Llama 2, Toolformer, and BLOOM. Topics include AGI, RLHF, and advice for AI entrepreneurs. The conversation explores Toolformer's capabilities, the Galactica project, and the responsible use of AI models, with insights on developing large-scale AI projects and the future of the AI industry.

Sep 8, 2023 • 7min
712: Code Llama
In this episode, host Jon Krohn explores Code Llama, Meta's new language model designed for data scientists and coders. Code Llama offers code completion and debugging features across multiple programming languages and includes a version specialized for Python.

Sep 5, 2023 • 1h 26min
711: Image, Video and 3D-Model Generation from Natural Language, with Dr. Ajay Jain
Dr. Ajay Jain, Co-Founder of Genmo.ai, discusses creative general intelligence in the video industry. Topics include multimodal models, Denoising Diffusion Probabilistic Models, Neural Radiance Fields, pedestrian behavior prediction at Uber, and cost-saving techniques in model training.

Sep 1, 2023 • 1h 3min
710: LangChain: Create LLM Applications Easily in Python
Kris Ograbek discusses LangChain, niching down, and continuous improvement in AI. They touch on data preprocessing, word embeddings, and ChatGPT for enhancing daily interactions. The conversation also explores the transition to hosting the Super Data Science podcast and the importance of vector embeddings for large language models.

Aug 29, 2023 • 1h 21min
709: Big A.I. R&D Risks Reap Big Societal Rewards, with Meta's Dr. Laurens van der Maaten
Dr. Laurens van der Maaten from Meta delves into the transformative power of AI innovation, covering topics like t-SNE dimensionality reduction, protein synthesis, climate change mitigation, and wearable materials simulation. He discusses large-scale learning of image recognition models, A.I. for protein models, privacy-preserving ML frameworks, concerns about adversarial examples, and making a big impact in AI research.

Aug 25, 2023 • 23min
708: ChatGPT Code Interpreter: 5 Hacks for Data Scientists
Discover five essential hacks for data scientists using ChatGPT's Code Interpreter: getting the most out of the GPT-4 model, data preprocessing, model training, error identification, and code explanation, all driven by natural language instructions and visual outputs.

Aug 22, 2023 • 1h 47min
707: Vicuña, Gorilla, Chatbot Arena and Socially Beneficial LLMs, with Prof. Joey Gonzalez
Professor Joey Gonzalez discusses developing models and platforms that leverage and improve LLMs, including Vicuña and Chatbot Arena. They delve into open vs closed-source LLMs, the future impact of AI on society, and advancements in LLMs that call APIs. The conversation touches on the significance of the Berkeley AI Research Lab and evaluating model performance in long context windows.

Aug 18, 2023 • 33min
706: Large Language Model Leaderboards and Benchmarks
Caterina Constantinescu discusses Large Language Models (LLMs), leaderboard comparisons, evaluation challenges, dataset contamination, and platforms like HELM and Chatbot Arena. Learn about Llama 2, benchmark evolution, user preferences in chatbots, human feedback for model improvement, and the impact of perception on model evaluations.

Aug 15, 2023 • 1h 29min
705: Feeding the World with ML-Powered Precision Agriculture
Join Feroz Sheikh, Jeremy Groeteke, and Thomas Jung from Syngenta Group to explore ML in agriculture. Topics include generative chemistry, designing ML models for farming, and the social impact of data science in agriculture.


