Google AI: Release Notes

Google AI
27 snips
Nov 25, 2025 • 42min

Gemini 3: Launch day reactions

Tulsi Doshi and Josh Woodward join to discuss the exciting launch of Gemini 3. Tulsi, a product lead for generative AI models, shares innovative capabilities like multimodal understanding and agentic features. Josh highlights Gemini's integration across Google surfaces for developer access. They dive into real-world applications, such as transforming handwritten recipes into interactive apps and rapid game development. The duo also reflects on balancing model performance with accessibility, and how user feedback drives continuous improvements.
36 snips
Oct 16, 2025 • 48min

How a Moonshot Led to Google DeepMind's Veo 3

Dumi Erhan, co-lead of the Veo project at Google DeepMind, shares his extensive expertise in video-generation research. He delves into the fascinating journey of the Veo project, from its moonshot beginnings to the groundbreaking Veo 3 model with audio capabilities. Dumi discusses the challenges of long-duration video coherence and the impact of user feedback on future developments. He also explores the complexity of image-to-video generation and highlights innovative prompting methods that enhance user control.
45 snips
Sep 15, 2025 • 37min

GDM’s Pushmeet Kohli on solving science's biggest challenges with AI

Pushmeet Kohli, Head of Science and Strategic Initiatives at Google DeepMind, discusses the groundbreaking intersection of AI and science. He dives into transformative models like AlphaFold, showcasing their potential to revolutionize scientific discovery. Kohli emphasizes how AI can democratize research through tools like AI Co-scientist, enabling wider participation. The conversation also touches on the collaborative efforts behind these innovations and their significant impact on solving complex challenges in mathematics and biology.
49 snips
Aug 27, 2025 • 31min

Behind the scenes of Google's state-of-the-art "nano-banana" image model

Nicole Brichtova and Mostafa Dehghani from Google's Gemini team dive into the features of their image model, Gemini 2.5 Flash Image, nicknamed "nano-banana". They discuss how the model enables intricate edits through interleaved generation and how it maintains character consistency across those edits. The duo also reflects on text rendering and the role of user feedback in shaping future advances in image generation.
144 snips
Aug 11, 2025 • 31min

Demis Hassabis on shipping momentum, better evals and world models

Demis Hassabis, CEO of Google DeepMind, dives into the evolution of AI from gaming to advanced thinking models. He discusses Genie 3 and its role in building world models that enable AI to grasp reality better. The conversation also touches on the necessity for improved evaluation methods through platforms like Kaggle’s Game Arena, as well as the integration of tool use in AI systems. Hassabis shares insights on scaling AI and the exciting future applications that could emerge from these advancements.
89 snips
Aug 6, 2025 • 40min

Building real-time voice applications with Live API

Shrestha Basu Mallick, product lead for the Gemini API at Google, discusses the Gemini Live API and its real-time audio capabilities. She explains how proactive audio and async functions enhance user interaction. Topics include the importance of audio as an interface, imaginative use cases in applications like Photoshop, and some lighthearted banter about the constellation Gemini and development quirks.
53 snips
Jul 23, 2025 • 43min

Building a frontier AI search experience

Robby Stein, VP of Product for Google Search, dives into the transformation of Search into a cutting-edge AI product. He discusses the shift from basic keyword searches to interactive, conversational queries that can handle complex tasks. Stein highlights AI Mode and the role of Deep Search in personalizing user experiences. The conversation also touches on the emergence of visual and speech-based search, showing how the platform aims to let users 'ask anything' and use real-time AI tools for everyday tasks.
54 snips
Jul 2, 2025 • 44min

Gemini's Multimodality

Ani Baddepudi, the Product Lead for Gemini Model Behavior, shares her insights on the groundbreaking multimodal capabilities of Gemini. She explains why Gemini was designed as a multimodal model from the start, emphasizing its vision-first approach. The conversation dives into the intricacies of video and image understanding, showcasing advancements in higher FPS video sampling and tokenization methods. Ani also discusses the future of proactive AI assistants and the collaborative efforts behind Gemini’s evolution, revealing exciting possibilities for intuitive AI interactions.
11 snips
Jun 16, 2025 • 1h

Building Gemini's Coding Capabilities

Connie Fan, Product Lead, and Danny Tarlow, Research Lead for Gemini's coding capabilities, dive into the creation of groundbreaking AI coding models. They discuss the importance of foundational goals, the rise of 'vibe coding,' and its transformative effects on development. The duo explores strategies for managing large codebases and how Gemini's framework aims to democratize technology access. They also envision a future where coding tools evolve to meet complex user needs, fostering creativity and productivity in programming.
35 snips
Jun 16, 2025 • 27min

Sergey Brin on the Future of AI & Gemini

Join Sergey Brin, co-founder of Google and a pioneer in computer science, as he delves into the cutting-edge developments of Gemini. He shares insights on the innovative core text models and the integration of native audio, revealing how these advancements enhance storytelling. Brin discusses the rapid evolution of AI, the surprises in recent developments compared to past expectations, and the critical journey towards improved reasoning capabilities. With a focus on Google's vibrant startup culture, his enthusiasm for AI innovation is palpable.
