Superhuman AI: Decoding the Future

Nano Banana Breakthrough: The Future of AI Images - Naina Raisinghani & Philipp Lippe, DeepMind

37 snips
Sep 25, 2025
Naina Raisinghani, a product lead at Google DeepMind, and Philipp Lippe, a researcher in multimodal AI, dive into the groundbreaking Nano Banana technology. They discuss how it achieves character consistency across various edits and its real-world applications like virtual try-ons and enhanced ads. Philipp highlights speed improvements that allow for nearly instantaneous image generation. The duo also shares unexpected user trends, including emotional photo restorations, and looks ahead to unified models that integrate text, images, and more.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Multimodality Improves Communication And Learning

  • Multimodality matters because visual inputs change how people ask and understand problems, improving education and communication.
  • Philipp notes images can convey solutions faster than long textual explanations in geometry or diagrams.
ADVICE

Get Fast Design Feedback From Images

  • Use screenshots and mockups as inputs to get design feedback and rapid UX iteration from the model.
  • Hassan shows how asking Gemini to 'think deeply' yields UI improvement suggestions and simplified CTAs.
INSIGHT

Key Technical Gaps Left

  • Remaining technical gaps include text rendering in images and 'no-op' edit failures that skip requested changes.
  • Philipp says improving consistent composition and reducing failed edits are priority follow-ups.
Get the Snipd Podcast app to discover more snips from this episode
Get the app