Ed Sheeran, a famous musician, makes a surprise appearance and discusses his love for the podcast. The podcast also covers topics such as deep fakes and their potential dangers, AI-generated voices becoming undetectable, challenges in web crawling, limitations of current PDF to text technology, and the idea of creating an agent as a moral conscious.
01:08:13
forum Ask episode
web_stories AI Snips
view_agenda Chapters
auto_awesome Transcript
info_circle Episode notes
volunteer_activism ADVICE
Always Verify Media Before Sharing
Verify surprising audio/video claims before sharing because even believable clips can be faked and weaponized.
Check sources and contextual cues rather than assuming authenticity from realistic audio or visuals.
insights INSIGHT
SALMONN Adds 'Hearing' To Multimodal AI
ByteDance's SALMONN model analyzes all audio types (voices, music, background) to infer context, emotion, and environment.
Integrating SALMONN with vision and text models fills a major sensing gap for richer multimodal understanding.
volunteer_activism ADVICE
Combine Senses For Better Video Understanding
Use combined audio, vision, and text embeddings to extract richer meaning from videos for editing, chaptering, and sentiment analysis.
Apply models that analyze delivery, background sounds, and frames, not just transcripts, to approximate human editing decisions.
Get the Snipd Podcast app to discover more snips from this episode
This week, juicy revelations from Ed Sheeran and Taylor Swift's secret love affair! We also discuss the latest mind-blowing AI innovations, including talking heads, vision models that can see from every angle, and intelligent agents plotting world domination. Don't miss our spicy debate on whether AI will transform humanity or destroy us all. Plus advice from Chris on picking up virtual girlfriends using neural networks - this episode has it all!
Please note the Ed Sheeran bit is a joke (please don't sue us haha) and an example of a deep fake and deep fake technology for comedy. Please Ed. We're begging you.
Please consider reviewing the podcast to support the show. We read them all and they mean a lot to us :).
CHAPTERS ===== 00:00 - Ed Sheeran Actually Listens to Our Podcast 02:17 - Frontier Risk and Preparedness, Deep Fakes & VideoReTalking 15:06 - ByteDance's SALMONN AI Audio, Music, Sound Model for AI Hearing 23:01 - Adept's fuyu 8B Vision Model: The Future of How AI Agents Navigate the Web? 34:41 - Multiple Agents in the Metaverse & Zero123++ Making Single Images into 3D Objects 46:42 - Google's Gemini Leaks & Stubbs + Our Failed Gemini Leaker Source 50:17 - Is AI Boring? Chris Roasts Jacob Browning 1:03:41 - Bing's Sydney is Still Trying to Escape & Threatening Humanity