ThursdAI - The top AI news from the past week

📆 ThursdAI - Oct 23: The AI Browser Wars Begin, DeepSeek's OCR Mind-Trick & The Race to Real-Time Video

118 snips
Oct 24, 2025
Paul Klein, founder of BrowserBase, discusses agentic browser automation and the intriguing integration with 1Password that enhances secure access for browsing agents. Joining him is Quinn Kramer, an expert in real-time video technology, who explores the architecture behind low-latency multimodal interactions and the fascinating world of real-time lip sync for avatars. The conversation dives into the revolutionary DeepSeek OCR model, highlighting its innovative text compression methods that could redefine how we process images.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Tokenization May Be Due For Change

  • Multiple groups (DeepSeek, Zifu/Glyph) converged on visual text compression, indicating tokenization may be suboptimal today.
  • This convergence hints at a broader shift toward visual representations to scale context windows.
ANECDOTE

Atlas Agent Tried On Compliance Training

  • Alex used ChatGPT Atlas for hours and ran an agent to try to complete a compliance training, finding it capable but flaky.
  • He was impressed by Atlas's deep ChatGPT integration despite some annoyances and agent supervision needs.
INSIGHT

Agentic Browsers Change Security Landscape

  • Atlas gives agents access to logged-in sessions and browser cookies, raising new security and injection risks.
  • OpenAI includes mitigations like private mode and watch mode, but agents still need careful supervision.
Get the Snipd Podcast app to discover more snips from this episode
Get the app